top of page

Hello. We are the Gold Diggers.

And this is our project, Analysis on Tweets Supposing the "Golden Era" during the FEM regime, where we dig through the reasons why the "Golden Era" is regarded as it is—hence, our name.

CS132-LOGO.png

About Us

We are a group of BS Computer Science students from the University of the Philippines, Diliman. For our final project in our course, Introduction to Data Science, we prepared this study and presented it during Watch The Facts! A Mini-Conference on Data Science and Disinformation in the Philippines. 

Atienza, Carmelo Ellezandro_Wisdom.jpg

Carmelo Atienza

Hey! I'm Melo. I'm generally interested in emerging technologies and consumer tech. I also enjoy watching NBA (Warriors Bandwagon since 2016), playing RPGs and FPS games, and consuming anime and manga.  

cratienza1@up.edu.ph | 09292942886

received_278560270586780_edited.jpg

Pauline Abcede

Annyeong! I’m Pau, a BS Computer Science student sa umaga, Virtual Assistant of an American influencer sa gabi, and zombie sa madaling araw. In my free time, I’m your basic B–hanging out with friends, listening to KPop, watching KDramas and anime, and reading manga.

mpabcede1@up.edu.ph | 09760314172

322485324_677016920503450_630170431525211851_n.jpg

Calvin James Maximo

Hi! I'm CJ. I like playing FPS and racing games (my faves are Apex and Forza). I play the guitar, and I am a bit fast at solving a Rubik's Cube (check out my WCA profile 2018MAXI03 :>) I also like watching K-Dramas and sports such as tennis, F1, and NBA.

ctmaximo1@up.edu.ph | 09272446167

Product

Here's an overview of our Project.

The noise of misinformation and disinformation deafens the history and democracy of our country.

bongbong-marcos-sara-duterte-unity-cavite-province-3_2022-03-30_16-22-53.jpg

Motivation

With another Marcos seated in the position of President, the issue of historical distortion regarding the administration of former-President Ferdinand Marcos is more alarming than ever. As such, our team is motivated to contribute in any way we can in countering the dis/misinformation spreading rapidly through social media.

EA1866A2-B99D-4CB2-B3B89A15768CC312_source.jpg

Problem

As we believe that education is the primary key in fighting dis/misinformation, it is important to identify which areas of learning we may need to work on more intensively to guide us in formulating a proper plan of action to counter the issue of historical distortion.

Capture.JPG

Solution

To find an answer to this question, our team plans to study dis/misinformative tweets implying how FEM's regime was a “Golden era” and identify which reason appears to be their most common claim. This should help us gain insights on how to approach our nation's problem on dis/misinformation regarding the FEM regime through education.

Research Question

What were the most common reasons in dis/misinformative tweets supposing the “Golden era” during FEM regime?

Hypothesis

The dis/misinformative tweets were most likely to claim a common particular reason leading to the supposed “Golden era.”

Null Hypothesis

The dis/misinformative tweets were equally likely to claim various reasons that led to the supposed “Golden era” during the FEM regime.

Solution

Collect dis/misinformative tweets, identify each tweet's main reason for their claim, and tally and rank their reasons according to frequency.

Product

Let's talk about our data science methodology.

Our team worked with pandas and a bunch of other Python libraries in order to obtain and visualize the data that we want to find out.

pandas.jpg

Data Exploration

Tweets were scraped manually with assistance from a third-party API scraper (Apify), prompted with keywords related to our topic. Then, we cleaned, tokenized, and applied Natural Language Processing to our data before using graphs to visualize our dataset. We then manually clustered each tweet according to their reason mentioned.

biuwer_6_business_machine_learning_use_cases_4dba3d6e91.jpg

Machine Learning

To further explore our data, the group also attempted an approach in topic clustering using Machine Learning, on top of our manual clustering of tweets during the data exploration stage. The topics were clustered according to their implied reason for supposing the "Golden era" during FEM's regime, using the unsupervised learning model Latent Dirichlet Allocation (LDA) and t-Distributed Stochastic Neighbor Embedding (t-SNE).

bar-chart (1).png

Statistical Modeling

We will be using the Chi-square goodness of fit test to check if the distribution of our data are equal or not. This perfectly fits with our objective to find out whether there is a common particular reason standing out from the tweets or not.

Product

Here are our key findings.

After exploring, testing, and analyzing our data, we have figured some important realizations.

01

Economy is ranked first among the reasons implied by the dis/misinformative tweets by manual labeling.

02

Statistical testing shows that there is a significant difference between the tweet distribution by reason.

03

Machine learning results show that the reasons being specified in the tweets are widely overlapping.

What's the bottomline?

Ferdinand-Marcos-1983.jpg

Identify

Given our results, it implies that the supposedly good economy and quality of life during FEM's regime are the primary reasons why the "Golden era" is deemed as it is.

dataset-analysis.png

Improve

We first suggest extending the study by increasing the sample size and exploring other machine learning models and approaches to obtain more accurate results.

3545757.jpg

Inform

Once there is enough information, we can take more concrete actions on fighting dis/misinformation through education while taking our data into account in planning.

Our vision for this study is to be able to identify in which particular fields of knowledge do the dis/misinformed individuals about FEM's regime need enlightenment on. By educating them on these topics, we may be able to shed light upon the truth behind the spreading dis/misinformation. The findings may also help in revising lesson plans or maybe even improve the curriculum so that the younger generations may already absorb the right information at an early age, instead of having to correct them later on.

Contact
bottom of page