Project summary
Dharitrikumari Rathod
COVID-19 Vaccination Visualization










2020 U.S.
Presidential
Election Analysis
Introduction:
- Analysis of 2020 Presidential debates and townhall meetings
-
Comparison of 2020 Presidential debates and 2016 presidential debates
| Research questions: | Method |
|---|---|
| 1. What were the most prominent words said by each candidate during the different stages of debate during the 2020 and 2016 presidential debates? | Word cloud Bigram |
| 2. Which candidate was most talkative during the debate? | Statistics |
| 3. How many times did they get interrupted by others? | Heatmap |
| 4. What were the positive and negative sentiments of the 2020 debate? | Sentiment analysis |
| 5. Which topics were discussed during the 2020 Presidential debates? | TF-IDF, Cosine similarity |
| 6. How does the similarity of the candidate’s speech compare to the other candidate on the same topic? | TF-IDF, LDA model |
| 7. How does the similarity between President Trump's speech during 2020 and 2016? | TF-IDF, LDA model |
Most prominent words: 2020
Biden 2020
Trump 2020
Most Prominant topics: 2020
Bigram: Joe Biden
| Topics |
|---|
| Green infrastructure |
| Social security |
| Mail in ballot |
| Health insurance/ Affordable care act |
| First responders |
| Tax plans |
| Extracted Topics |
|---|
| New york |
| Individual mandate |
| Law enforcement |
| Stock market |
| Justice reform |
| Economy |
| Small businesses |
| Oil industry |
| Forest Management |
| Obamacare/Health Insuran |
#2 Most talkative in 2020:
Debate_1
Debate_2
Crosstalk/Heat moment

Sentiment Analysis:
1st_debate
Overall 2020
2nd_debate
TF-IDF vectorizer
TF-IDF 1st debate
Biden
Trump
TF-IDF 2nd debate

TF-IDF Topics
TFIDF and Cosine similarities
Built the TF-IDF Model to check if the given topic was discussed in the 2020 election or not. Also, compared the topic using cosine similarity at each stage of the Presidential election 2020.
Extracted Topics TFIDF

Bag of words
TF-IDF


Topic extract
What topics?
| BoW | TF-IDF |
|---|---|
| Second, insurance, healthcare, obamacare | Industry, child, website, reform, school, business |
| Industry, business, website, plague, enormous, nuclear, opportunity, | Talk, totally, month, well, police, discredit |
| Family, Clean, filthy, emission, police, scientist, pollutant, environmental, carbon | Family, immigration, political, protest, tremendous, democratic |
| Ballot, election, inaugural, racist, chance | People, million, evidence, healthcare, opposite, condition, fantastic, nuclear |
| Create, building, question, economic, energy, company | Election, chance, separate |
| Excuse, energy, subsidy, dangerous, federal, border, highway, ecnomically | Ballot, number, answer |
| Statement, plant, pollute, chemical, refinery, superpredator | Crosstalk, economy, segment, disaster, vaccine |
| Crosstalk, Hispanic, environment, global warming, gasoline, unemployment | Second, dollar, radical, deserve |
| Dollar, fracke, billion, emission, climate, market | Support, energy, federal, |

HEART DISEASE PREDICTION
Goal: For the Heart Disease Dataset, we want to classify the presence of heart disease using the best combinations of the dataset files (Cleveland, Hungarian, Switzerland, VA) based on the most relevant patient attributes.
Procedure:
- Decided to test different combination of databases rather than only using the Cleveland dataset.
- Imputing missing values.
- Results would show if we can achieve better results for classification.



Clustering on Warehouse data






EV project:
- 944 sample meters fed into model.
- 167 classified as having Level 2 charger (17.7%)
- 777 classified as not having Level 2 charger


EV Owners incentivized to install a solar system:
Solar home ~236.4 V
Non Solar home ~242.7 V
Rebates, Tax credits
Midnight - 7 am
Thank you !
Copy of 2020 U.S Presidential Election Analysis - Team 10
By rathod_dharitri
Copy of 2020 U.S Presidential Election Analysis - Team 10
CAP6307 - Textmining (UCF MSDA)
- 223