Python Data Analysis
The purpose of a full Python ETL data analysis is to extract data from some source, transform it, and load it elsewhere. In this project, which is currently in process, I'm taking the following steps:
- Grabbed weather data using the Meteo.com Python API for each voting group's geographical location on their designated day of voting in the 2020 presidential election.
- Downloaded some data from the US Census Bureau.
- Pulled that data into Python
- Reduced the number of total voters to the approximate number having voted in each location on voting day. This was done using percent of voters in each state who voted early, voted by mail, etc.
- Found the ratio between number of approximate voters in each XX vs. the number of registered voters
- Using zip code IDs, merged the weather for each location on voting data with the above ratio
- Visualized a scatterplot using voting ratio compared with temperature, precipitation, humidity, etc. to explore trends regarding weather statistics vs. percent of people who voted out of all those who were registered.