Giter Site home page Giter Site logo

patilbp / bda594-project-covid-19-analysis Goto Github PK

View Code? Open in Web Editor NEW
1.0 1.0 0.0 6.53 MB

https://sites.google.com/sdsu.edu/covid19/main --- Clean data tables, learn relationships, analyze demographics, measure covid cases and deaths as a spatial time series, assess whether trends exist between demographics and cases/vaccine count, basic text analysis on tweets concerning covid-19 topics.

bda594-project-covid-19-analysis's Introduction

BDA594-Project-Covid-19-Analysis

Project Dashboard Visualization: Tableau Dashboard Link

Project Data Sources:

The vaccination dataset is a time series showing the number of vaccines administered, by U.S. County, since January 2021. It provides detailed descriptions of vaccination numbers by age group and frequencies for those partially and fully vaccinated.

The COVID-19 case count and death table shows the number of cases and deaths in all U.S. counties beginning on January 21, 2020; and updating to the current date. The data were published to GitHub by the New York Times.

The demographics table was collected from the five-year American Community Survey published by the Census Bureau from 2014-2019 (their most recent publication). Units of interest are U.S. Counties using variables measuring median age, median household income, educational attainment and racial composition.

These three tables are linked together by a primary key identifying a county’s FIPS code, permitting relationships and analysis to be conducted between tables inside our COVID-19 database. Analysis will be centered on how COVID-19 spread through counties and what characteristics were shared among worst-impacted counties. Furthermore, what characteristics are shared among counties with high- and low-vaccination rates?

The final table contains text scraped from Twitter, hosted on Kaggle, that analyzes people’s views about the vaccines from a global perspective. Capturing the human response to a suddenly changed world, this table provides a unique qualitative perspective in understanding the spread of disinformation, relief and the “return to normalcy”. It contains approximately 300,000 tweets collected over time from August 2020.

Responsibilities performed in this project were:

  1. Understand and clean the data tables, understand relationships connecting them to theme of project
  2. Analyze demographic characteristics of U.S. counties
  3. Measure Covid cases and deaths reported in U.S. counties as a spatial time series (how do cases and deaths change on the map over time?)
  4. Measure Covid vaccine counts by U.S. counties: which counties boast high vaccine-counts relative to population? Low-vaccine counts?
  5. Assess whether trends exist between county demographics and confirmed Covid cases/vaccine count
  6. Conduct basic text analysis on tweets concerning Covid-19 topics beginning in August 2020

Project Dashboard Visualization: Tableau Dashboard Link

Tableau_Dashboard_9th Dec 2021

bda594-project-covid-19-analysis's People

Contributors

patilbp avatar

Stargazers

Haibing Yan avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.