Giter Site home page Giter Site logo

g0v1ndn / covid-sql-analytics Goto Github PK

View Code? Open in Web Editor NEW
2.0 1.0 0.0 23.32 MB

Covid SQL Analytics is a robust data project leveraging PostgreSQL for in-depth global and India-specific Covid-19 analysis. Offering curated queries and valuable insights.

License: MIT License

data-science sql open-source covid19 postgresql

covid-sql-analytics's Introduction

Covid-SQL-Analytics

About:

This repository serves as a resource for SQL-based data analysis using PostgreSQL, focusing on Covid-19 data. It includes the code, curated data, and observations derived from detailed query analysis, making it a valuable reference for those interested in exploring this dataset.

Usage Instructions:

Prerequisites:

  • SQL and PostgreSQL Knowledge: A fundamental understanding of SQL concepts and experience with PostgreSQL will be beneficial as you navigate through the queries and analysis presented here.
  • Development Environment: Set up a development environment with PostgreSQL installed. You can download PostgreSQL from its official website and follow the installation instructions provided.
  • Basic Data Analysis Skills: Familiarity with data analysis concepts will help you better understand the insights derived from the queries and observations shared in this repository.

Downloading or Cloning the Repository:

  1. Click on the "Code" button on the top right corner of this repository page.
  2. Click on "Download ZIP" to download a compressed zip file of the repository.
  3. Extract the contents of the zip file to a folder on your computer.

OR

  1. Open your terminal or command prompt and navigate to the directory where you want to clone the repository.
  2. Run the command git clone https://github.com/g0v1ndN/Covid-SQL-Analytics to clone the repository to your local machine.

Note

You need to have Git installed on your computer. If you don't have it installed, please download and install Git from the official website.

Exploring the Code:

If you want to dive deep into the what this repository brings to the table, familiarize yourself with the main analysis code located in the analyze_covid.sql file. This code is split into three key parts:

  1. Initial Setup Queries: These queries establish the foundation by creating the necessary database, tables, and importing the Covid-19 data.
  2. Data Analysis Queries for India: This section contains SQL queries tailored to analyzing Covid-19 data specific to India.
  3. Global COVID-19 Data Analysis Queries: This segment comprises SQL queries aimed at dissecting and analyzing the global COVID-19 data landscape. It provides insight to cross-country comparisons and trends in the pandemic's data.

Observations & Findings:

Important

The data used in this project covers the period from January 1, 2020 to August 16, 2023. Please note that the source website's data might have been updated since the start of this project.

Understanding the COVID-19 Situation in India

  • COVID-19 was first reported in India on January 30, 2020.
  • As of the latest data, the infection percentage in India is 3.18%, indicating a relatively moderate spread of COVID-19 within the population.
  • The peak of new COVID-19 cases was recorded on May 2021, with a notable count of 9,284,558 new cases.
  • Currently, the mortality rate in India stands at 1.18%, indicating a comparatively low fatality rate among confirmed cases.
  • 72.50% of the population in India has received at least one dose of the Covid-19 vaccine, and 67.17% are fully vaccinated.
  • India has reported a significant number of COVID-19 deaths, totaling 531,922, which corresponds to 0.04% of its population.

Global COVID-19 Landscape

COVID-19 Outbreak and Global Data:

  • The COVID-19 outbreak, which originated in China, was officially reported on January 4, 2020.
  • The latest data reveals a COVID-19 infection rate of approximately 9.65%.
  • It also discloses a mortality rate of 0.90%, reflecting a relatively low fatality rate among confirmed cases.

Infection Statistics:

  • As of the current data, the top 5 hotspots with the highest infection percentages are Cyprus (73.76%), San Marino (72.37%), Brunei (69.09%), Austria (68.03%), and South Korea (65.42%).
  • In contrast, North Korea and Turkmenistan have zero cases, while Yemen, Niger, and Chad share the lowest infection rate of 0.04%.
  • The highest number of new COVID-19 cases globally occurred on January 2022, with a total of 91,806,015 new cases reported.

Vaccination Statistics:

  • Notably, vaccinations have been administered to more than 5.6 billion people, accounting for 70.48% of the population.
  • Gibraltar, Tokelau, United Arab Emirates, Qatar, Nauru, Brunei, and Pitcairn have successfully attained vaccination percentages exceeding 99%.
  • Vaccination rates vary across income categories, with 32.54% of low-income, 79.86% of high-income, 66.28% of lower-middle-income, and 83.42% of upper-middle-income individuals having been vaccinated at least once.

Regional Deaths and COVID-19 Toll:

  • Peru, Bulgaria, Bosnia and Herzegovina, Hungary, and North Macedonia exhibit higher death percentages, with rates ranging from 0.47% to 0.65%.
  • North America leads with the highest death count (1,127,152), followed by South America (704,659), Asia (531,922), Europe (399,938), Africa (102,595) and Oceania (22,618) on a continental basis.
  • The countries with the highest COVID-19 death toll are as follows: United States (1,127,152 deaths), Brazil (704,659 deaths), India (531,922 deaths), Russia (399,938 deaths), and Mexico (334,336 deaths).

External Resources:

Data Inclusion

  • Data Source: Our World in Data
  • Citation: Edouard Mathieu, Hannah Ritchie, Lucas Rodés-Guirao, Cameron Appel, Charlie Giattino, Joe Hasell, Bobbie Macdonald, Saloni Dattani, Diana Beltekian, Esteban Ortiz-Ospina, and Max Roser (2020) - "Coronavirus Pandemic (COVID-19)". Published online at OurWorldInData.org.
  • License: CC-BY-4.0 License

The Creative Commons Attribution (CC-BY) License is an open-source license that permits usage, modification, and distribution of the content, provided you attribute the original creator.

License:

This project is licensed under the terms & conditions (T&Cs) of the MIT license. If you're interested in reading more about it, please take a look at the LICENSE document.

covid-sql-analytics's People

Contributors

g0v1ndn avatar

Stargazers

Sreenath sunil  avatar Ram Sankar S. avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.