fagan2888 Goto Github PK
Type: User
Location: New York
Type: User
Location: New York
The homework and project of the class titled "Data Mining" @ ZJU
SIADS 532: Data Mining 1
In this project, as a part of data engineering nanodegree (Udacity) I applied what I have learned on data modeling with Apache Cassandra and completed an ETL pipeline using Python. I modelled the data by creating tables in Apache Cassandra to run queries. There is an ETL pipeline that transfers data from a set of CSV files within a directory to create a streamlined CSV file to model and insert data into Apache Cassandra tables.
In this project, as a part of data engineering nanodegree (Udacity) I preformed data modeling with Postgres and built an ETL pipeline using Python. I defined fact and dimension tables for a star schema for a particular analytic focus, and written an ETL pipeline that transfers data from files in two local directories into these tables in Postgres using Python and SQL.
Download data from IMDB movies and parse into useful form
Comparing safety and reliability among the largest U.S. subways using the National Transit Database
Cohort 16 Capstone Project for the Certificate of Data Science at Georgetown University School of Continuing Studies.
Envoy REST/proto API definitions and documentation
The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty in 2015 and 2016.
Chicago Data Portal (data.cityofchicago.org) tree map
Tracking the tools I've found useful
Russian regional economic monitoring datasets
Time series dataset of Rosstat Short-term Economic Indicators ("KEP") publication
Data Scaling Strategies.
The Washington Post is compiling a database of school shootings in the United States since Columbine.
An API
Collection of useful data science topics along with code and articles
Cheat Sheets
Carefully curated resource links for data science in one place
Some completed data science projects intended to showcase my experience with AI, machine learning, deep learning, and big data techniques.
DS4C: Data Science for COVID-19 in South Korea
This is basically a data analysis project in which I've applied many statistical methods to find hidden insights in the data.
code for Data Science From Scratch book
End-to-end projects for practice in learning how to create value with data.
Data Science in the tidyverse, a two-day workshop @ rstudio:conf(2018)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.