Giter Site home page Giter Site logo

dat5's Introduction

DAT5 Course Repository

Course materials for General Assembly's Data Science course in Washington, DC (3/18/15 - 6/3/15).

Instructors: Kevin Markham and Brandon Burroughs

Monday Wednesday
3/18: Introduction and Python
3/23: Git and Command Line 3/25: Exploratory Data Analysis
3/30: Visualization and APIs 4/1: Machine Learning and KNN
4/6: Bias-Variance and Train/Test Split 4/8: Kaggle Titanic (Part 1)
4/13: Web Scraping, Tidy Data, Reproducibility 4/15: Linear Regression
4/20: Logistic Regression and Confusion Matrix 4/22: ROC and Cross-Validation
4/27: Project Presentation #1 4/29: Kaggle Titanic (Part 2)
5/4: Naive Bayes 5/6: Natural Language Processing
5/11: Decision Trees 5/13: Ensembles
5/18: Clustering and Regularization 5/20: Advanced scikit-learn
5/25: No Class 5/27: Databases and SQL
6/1: Course Review 6/3: Project Presentation #2

Key Project Dates

  • 3/30: Deadline for discussing your project idea(s) with an instructor
  • 4/6: Project question and dataset (write-up)
  • 4/27: Project presentation #1 (slides, code, visualizations)
  • 5/18: First draft due (draft of project paper, code, visualizations)
  • 5/25: Peer review due
  • 6/3: Project presentation #2 (project paper, slides, code, visualizations, data, data dictionary)

Key Project Links

Logistics

  • Office hours will take place every Saturday and Sunday.
  • Homework will be assigned every Wednesday and due on Monday, and you'll receive feedback by Wednesday.
  • Our primary tool for out-of-class communication will be a private chat room through Slack.

Submission Forms

Before the Course Begins

Python Resources


Class 1: Introduction and Python

  • Introduction to General Assembly
  • Course overview (slides)
  • Brief tour of Slack
  • Checking the setup of your laptop
  • Python lesson with airline safety data (code)

Homework:

Optional:

  • If we discovered any setup issues with your laptop, please resolve them before Monday.
  • If you're not feeling comfortable in Python, keep practicing using the resources above!

Class 2: Git and Command Line

  • Any questions about the course project?
  • Command line (slides)
  • Git and GitHub (slides)

Homework:

  • Command line exercises with SMS Spam Data (listed at the bottom of Introduction to the Command Line)
  • Note: This homework is not due until Monday. You might want to create a GitHub repo for your homework instead of using Gist!

Optional:

Resources:

dat5's People

Contributors

justmarkham avatar brandonmburroughs avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.