Giter Site home page Giter Site logo

ds-1.1-data-analysis's Introduction

DS 1.1: Data Analysis & Visualization

Course Description

In this course, students learn the foundational skills of data science, including data collection, scrubbing, analysis, and visualization with modern tools and libraries. Students gain a strong grounding in statistical concepts, utilize statistical techniques and master the science and art of data exploration and visualization to tell stories and persuade decision makers with data-driven insights.

Prerequisites:

CS 1.2

Learning Outcomes

By the end of this course, students will be able to...

  1. Conduct data manipulation and visualization
  2. Understand when to reject or accept a null hypothesis
  3. Apply descriptive statistics, probability, and other forms of data analysis techniques
  4. Describe and implement a plan for finding and dealing with problems in a dataset such as null values and outliers
  5. Perform statistical analysis on data collections using a variety of methods

Schedule

Course Dates: Tuesday, March 31 – Thursday, May 14, 2020 (7 weeks)

Class Times: Tuesday and Thursday at 9:30am to 12:15pm (14 class sessions)

Class Date Topics
1 Tue, March 31 Introduction to Data Science
2 Thu, April 2 Simple Data Manipulation
3 Tue, April 7 Data Manipulation & Visualization
4 Thu, April 9 How to Combine DataFrames
5 Tue, April 14 Applied Descriptive Statistics
6 Thu, April 16 Applied Probability to data frame
7 Tue, April 21 [NPS Project Data Wrangling Check-in]
8 Thu, April 23 PDFs, CDFs, and Normal Distributions
9 Tue, April 28 Hypothesis Testing & Acceptable Error
10 Thu, April 30 Confidence Intervals, Outliers, and Statistical Analysis
11 Tue, May 5 Time Series Data & Applications
12 Thu, May 7 [Lesson 12]
13 Tue, May 12 Final Exam
14 Thu, May 14 Presentations

Assignment Schedule

[INSTRUCTOR NOTE] REPLACE THE BELOW WITH LINKS TO YOUR ASSIGNMENTS, CORRECT DATES, AND SUBMISSION FORM

Assignment Date Assigned Due Date Submission Form
Midterm Project - NPS Data Analysis Tue, April 7 Tue, April 21 Submit Assignment
Homework 1 - Histogram Thu, April 23 Thu, April 30 Submit Assignment

Class Assignments

  • Implement a dataset processing with Numpy only and then Pandas
  • Write a function that calculate conditional probability for two arbitrary attributes and arbitrary condition

Tutorials

Students will complete the following guided tutorials in this course:

Projects

Students will complete the following self-guided projects in this course:

Evaluation

To pass this course you must meet the following requirements:

  • Do all in-class activities and one homework
  • Finish all required tutorials and two projects
  • Pass the final exam (summative assessment). The topics for final exam would be:
    • Null hypothesis, the steps to accept or reject it
    • Statistical terms and meanings such as Z-distribution, CDF, SF, ...
    • Histogram, density estimations
    • Outlier detection
    • Correlation

Make School Course Policies

ds-1.1-data-analysis's People

Contributors

aespaldi avatar djprofessorkash avatar ibirnam avatar miladtoutounchian avatar neptunius avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.