Giter Site home page Giter Site logo

ad450's Introduction

AD 450 - Intro to Data Science Development

Class Details

  • Location: IB 3309
  • Time: Hybrid, Thurs 6:00 - 8:40pm
  • Dates: 1/4/2022 - 3/23/2022
  • Instructor: Brenden West | [email protected]
  • Credits: 5
  • Prerequisites: AD 315, AD 325, & acceptance into AD-BAS program or instructor permission

Note: Students should be familiar with Python programming, SQL database queries, basic statistics, and source control with git & Github. Summary

This class introduces core concepts of data science with a focus on the Python programming tools. Students will learn and exercise key skills used by professional data engineers, data analysts, and data scientists.

Required Materials

This course will make use of the following textbooks as well as relevant online resources:

  • Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython, 2nd edition. Wes McKinney
  • Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd Edition. Aurélien Géron

These books should be accessible free of charge through OReilly Books online, but can also be purchased as hard copy if desired.

Students will need the following development resources:

Students choosing to use their own personal computer will need to install Python 3.8+ as well as the Jupyter notebook application. https://www.anaconda.com/distribution/ is a good option for this.

Class Structure & Topics

Each week students will have assigned reading, tutorials or video lectures, as well as in-class lecture and exercises covering the following topics. Sequence and precise list is subject to revision:

  • Intro to Data Science
  • Python for data science
  • Data handling with NumPy & Pandas
  • Data visualization
  • Exploratory statistics
  • Time series and categorical data analysis
  • Probability & hypothesis testing
  • Prediction and classification
  • Introductory Machine Learning
  • Data Science ethics

Grading

Grades will be based on regular assignments (85%) and 3 quizzes (15%).

Homework assignments will be posted on Canvas each Monday and are designed to exercise the skills and concepts covered that week. Assignments are due by the following Sunday and late assignments may be automatically marked down by 10%.

Students should publish completed programming assignments to their github.com repository and submit the repository link in Canvas. Successful submissions can downloadable from that repository, and should execute in Google Colab without errors.

You may discuss how to approach a problem with other students in the class, but your submission must be your own work. Submissions that are substantially identical would be considered cheating and subject to a markdown penalty.

Quizzes will be administered via Canvas and must be completed during the specified time frame to receive credit.

Resources

Note: NSC students can access many technical reference books free of charge through: https://sbctc-seattlecolleges.primo.exlibrisgroup.com/discovery/search?vid=01STATEWA_SEATTLE:NSEATTLE

(you may need to access online texts via this link - https://learning.oreilly.com/home/)

ad450's People

Contributors

brendenwest avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

pouln

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.