Giter Site home page Giter Site logo

werayootk / 2110531_datascience_2022s1 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from kaopanboonyuen/2110531_datascience_2022s1

1.0 1.0 0.0 82.75 MB

Data Science Tools Course at Dept. of Computer Engineering, Chula 2022

License: Apache License 2.0

Jupyter Notebook 99.89% HTML 0.04% Python 0.07%

2110531_datascience_2022s1's Introduction

2110531 Data Science and Data Engineering Tools @Chula 2022

Support-Ukraine

alt text

Short links for exercises:

Week1: Intro to Numpy, Pandas

  1. Numpy: Open In Colab

  2. Pandas: Open In Colab

  3. Pandas with Youtube stat data: Open In Colab

  4. (Advanced) Pandas with Youtube stat data: Open In Colab

Assignment (Pandas with Youtube stat data): Open In Colab

Week2: Data Preparation

  1. EDA: Open In Colab

  2. Impute Missing Value: Open In Colab

  3. Split Train/Test: Open In Colab

  4. Outliers with Log: Open In Colab

  5. Outliers with Log (Titanic DataSet): Open In Colab

Assignment: Open In Colab

Week3-4: Traditional ML

  1. Decision Trees: Open In Colab

  2. Linear Regression: Open In Colab

  3. Logistic Regression: Open In Colab

  4. Neural Network: Open In Colab

  5. K Nearest Neighbors: Open In Colab

  6. SVM: Open In Colab

  7. Save and Load Model: Open In Colab

  8. K-Means: Open In Colab

  9. Market-Basket Analysis: Open In Colab

Assignment for Week3 (Safe to eat or deadly poison?): Open In GitHub

Mushroom

Week5-6: Intro to Deep Learning

  1. Image classification (basic): flower classification Open In Colab

  2. Image classification (advanced): flower classification Open In Colab

  3. Semantic Segmentation (UNET): The Oxford-IIIT pet dataset Open In Colab

  4. LSTM: Stock price prediction Open In Colab

  5. SARIMAX: PM2.5 forecasting Open In Colab

Assignment (Fashion MNIST): Open In Colab

Week8: Data Storage with Redis

Redis Example using local data

Assignment (connect to redis server)

Week9: Data Storage with Redis

  1. Basic Webpage Scarping Open In Colab

  2. Wikipeia Data ExtractionOpen In Colab

  3. Settrade Rest API Open In Colab

  4. Twitter Data Extraction Open In Colab

  5. Selenium Open In Colab

Assignment (Counting วันพระ)Open In Colab

Week10: Data Ingestion with Kafka

  1. Several simple examples including both produxer and consumer in simple folder

  2. Complex example in complex folder

  3. AVRO Producer Open In Colab and Consumer Open In Colab

  4. Group example in group folder

Assignment (Transaction Verifier)Open In Colab

Note: Do not forget to upload the following schema files to your Colab

Week11: Big Data Processing with Spark

  1. Basic Spark Open In Colab

Note: Do not forget to upload the following data file to your Colab

  1. Spark SQL Open In Colab

Note: Do not forget to upload the following data file to your Colab

  1. Spark ML Open In Colab

Note: Do not forget to upload the following data file to your Colab

Assignment (Analyze IMDB)Open In Colab

Note: Do not forget to upload the following data file to your Colab

Week12: Ops Stars

  1. Several airflow examples in (airflow folder)[https://github.com/kaopanboonyuen/2110531_DataScience_2022s1/tree/main/code/week12_orchestration/airflow]

  2. Several fastapi examples in (fastapi folder)[https://github.com/kaopanboonyuen/2110531_DataScience_2022s1/tree/main/code/week12_orchestration/fastapi]

Reference:

  1. https://www.kaggle.com/code
  2. https://www.tensorflow.org/tutorials
  3. https://github.com/topics/machine-learning
  4. https://archive.ics.uci.edu/ml/datasets.php

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.