๐ท๐บ Russian version ๐ท๐บ
โ The course in English started on Feb. 5, 2018 as a series of articles (a "Publication" on Medium) with assignments and Kaggle Inclass competitions. The next session is planned to start on Oct. 1, 2018. Fill in this form to participate:exclamation:
This is the list of published articles on Medium ๐ฌ๐ง, Habrahabr ๐ท๐บ, and jqr.com ๐จ๐ณ. Icons are clickable.
- Exploratory Data Analysis with Pandas ๐ฌ๐ง ๐ท๐บ ๐จ๐ณ
- Visual Data Analysis with Python ๐ฌ๐ง ๐ท๐บ ๐จ๐ณ
- Classification, Decision Trees and k Nearest Neighbors ๐ฌ๐ง ๐ท๐บ ๐จ๐ณ
- Linear Classification and Regression ๐ฌ๐ง ๐ท๐บ
- Bagging and Random Forest ๐ฌ๐ง ๐ท๐บ
- Feature Engineering and Feature Selection ๐ฌ๐ง ๐ท๐บ
- Unsupervised Learning: Principal Component Analysis and Clustering ๐ฌ๐ง ๐ท๐บ
- Vowpal Wabbit: Learning with Gigabytes of Data ๐ฌ๐ง ๐ท๐บ Kaggle Kernel
- Time Series Analysis with Python, part 1 ๐ฌ๐ง ๐ท๐บ. Predicting future with Facebook Prophet, part 2 ๐ฌ๐ง
- Gradient Boosting ๐ฌ๐ง ๐ท๐บ
- "Exploratory data analysis with Pandas", nbviewer. Deadline: Feb. 11, 23.59 CET
- "Analyzing cardiovascular disease data", nbviewer. Deadline: Feb. 18, 23.59 CET
- "Decision trees with a toy task and the UCI Adult dataset", nbviewer. Deadline: Feb. 25, 23.59 CET
- "User Identification with Logistic Regression", nbviewer. Deadline: March 11, 23.59 CET
- "Logistic Regression and Random Forest in the Credit Scoring Problem", nbviewer. Deadline: March 18, 23.59 CET
- Beating benchmarks in two Kaggle Inclass competitons. Part 1, "Alice", nbviewer. Part 2, "Medium", nbviewer. Deadline: April 1, 23.59 CET
- Unsupervised learning: PCA and clustering, nbviewer. Deadline: April 4, 23.59 CET
- Vowpal Wabbit and StackOverflow questions, nbviewer. Deadline: April 15, 23.59 CET
- Time series analysis, nbviewer. Deadline: April 15, 23.59 CET
- Gradient boosting and flight delays, nbviewer. Deadline: April 22, 23.59 CET
- Catch Me If You Can: Intruder Detection through Webpage Session Tracking. Kaggle Inclass
- How good is your Medium article? Kaggle Inclass
Throughout the course we are maintaining a student rating. It takes into account credits scored in assignments and Kaggle competitions. Top-10 students (according to the final rating) will be listed on a special Wiki page.
Discussions between students are held in the #eng_mlcourse_open channel of the OpenDataScience Slack team. Fill in this form to get an invitation. The form will also ask you some personal questions, don't hesitate ๐
- Prerequisites: Python, math and DevOps โ how to get prepared for the course
- Software requirements and Docker container โ this will guide you through installing all necessary stuff for working with course materials
- 1st session in English: all activities accounted for in rating
The course is free but you can support organizers by making a pledge on Patreon