mlcourse.ai – Open Machine Learning Course

License: CC BY-NC-SA 4.0

mlcourse.ai is an open Machine Learning course by OpenDataScience. The course is designed to balance theory and practice. You can take part in several Kaggle Inclass competitions held during the course. From spring 2017 to fall 2019, six sessions of mlcourse.ai took place: 26k participants applied, 10k went on to pass the first assignment, and about 1,500 finished the course. Currently, the course is in self-paced mode. A thorough roadmap guiding you through the self-paced mlcourse.ai will be published in February 2020.

Mirrors (:uk:-only): mlcourse.ai (main site), Kaggle Dataset (same notebooks as Kernels)

Outline

This is the list of articles published on medium.com :uk: and habr.com :ru:. Notebooks in Chinese :cn: and links to Kaggle Kernels (in English) are also given. Icons are clickable.

  1. Exploratory Data Analysis with Pandas :uk: :ru: :cn:, Kaggle Kernel
  2. Visual Data Analysis with Python :uk: :ru: :cn:, Kaggle Kernels: part1, part2
  3. Classification, Decision Trees and k Nearest Neighbors :uk: :ru: :cn:, Kaggle Kernel
  4. Linear Classification and Regression :uk: :ru: :cn:, Kaggle Kernels: part1, part2, part3, part4, part5
  5. Bagging and Random Forest :uk: :ru: :cn:, Kaggle Kernels: part1, part2, part3
  6. Feature Engineering and Feature Selection :uk: :ru: :cn:, Kaggle Kernel
  7. Unsupervised Learning: Principal Component Analysis and Clustering :uk: :ru: :cn:, Kaggle Kernel
  8. Vowpal Wabbit: Learning with Gigabytes of Data :uk: :ru: :cn:, Kaggle Kernel
  9. Time Series Analysis with Python, part 1 :uk: :ru: :cn:. Predicting the future with Facebook Prophet, part 2 :uk: :cn:, Kaggle Kernels: part1, part2
  10. Gradient Boosting :uk: :ru: :cn:, Kaggle Kernel

Lectures

Video lectures are uploaded to this YouTube playlist.
Introduction, video, slides

  1. Exploratory data analysis with Pandas, video
  2. Visualization, main plots for EDA, video
  3. Decision trees: theory and practical part
  4. Logistic regression: theoretical foundations, practical part (baselines in the "Alice" competition)
  5. Ensembles and Random Forest – part 1. Classification metrics – part 2. Example of a business task, predicting a customer payment – part 3
  6. Linear regression and regularization - theory, LASSO & Ridge, LTV prediction - practice
  7. Unsupervised learning - Principal Component Analysis and Clustering
  8. Stochastic Gradient Descent for classification and regression - part 1, part 2 TBA
  9. Time series analysis with Python (ARIMA, Prophet) - video
  10. Gradient boosting: basic ideas - part 1, key ideas behind XGBoost, LightGBM, and CatBoost + practice - part 2

Demo assignments

  1. Exploratory data analysis with Pandas, nbviewer, Kaggle Kernel, solution
  2. Analyzing cardiovascular disease data, nbviewer, Kaggle Kernel, solution
  3. Decision trees with a toy task and the UCI Adult dataset, nbviewer, Kaggle Kernel, solution
  4. Sarcasm detection, Kaggle Kernel, solution. Linear Regression as an optimization problem, nbviewer, Kaggle Kernel
  5. Logistic Regression and Random Forest in the credit scoring problem, nbviewer, Kaggle Kernel, solution
  6. Exploring OLS, Lasso and Random Forest in a regression task, nbviewer, Kaggle Kernel, solution
  7. Unsupervised learning, nbviewer, Kaggle Kernel, solution
  8. Implementing an online regressor, nbviewer, Kaggle Kernel, solution
  9. Time series analysis, nbviewer, Kaggle Kernel, solution
  10. Beating a baseline in a competition, Kaggle Kernel

Kaggle competitions

  1. Catch Me If You Can: Intruder Detection through Webpage Session Tracking. Kaggle Inclass
  2. DotA 2 winner prediction. Kaggle Inclass

Community

Discussions are held in the #mlcourse_ai channel of the OpenDataScience (ods.ai) Slack team.

The course is free, but you can support the organizers by making a pledge on Patreon (monthly support) or a one-time payment on Ko-fi. This way you'll help spread Machine Learning around the world!


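Key metrics

Overview
Name and owner: Yorko/mlcourse.ai
Primary language: Python
Languages: Python (number of languages: 2)
License: Other

Owner activity
Created: 2017-02-27 08:32:20
Last pushed: 2025-05-27 20:57:11
Releases: 1
Latest release: v1.0.0
First release: v1.0.0

User engagement
Stars: 10.1k
Watchers: 574
Forks: 5.7k
Commits: 92
Issues: 134
Open issues: 3
Pull requests: 467
Open pull requests: 0
Closed pull requests: 188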