数据科学小抄2.0

5 页有用的机器学习小抄,可帮助您进行考试复习、面试准备以及任何中间准备工作。『A helpful 5-page machine learning cheatsheet to assist with exam reviews, interview prep, and anything in-between.』

Github星跟踪图

Data Science Cheatsheet 2.0

A helpful 5-page data science cheatsheet to assist with exam reviews, interview prep, and anything in-between. It covers over a semester of introductory machine learning, and is based on MIT's Machine Learning courses 6.867 and 15.072. The reader should have at least a basic understanding of statistics and linear algebra, though beginners may find this resource helpful as well.

Inspired by Maverick's Data Science Cheatsheet (hence the 2.0 in the name), located here.

Topics covered:

  • Linear and Logistic Regression
  • Decision Trees and Random Forest
  • SVM
  • K-Nearest Neighbors
  • Clustering
  • Boosting
  • Dimension Reduction (PCA, LDA, Factor Analysis)
  • Natural Language Processing
  • Neural Networks
  • Recommender Systems
  • Reinforcement Learning
  • Anomaly Detection
  • Time Series
  • A/B Testing

This cheatsheet will be occasionally updated with new/improved info, so consider a follow or star to stay up to date.

Future additions (ideas welcome):

  • Time Series Added!
  • Statistics and Probability Added!
  • Data Imputation
  • Generative Adversarial Networks
  • Graph Neural Networks

Screenshots

Here are screenshots of a couple pages - the link to the full cheatsheet is above!


Why is Python/SQL not covered in this cheatsheet?

I planned for this resource to cover mainly algorithms, models, and concepts, as these rarely change and are common throughout industries. Technical languages and data structures often vary by job function, and refreshing these skills may make more sense on keyboard than on paper.

License

Feel free to share this resource in classes, review sessions, or to anyone who might find it helpful :)

This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Images are used for educational purposes, created by me, or borrowed from my colleagues here

Contact

Feel free to suggest comments, updates, and potential improvements!

Author - Aaron Wang

If you'd like to support this cheatsheet, you can buy me a coffee here. I also do resume, application, and tech consulting - send me a message if interested.

主要指标

概览
名称与所有者aaronwangy/Data-Science-Cheatsheet
主编程语言TeX
编程语言TeX (语言数: 1)
平台
许可证
所有者活动
创建于2021-02-05 06:01:57
推送于2023-03-15 22:16:54
最后一次提交
发布数0
用户参与
星数5.2k
关注者数153
派生数739
提交数47
已启用问题?
问题数9
打开的问题数6
拉请求数1
打开的拉请求数2
关闭的拉请求数1
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?