Chinese translation of Andrew Ng's new book "Machine learning yearning". 已经有很多人在做同样的工作了,但我想自己试试独立完成翻译一本技术书籍。
- 为什么使用机器学习 Why Machine Learning Strategy
- 如何使用本书来帮助你的团队 How to use this book to help your team
- 预备知识 Prerequisites and Notation
- 规模驱动的机器学习进步 Scale drives machine learning progress
- 验证集和测试集 Your development and test sets
- 验证集和测试集需要满足相同的分布 Your dev and test sets should come from the same distribution
- 如何设置验证/测试集的大小 How large do the dev/test sets need to be?
- 建立用于优化的单一评估指标 Establish a single-number evaluation metric for your team to optimize
- 优化指标 Optimizing and satisficing metrics
- 验证集和评估指标可以加速迭代 Having a dev set and metric speeds up iterations
- 如何修改验证/测试集和评估指标 When to change dev/test sets and metrics
- Takeaways: Setting up development and test sets
- Build your first system quickly, then iterate
- Error analysis: Look at dev set examples to evaluate ideas
- Evaluating multiple ideas in parallel during error analysis
- Cleaning up mislabeled dev and test set examples
- If you have a large dev set, split it into two subsets, only one of which you look at
- How big should the Eyeball and Blackbox dev sets be?
- Takeaways: Basic error analysis
- Bias and Variance: The two big sources of error
- Examples of Bias and Variance
- Comparing to the optimal error rate
- Addressing Bias and Variance
- Bias vs. Variance tradeoff
- Techniques for reducing avoidable bias
- Error analysis on the training set
- Techniques for reducing variance
- Diagnosing bias and variance: Learning curves
- Plotting training error
- Interpreting learning curves: High bias
- Interpreting learning curves: Other cases
- Plotting learning curves
- Why we compare to human-level performance
- How to define human-level performance
- Surpassing human-level performance
- When you should train and test on different distributions
- How to decide whether to use all your data
- How to decide whether to include inconsistent data
- Weighting data
- Generalizing from the training set to the dev set
- Identifying Bias, Variance, and Data Mismatch Errors
- Addressing data mismatch
- Artificial data synthesis
- The Optimization Verification test
- General form of Optimization Verification test
- Reinforcement learning example
- The rise of end-to-end learning
- More end-to-end learning examples
- Pros and cons of end-to-end learning
- Choosing pipeline components: Data availability
- Choosing pipeline components: Task simplicity
- Directly learning rich outputs
- Error analysis by parts
- Attributing error to one part
- General case of error attribution
- Error analysis by parts and comparison to human-level performance
- Spotting a flawed ML pipeline
- Building a superhero team - Get your teammates to read this