4.8 Summary

Notes

General definitions:

Metric: A single number that describes the performance of a model
Accuracy: Fraction of correct predictions; can be misleading when the classes are imbalanced
Precision and recall are less misleading when we have class imbalance
ROC Curve: A way to evaluate the performance at all thresholds; okay to use with imbalance
K-Fold CV: A more reliable estimate of performance, reported as the mean and standard deviation of the metric across folds
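
To make the point about accuracy and class imbalance concrete, here is a minimal sketch in plain Python. The toy labels and the helper names (`confusion_counts`, `accuracy`, `precision`, `recall`) are made up for illustration and are not taken from the course notebook; returning 0.0 when a denominator is zero is also just a convention chosen here.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels encoded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(y_true, y_pred):
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return (tp + tn) / (tp + tn + fp + fn)


def precision(y_true, y_pred):
    tp, _, fp, _ = confusion_counts(y_true, y_pred)
    # Convention used in this sketch: 0.0 when nothing was predicted positive
    return tp / (tp + fp) if (tp + fp) > 0 else 0.0


def recall(y_true, y_pred):
    tp, _, _, fn = confusion_counts(y_true, y_pred)
    # Convention used in this sketch: 0.0 when there are no actual positives
    return tp / (tp + fn) if (tp + fn) > 0 else 0.0


# Imbalanced toy data: 10 positives followed by 90 negatives
y_true = [1] * 10 + [0] * 90

# A "dummy" model that always predicts the majority (negative) class
y_dummy = [0] * 100

# A hypothetical model that finds 6 of the 10 positives and raises 4 false alarms
y_model = [1] * 6 + [0] * 4 + [1] * 4 + [0] * 86

for name, y_pred in [("dummy", y_dummy), ("model", y_model)]:
    print(
        f"{name}: accuracy={accuracy(y_true, y_pred):.2f} "
        f"precision={precision(y_true, y_pred):.2f} "
        f"recall={recall(y_true, y_pred):.2f}"
    )
```

The dummy model scores 90% accuracy while finding none of the positives, which is exactly the situation where precision and recall are more informative.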

In brief, this week was about different metrics for evaluating a binary classifier: accuracy, the confusion table, precision, recall, ROC curves (TPR, FPR, the random model, and the ideal model), and AUROC. We also covered cross-validation as a more robust way to estimate model performance and to tune parameters.
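
The sketch below ties two of these ideas together: AUROC, computed as the probability that a randomly chosen positive example scores higher than a randomly chosen negative one, and K-fold cross-validation, which reports that metric as a mean and standard deviation over folds. It uses only the standard library. The synthetic data, the toy one-feature "centroid" scorer, and all helper names are assumptions made for illustration, not the model or code from the course notebook; in practice a library such as scikit-learn provides equivalents.

```python
import random
import statistics


def auc(y_true, scores):
    """AUROC as P(score of random positive > score of random negative); ties count half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def kfold_indices(n, k, seed=1):
    """Yield (train_idx, val_idx) pairs for k shuffled folds over n examples."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        val_idx = folds[i]
        train_idx = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train_idx, val_idx


def fit(xs, ys):
    """Toy model (hypothetical): store the mean feature value of each class."""
    mean_pos = statistics.mean(x for x, y in zip(xs, ys) if y == 1)
    mean_neg = statistics.mean(x for x, y in zip(xs, ys) if y == 0)
    return mean_pos, mean_neg


def predict_score(model, x):
    """Linear score: positive when x lies on the positive class's side of the midpoint."""
    mean_pos, mean_neg = model
    return (mean_pos - mean_neg) * (x - (mean_pos + mean_neg) / 2)


# Sanity check of auc on a tiny hand-made example: 3 of the 4 pairs are ranked correctly
print(auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))

# Synthetic one-feature data: positives are centered at 1, negatives at 0
rng = random.Random(42)
ys = [i % 2 for i in range(200)]
xs = [rng.gauss(1.0 if y == 1 else 0.0, 1.0) for y in ys]

fold_scores = []
for train_idx, val_idx in kfold_indices(len(xs), k=5):
    # Fit on the training folds only, then score the held-out fold
    model = fit([xs[i] for i in train_idx], [ys[i] for i in train_idx])
    val_scores = [predict_score(model, xs[i]) for i in val_idx]
    fold_scores.append(auc([ys[i] for i in val_idx], val_scores))

mean_auc = statistics.mean(fold_scores)
std_auc = statistics.pstdev(fold_scores)  # population standard deviation over folds
print(f"AUC: {mean_auc:.3f} +- {std_auc:.3f}")
```

For parameter tuning, the same loop would be repeated for each candidate parameter value, and the value with the best mean score (taking the spread into account) would be kept.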

The code for this project is available in this Jupyter notebook.

Add notes from the video (PRs are welcome)

⚠️ The notes are written by the community. If you see an error here, please create a PR with a fix.

Notes from Maximilien Eyengue

Navigation

Machine Learning Zoomcamp course
Session 4: Evaluation Metrics for Classification
Previous: Cross-Validation
Next: Explore more
