Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
18 changes: 16 additions & 2 deletions doc/over_sampling.rst
Original file line number Diff line number Diff line change
@@ -1,11 +1,25 @@
.. _over-sampling:

=============
Over-sampling
=============

.. currentmodule:: imblearn.over_sampling

.. warning::

Oversampling methods such as RandomOverSampler, SMOTE and ADASYN
artificially modify the class distribution of the training data.
While they can improve the recall of the minority class, they do not
necessarily improve ranking metrics such as ROC-AUC and can lead to
poorly calibrated predicted probabilities.

In many situations, using ``class_weight`` or ``sample_weight`` in the
classifier is a simpler and often more reliable alternative.

There is ongoing discussion in the research and practitioner community
about the usefulness of oversampling methods. These techniques should be
used with care and always validated using appropriate metrics, including
both discrimination (e.g. ROC-AUC) and probability calibration.

A practical guide
=================

Expand Down