upsampling and downsampling
Degree of imbalance:
- Mid: 20-40%
- Moderate: 1-20%
- Extreme: 1%
Up-sampling
- Randomly duplicating observations from minority class
- Tools:
sklearn.utils.resample
Down-sampling
- Randomly removing observations from the majority class