Question d’entretien chez Capital One

How do you deal with unbalanced classification problem?

Réponses aux questions d'entretien

Utilisateur anonyme

25 févr. 2020

Sampling technique like oversampling and downsampling. Models that adapt to unbalanced data like Ada-boosting. Or use anomaly detection methods.

Utilisateur anonyme

12 oct. 2021

The basic idea is sampling (oversampling like resampling) which is not a good option when dealing with huge imbalance. Another option is to set weights for class labels (larger weights for classes with fewer samples). A really good, but tough and many times not feasible to implement, is to do oversampling by using generative networks (i.e. GANs).

Utilisateur anonyme

24 avr. 2022

over/undersampling, GANs (I believe they require lots of data), and class weights (sometimes refer to class weights)