ACTIVE SMOTE for Imbalanced Medical Data Classification - Université Paris Nanterre
Communication Dans Un Congrès Année : 2024

ACTIVE SMOTE for Imbalanced Medical Data Classification

Résumé

Classifying imbalanced data is a big challenge for machine learning techniques, especially for medical data. To deal with this challenge, many solutions have been proposed. The most famous methods are based on the Synthetic Minority Over-sampling Technique (SMOTE), which creates new synthetic instances in the minority class. In this paper, we study the efficiency of the SMOTE-based methods on some imbalanced data sets. We then propose extending these techniques with Active Learning to control the evolution of the minority class better. Active Learning uses uncertainty and diversity sampling to choose wisely the data points from which the synthetic samples will be generated. To evaluate our approach, we make comprehensive experimental studies on two medical data sets for diabetes diagnosis and breast cancer diagnosis.
Fichier principal
Vignette du fichier
Active_Smote_paper_ICIKS2023 (2).pdf (1.02 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04462505 , version 1 (16-02-2024)

Identifiants

Citer

Raul Sena, Sana Ben Hamida. ACTIVE SMOTE for Imbalanced Medical Data Classification. International Conference on Information and Knowledge Systems, Inès Saad, Camille Rosenthal-Sabroux, Faiez Gargouri, Salem Chakhar, Nigel Williams, Ella Haig, Jun 2023, Portsmouth, United Kingdom. pp.81-97, ⟨10.1007/978-3-031-51664-1_6⟩. ⟨hal-04462505⟩
24 Consultations
33 Téléchargements

Altmetric

Partager

More