ACTIVE SMOTE for Imbalanced Medical Data Classification - Université Paris Nanterre Access content directly
Conference Papers Year : 2024

ACTIVE SMOTE for Imbalanced Medical Data Classification


Classifying imbalanced data is a big challenge for machine learning techniques, especially for medical data. To deal with this challenge, many solutions have been proposed. The most famous methods are based on the Synthetic Minority Over-sampling Technique (SMOTE), which creates new synthetic instances in the minority class. In this paper, we study the efficiency of the SMOTE-based methods on some imbalanced data sets. We then propose extending these techniques with Active Learning to control the evolution of the minority class better. Active Learning uses uncertainty and diversity sampling to choose wisely the data points from which the synthetic samples will be generated. To evaluate our approach, we make comprehensive experimental studies on two medical data sets for diabetes diagnosis and breast cancer diagnosis.
Fichier principal
Vignette du fichier
Active_Smote_paper_ICIKS2023 (2).pdf (1.02 Mo) Télécharger le fichier
Origin Files produced by the author(s)

Dates and versions

hal-04462505 , version 1 (16-02-2024)



Raul Sena, Sana Ben Hamida. ACTIVE SMOTE for Imbalanced Medical Data Classification. International Conference on Information and Knowledge Systems, Inès Saad, Camille Rosenthal-Sabroux, Faiez Gargouri, Salem Chakhar, Nigel Williams, Ella Haig, Jun 2023, Portsmouth, United Kingdom. pp.81-97, ⟨10.1007/978-3-031-51664-1_6⟩. ⟨hal-04462505⟩
10 View
19 Download



Gmail Mastodon Facebook X LinkedIn More