Tackling the Problem of Data Imbalancing for Melanoma Classification

Mojdeh Rastgoo, Guillaume Lemaitre, Joan Massich, Olivier Morel, Franck Marzani, Rafael Garcia, Fabrice Meriaudeau

2016

Abstract

Malignant melanoma is the most dangerous type of skin cancer, yet it is the most treatable kind of cancer when diagnosed at an early stage. In this regard, Computer-Aided Diagnosis systems based on machine learning have been developed to discern melanoma lesions from benign and dysplastic nevi in dermoscopic images. As in a wide range of real-world machine learning applications, melanoma classification faces the challenge of imbalanced data: melanoma cases are far less frequent than benign and dysplastic cases. This article analyzes the impact of data balancing strategies at the training step. Over-Sampling (OS) and Under-Sampling (US) are extensively compared in both feature and data space, revealing that NearMiss-2 (NM2) outperforms the other methods, achieving a Sensitivity (SE) and Specificity (SP) of 91.2% and 81.7%, respectively. More generally, the reported results highlight that methods based on US or a combination of OS and US in feature space outperform the others.
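To make the terms in the abstract concrete, the following is a minimal sketch of NearMiss-2 under-sampling and of the Sensitivity and Specificity metrics. It is not the authors' implementation. It assumes the commonly cited NearMiss-2 rule: keep the majority-class samples whose mean distance to their k farthest minority-class samples is smallest. The function names, the Euclidean distance, the toy two-dimensional features, and the value of k are illustrative assumptions.

```python
import math


def near_miss_2(majority, minority, n_keep, k=3):
    """Under-sample the majority class with the NearMiss-2 rule (assumed form).

    Each majority sample is scored by its mean Euclidean distance to its
    k farthest minority samples; the n_keep lowest-scoring samples are kept.
    """
    k = min(k, len(minority))
    scored = []
    for idx, x in enumerate(majority):
        # Distances to every minority sample, largest first.
        dists = sorted((math.dist(x, m) for m in minority), reverse=True)
        scored.append((sum(dists[:k]) / k, idx))
    scored.sort()
    return [majority[idx] for _, idx in scored[:n_keep]]


def sensitivity_specificity(y_true, y_pred, positive=1):
    """Return (SE, SP), treating `positive` as the melanoma label.

    Assumes both classes are present in y_true, so neither ratio divides by zero.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy feature vectors: 2 melanoma (minority), 4 non-melanoma (majority).
minority = [(0.0, 0.0), (1.0, 0.0)]
majority = [(0.5, 0.5), (5.0, 5.0), (0.0, 1.0), (10.0, 10.0)]

# Balance the training set by keeping as many majority samples as minority ones.
kept = near_miss_2(majority, minority, n_keep=len(minority), k=2)

# Hypothetical labels and predictions, only to exercise the metric function.
se, sp = sensitivity_specificity([1, 1, 1, 0, 0, 0, 0], [1, 1, 0, 0, 0, 0, 1])
print(kept, se, sp)
```

In this sketch the balancing is applied to the training samples only, which matches the abstract's statement that balancing is studied at the training step; the metrics would then be computed on untouched test data.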


Paper Citation


in Harvard Style

Rastgoo M., Lemaitre G., Massich J., Morel O., Marzani F., Garcia R. and Meriaudeau F. (2016). Tackling the Problem of Data Imbalancing for Melanoma Classification. In Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 2: BIOIMAGING, (BIOSTEC 2016) ISBN 978-989-758-170-0, pages 32-39. DOI: 10.5220/0005703400320039

in Bibtex Style

@conference{bioimaging16,
author={Mojdeh Rastgoo and Guillaume Lemaitre and Joan Massich and Olivier Morel and Franck Marzani and Rafael Garcia and Fabrice Meriaudeau},
title={Tackling the Problem of Data Imbalancing for Melanoma Classification},
booktitle={Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 2: BIOIMAGING, (BIOSTEC 2016)},
year={2016},
pages={32-39},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005703400320039},
isbn={978-989-758-170-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Joint Conference on Biomedical Engineering Systems and Technologies - Volume 2: BIOIMAGING, (BIOSTEC 2016)
TI - Tackling the Problem of Data Imbalancing for Melanoma Classification
SN - 978-989-758-170-0
AU - Rastgoo M.
AU - Lemaitre G.
AU - Massich J.
AU - Morel O.
AU - Marzani F.
AU - Garcia R.
AU - Meriaudeau F.
PY - 2016
SP - 32
EP - 39
DO - 10.5220/0005703400320039