A LEARNING METHOD FOR IMBALANCED DATA SETS

Jorge de la Calleja, Olac Fuentes, Jesús González, Rita M. Aceves-Pérez

2009

Abstract

Many real-world domains present the problem of imbalanced data sets, where examples of one class significantly outnumber examples of other classes. This situation makes learning difficult, as learning algorithms based on optimizing accuracy over all training examples will tend to classify all examples as belonging to the majority class. In this paper we introduce a method for learning from imbalanced data sets which is composed of three algorithms. Our experimental results show that our method performs accurate classification in the presence of significant class imbalance and using small training sets.

Download


Paper Citation


in Harvard Style

de la Calleja J., Fuentes O., González J. and M. Aceves-Pérez R. (2009). A LEARNING METHOD FOR IMBALANCED DATA SETS . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009) ISBN 978-989-674-011-5, pages 307-310. DOI: 10.5220/0002305303070310

in Bibtex Style

@conference{kdir09,
author={Jorge de la Calleja and Olac Fuentes and Jesús González and Rita M. Aceves-Pérez},
title={A LEARNING METHOD FOR IMBALANCED DATA SETS},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)},
year={2009},
pages={307-310},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002305303070310},
isbn={978-989-674-011-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)
TI - A LEARNING METHOD FOR IMBALANCED DATA SETS
SN - 978-989-674-011-5
AU - de la Calleja J.
AU - Fuentes O.
AU - González J.
AU - M. Aceves-Pérez R.
PY - 2009
SP - 307
EP - 310
DO - 10.5220/0002305303070310