Relevance and Mutual Information-based Feature Discretization

Artur Ferreira, Mario Figueiredo

2013

Abstract

In many learning problems, feature discretization (FD) techniques yield compact data representations, which often lead to shorter training time and higher classification accuracy. In this paper, we propose two new FD techniques. The first method is based on the classical Linde-Buzo-Gray quantization algorithm, guided by a relevance criterion, and is able to work in unsupervised, supervised, or semi-supervised scenarios, depending on the adopted measure of relevance. The second method is a supervised technique based on the maximization of the mutual information between each discrete feature and the class label. For both methods, our experiments on standard benchmark datasets show their ability to scale up to high-dimensional data, attaining in many cases better accuracy than other FD approaches, while using fewer discretization intervals.

Download


Paper Citation


in Harvard Style

Ferreira A. and Figueiredo M. (2013). Relevance and Mutual Information-based Feature Discretization . In Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM, ISBN 978-989-8565-41-9, pages 68-77. DOI: 10.5220/0004268000680077

in Bibtex Style

@conference{icpram13,
author={Artur Ferreira and Mario Figueiredo},
title={Relevance and Mutual Information-based Feature Discretization},
booktitle={Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,},
year={2013},
pages={68-77},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004268000680077},
isbn={978-989-8565-41-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 2nd International Conference on Pattern Recognition Applications and Methods - Volume 1: ICPRAM,
TI - Relevance and Mutual Information-based Feature Discretization
SN - 978-989-8565-41-9
AU - Ferreira A.
AU - Figueiredo M.
PY - 2013
SP - 68
EP - 77
DO - 10.5220/0004268000680077