Applying a Hybrid Targeted Estimation of Distribution Algorithm to Feature Selection Problems

Geoffrey Neumann, David Cairns

2013

Abstract

This paper presents the results of applying the hybrid Targeted Estimation of Distribution Algorithm (TEDA) to feature selection problems with 500 to 20,000 features. TEDA uses parent fitness and features to provide a target for the number of features required for classification and can quickly drive down the size of the selected feature set even when the initial feature set is relatively large. TEDA is a hybrid algorithm that transitions between the selection and crossover approaches of a Genetic Algorithm (GA) and those of an Estimation of Distribution Algorithm (EDA) based on the reliability of the estimated probability distribution.Targeting the number of features in this way has two key benefits. Firstly, it enables TEDA to efficiently find good solutions for cases with very low signal to noise ratios where the majority of available features are not associated with the given classification task. Secondly, due to the tendency of TEDA to select the smallest and most promising initial feature set, it builds compact classifiers that are able to evaluate populations more quickly than other approaches.

Download


Paper Citation


in Harvard Style

Neumann G. and Cairns D. (2013). Applying a Hybrid Targeted Estimation of Distribution Algorithm to Feature Selection Problems . In Proceedings of the 5th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2013) ISBN 978-989-8565-77-8, pages 136-143. DOI: 10.5220/0004553301360143

in Bibtex Style

@conference{ecta13,
author={Geoffrey Neumann and David Cairns},
title={Applying a Hybrid Targeted Estimation of Distribution Algorithm to Feature Selection Problems},
booktitle={Proceedings of the 5th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2013)},
year={2013},
pages={136-143},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004553301360143},
isbn={978-989-8565-77-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 5th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2013)
TI - Applying a Hybrid Targeted Estimation of Distribution Algorithm to Feature Selection Problems
SN - 978-989-8565-77-8
AU - Neumann G.
AU - Cairns D.
PY - 2013
SP - 136
EP - 143
DO - 10.5220/0004553301360143