CLUSTER ENSEMBLE SELECTION - Using Average Cluster Consistency

F. Jorge F. Duarte, João M. M. Duarte, M. Fátima C. Rodrigues, Ana L. N. Fred

2009

Abstract

In order to combine multiple data partitions into a more robust data partition, several approaches to produce the cluster ensemble and various consensus functions have been proposed. This range of possibilities in the multiple data partitions combination raises a new problem: which of the existing approaches, to produce the cluster ensembles’ data partitions and to combine these partitions, best fits a given data set. In this paper, we address the cluster ensemble selection problem. We proposed a new measure to select the best consensus data partition, among a variety of consensus partitions, based on a notion of average cluster consistency between each data partition that belongs to the cluster ensemble and a given consensus partition. We compared the proposed measure with other measures for cluster ensemble selection, using 9 different data sets, and the experimental results shown that the consensus partitions selected by our approach usually were of better quality in comparison with the consensus partitions selected by other measures used in our experiments.

Download


Paper Citation


in Harvard Style

Jorge F. Duarte F., M. M. Duarte J., Fátima C. Rodrigues M. and L. N. Fred A. (2009). CLUSTER ENSEMBLE SELECTION - Using Average Cluster Consistency . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009) ISBN 978-989-674-011-5, pages 85-95. DOI: 10.5220/0002308500850095

in Bibtex Style

@conference{kdir09,
author={F. Jorge F. Duarte and João M. M. Duarte and M. Fátima C. Rodrigues and Ana L. N. Fred},
title={CLUSTER ENSEMBLE SELECTION - Using Average Cluster Consistency},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)},
year={2009},
pages={85-95},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002308500850095},
isbn={978-989-674-011-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2009)
TI - CLUSTER ENSEMBLE SELECTION - Using Average Cluster Consistency
SN - 978-989-674-011-5
AU - Jorge F. Duarte F.
AU - M. M. Duarte J.
AU - Fátima C. Rodrigues M.
AU - L. N. Fred A.
PY - 2009
SP - 85
EP - 95
DO - 10.5220/0002308500850095