Evidence Accumulation Clustering using Pairwise Constraints

João M. M. Duarte, Ana L. N. Fred, F. Jorge F. Duarte

2012

Abstract

Recent work on constrained data clustering have shown that the incorporation of pairwise constraints, such as must-link and cannot-link constraints, increases the accuracy of single run data clustering methods. It was also shown that the quality of a consensus partition, resulting from the combination of multiple data partitions, is usually superior than the quality of the partitions produced by single run clustering algorithms. In this paper we test the effectiveness of adding pairwise constraints to the Evidence Accumulation Clustering framework. For this purpose, a new soft-constrained hierarchical clustering algorithm is proposed and is used for the extraction of the consensus partition from the co-association matrix. It is also studied whether there are advantages in selecting the must-link and cannot-link constraints on certain subsets of the data instead of selecting these constraints at random on the entire data set. Experimental results on 7 synthetic and 7 real data sets have shown the use of soft constraints improves the performance of the Evidence Accumulation Clustering.

Download


Paper Citation


in Harvard Style

M. M. Duarte J., L. N. Fred A. and F. Duarte F. (2012). Evidence Accumulation Clustering using Pairwise Constraints . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012) ISBN 978-989-8565-29-7, pages 293-299. DOI: 10.5220/0004171902930299

in Bibtex Style

@conference{kdir12,
author={João M. M. Duarte and Ana L. N. Fred and F. Jorge F. Duarte},
title={Evidence Accumulation Clustering using Pairwise Constraints},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)},
year={2012},
pages={293-299},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004171902930299},
isbn={978-989-8565-29-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval - Volume 1: KDIR, (IC3K 2012)
TI - Evidence Accumulation Clustering using Pairwise Constraints
SN - 978-989-8565-29-7
AU - M. M. Duarte J.
AU - L. N. Fred A.
AU - F. Duarte F.
PY - 2012
SP - 293
EP - 299
DO - 10.5220/0004171902930299