SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS

Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu

2007

Abstract

Current search engines present their search results as a ranked list of Web pages. However, as the number of pages on theWeb increases exponentially, so does the number of search results for any given query. We present a novel subspace clustering based algorithm to organize keyword search results by simultaneously clustering and identifying distinguishing terms for each cluster. Our system, named Scuba Diver, enables users to better interpret the coverage of millions of search results and to refine their search queries through a keyword guided interface. We present experimental results illustrating the effectiveness of our algorithm by measuring purity, entropy and F-measure of generated clusters based on Open Directory Project (ODP).

Download


Paper Citation


in Harvard Style

Gelgi F., Vadrevu S. and Davulcu H. (2007). SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-972-8865-78-8, pages 334-339. DOI: 10.5220/0001288503340339

in Bibtex Style

@conference{webist07,
author={Fatih Gelgi and Srinivas Vadrevu and Hasan Davulcu},
title={SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2007},
pages={334-339},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001288503340339},
isbn={978-972-8865-78-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS
SN - 978-972-8865-78-8
AU - Gelgi F.
AU - Vadrevu S.
AU - Davulcu H.
PY - 2007
SP - 334
EP - 339
DO - 10.5220/0001288503340339