SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS
Fatih Gelgi, Srinivas Vadrevu, Hasan Davulcu
2007
Abstract
Current search engines present their search results as a ranked list of Web pages. However, as the number of pages on theWeb increases exponentially, so does the number of search results for any given query. We present a novel subspace clustering based algorithm to organize keyword search results by simultaneously clustering and identifying distinguishing terms for each cluster. Our system, named Scuba Diver, enables users to better interpret the coverage of millions of search results and to refine their search queries through a keyword guided interface. We present experimental results illustrating the effectiveness of our algorithm by measuring purity, entropy and F-measure of generated clusters based on Open Directory Project (ODP).
DownloadPaper Citation
in Harvard Style
Gelgi F., Vadrevu S. and Davulcu H. (2007). SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS . In Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-972-8865-78-8, pages 334-339. DOI: 10.5220/0001288503340339
in Bibtex Style
@conference{webist07,
author={Fatih Gelgi and Srinivas Vadrevu and Hasan Davulcu},
title={SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS},
booktitle={Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2007},
pages={334-339},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001288503340339},
isbn={978-972-8865-78-8},
}
in EndNote Style
TY - CONF
JO - Proceedings of the Third International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - SCUBA DIVER: SUBSPACE CLUSTERING OF WEB SEARCH RESULTS
SN - 978-972-8865-78-8
AU - Gelgi F.
AU - Vadrevu S.
AU - Davulcu H.
PY - 2007
SP - 334
EP - 339
DO - 10.5220/0001288503340339