LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS
Mahmoud Gindiyeh, Gintare Grigonyte, Johann Haller, Algirdas Avižienis
2009
Abstract
Organizing documents and performing search is a common but not a trivial task in information systems. With the increasing number of documents, it is becoming crucial to automate these processes. Clustering is a solution for organizing large amount of documents. In this article we propose a method of improving document retrieval that was implemented in RKB Knowledge Base. Our method heavily relies on linguistic analysis, which aims to identify document specific noun phrases. We apply an adjusted hierarchical clustering algorithm for learning clusters of documents.
DownloadPaper Citation
in Harvard Style
Gindiyeh M., Grigonyte G., Haller J. and Avižienis A. (2009). LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS . In Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2009) ISBN 978-989-674-013-9, pages 324-327. DOI: 10.5220/0002308703240327
in Bibtex Style
@conference{kmis09,
author={Mahmoud Gindiyeh and Gintare Grigonyte and Johann Haller and Algirdas Avižienis},
title={LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS},
booktitle={Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2009)},
year={2009},
pages={324-327},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002308703240327},
isbn={978-989-674-013-9},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2009)
TI - LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS
SN - 978-989-674-013-9
AU - Gindiyeh M.
AU - Grigonyte G.
AU - Haller J.
AU - Avižienis A.
PY - 2009
SP - 324
EP - 327
DO - 10.5220/0002308703240327