LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS

Mahmoud Gindiyeh, Gintare Grigonyte, Johann Haller, Algirdas Avižienis

2009

Abstract

Organizing documents and performing search is a common but not a trivial task in information systems. With the increasing number of documents, it is becoming crucial to automate these processes. Clustering is a solution for organizing large amount of documents. In this article we propose a method of improving document retrieval that was implemented in RKB Knowledge Base. Our method heavily relies on linguistic analysis, which aims to identify document specific noun phrases. We apply an adjusted hierarchical clustering algorithm for learning clusters of documents.

Download


Paper Citation


in Harvard Style

Gindiyeh M., Grigonyte G., Haller J. and Avižienis A. (2009). LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS . In Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2009) ISBN 978-989-674-013-9, pages 324-327. DOI: 10.5220/0002308703240327

in Bibtex Style

@conference{kmis09,
author={Mahmoud Gindiyeh and Gintare Grigonyte and Johann Haller and Algirdas Avižienis},
title={LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS},
booktitle={Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2009)},
year={2009},
pages={324-327},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002308703240327},
isbn={978-989-674-013-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2009)
TI - LINGUISTICALLY ENHANCED CLUSTERING OF TECHNICAL PUBLICATIONS
SN - 978-989-674-013-9
AU - Gindiyeh M.
AU - Grigonyte G.
AU - Haller J.
AU - Avižienis A.
PY - 2009
SP - 324
EP - 327
DO - 10.5220/0002308703240327