Retrieval, Visualization and Validation of Affinities between Documents

Luis Trigo, Martin Víta, Rui Sarmento, Pavel Brazdil

2015

Abstract

We present an Information Retrieval tool that facilitates the task of the user when searching for a particular information that is of interest to him. Our system processes a given set of documents to produce a graph, where nodes represent documents and links the similarities. The aim is to offer the user a tool to navigate in this space in an easy way. It is possible to collapse/expand nodes. Our case study shows affinity groups based on the similarities of text production of researchers. This goes beyond the already established communities revealed by co-authorship. The system characterizes the activity of each author by a set of automatically generated keywords and by membership to a particular affinity group. The importance of each author is highlighted visually by the size of the node corresponding to the number of publications and different measures of centrality. Regarding the validation of the method, we analyse the impact of using different combinations of titles, abstracts and keywords on capturing the similarity between researchers.

Download


Paper Citation


in Harvard Style

Trigo L., Víta M., Sarmento R. and Brazdil P. (2015). Retrieval, Visualization and Validation of Affinities between Documents . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 3: KITA, (IC3K 2015) ISBN 978-989-758-158-8, pages 452-459. DOI: 10.5220/0005662904520459

in Bibtex Style

@conference{kita15,
author={Luis Trigo and Martin Víta and Rui Sarmento and Pavel Brazdil},
title={Retrieval, Visualization and Validation of Affinities between Documents},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 3: KITA, (IC3K 2015)},
year={2015},
pages={452-459},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005662904520459},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 3: KITA, (IC3K 2015)
TI - Retrieval, Visualization and Validation of Affinities between Documents
SN - 978-989-758-158-8
AU - Trigo L.
AU - Víta M.
AU - Sarmento R.
AU - Brazdil P.
PY - 2015
SP - 452
EP - 459
DO - 10.5220/0005662904520459