KEYWORDS EXTRACTION - Selecting Keywords in Natural Language Texts with Markov Chains and Neural Networks

Błażej Zyglarski, Piotr Bała

2010

Abstract

In this paper we present our approach to keyword extraction from natural language text. It is a revised and extended version of our previously published document analysis method, based on Kohonen neural networks with reinforcement, which uses data from a large document repository to check and improve results. We describe new improvements, achieved by preprocessing the set of words and creating an initial ranking using Markov chains. Our results show that keywords can be selected from text with high accuracy. We also present an evaluation and comparison of both methods, along with example results of keyword selection on random documents.
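
The abstract does not spell out how the Markov chain ranking is built, so the following is only a minimal sketch of one common way to obtain an initial word ranking from a Markov chain, not the authors' actual procedure. It assumes that the chain's states are the distinct words of a document, that transition probabilities come from counts of adjacent word pairs, and that words are ranked by an approximate stationary distribution computed by power iteration. The function name `markov_rank` and the `damping` and `iterations` parameters are hypothetical.

```python
from collections import defaultdict


def markov_rank(words, damping=0.85, iterations=50):
    """Rank words by an approximate stationary distribution of a
    word-to-word Markov chain (illustrative assumption, not the paper's method)."""
    if not words:
        return []

    # Count transitions between consecutive words: counts[a][b] = times b follows a.
    counts = defaultdict(lambda: defaultdict(int))
    for current, following in zip(words, words[1:]):
        counts[current][following] += 1

    vocab = sorted(set(words))
    n = len(vocab)
    rank = {w: 1.0 / n for w in vocab}  # start from the uniform distribution

    for _ in range(iterations):
        # The damping term keeps the chain irreducible (assumed, PageRank-style).
        new_rank = {w: (1.0 - damping) / n for w in vocab}
        for a in vocab:
            total = sum(counts[a].values())
            if total == 0:
                # A word with no successor spreads its mass uniformly.
                for w in vocab:
                    new_rank[w] += damping * rank[a] / n
            else:
                for b, c in counts[a].items():
                    new_rank[b] += damping * rank[a] * c / total
        rank = new_rank

    # Highest-scoring words first; these would be the initial keyword candidates.
    return sorted(rank.items(), key=lambda kv: kv[1], reverse=True)


if __name__ == "__main__":
    text = "markov chains rank words and markov chains select keywords"
    for word, score in markov_rank(text.split()):
        print(f"{word}\t{score:.4f}")
```

In a pipeline like the one the abstract outlines, such a ranking would only serve as a starting point that a later stage (here, the neural network with reinforcement) refines.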


Paper Citation


in Harvard Style

Zyglarski B. and Bała P. (2010). KEYWORDS EXTRACTION - Selecting Keywords in Natural Language Texts with Markov Chains and Neural Networks. In Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2010) ISBN 978-989-8425-30-0, pages 315-321. DOI: 10.5220/0003088003150321

in Bibtex Style

@conference{kmis10,
author={Błażej Zyglarski and Piotr Bała},
title={KEYWORDS EXTRACTION - Selecting Keywords in Natural Language Texts with Markov Chains and Neural Networks},
booktitle={Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2010)},
year={2010},
pages={315-321},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003088003150321},
isbn={978-989-8425-30-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Management and Information Sharing - Volume 1: KMIS, (IC3K 2010)
TI - KEYWORDS EXTRACTION - Selecting Keywords in Natural Language Texts with Markov Chains and Neural Networks
SN - 978-989-8425-30-0
AU - Zyglarski B.
AU - Bała P.
PY - 2010
SP - 315
EP - 321
DO - 10.5220/0003088003150321