Challenges and Potentials for Keyword Extraction from Company Websites for the Development of Regional Knowledge Maps

Christian Wartena, Montserrat Garcia Alsina

2013

Abstract

Regional Innovation Systems describe the relations between actors, structures and infrastructures in a region in order to stimulate innovation and regional development. For these systems, the collection and organization of information is crucial. In the present paper we investigate the possibilities for extracting information from company websites. First, we describe regional innovation systems and the types of information needed to create them. Then we discuss the potential of text mining and keyword extraction techniques for extracting this information from company websites. Finally, we describe a small-scale experiment in which keywords related to economic sectors and commodities are extracted from the websites of over 200 companies. This experiment shows the main challenges of information extraction from websites for regional innovation systems.


Paper Citation


in Harvard Style

Wartena C. and Garcia Alsina M. (2013). Challenges and Potentials for Keyword Extraction from Company Websites for the Development of Regional Knowledge Maps. In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: SSTM, (IC3K 2013) ISBN 978-989-8565-75-4, pages 241-248. DOI: 10.5220/0004660002410248

in Bibtex Style

@conference{sstm13,
author={Christian Wartena and Montserrat Garcia Alsina},
title={Challenges and Potentials for Keyword Extraction from Company Websites for the Development of Regional Knowledge Maps},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: SSTM, (IC3K 2013)},
year={2013},
pages={241-248},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004660002410248},
isbn={978-989-8565-75-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: SSTM, (IC3K 2013)
TI - Challenges and Potentials for Keyword Extraction from Company Websites for the Development of Regional Knowledge Maps
SN - 978-989-8565-75-4
AU - Wartena C.
AU - Garcia Alsina M.
PY - 2013
SP - 241
EP - 248
DO - 10.5220/0004660002410248