EMPIRICAL TEXT MINING FOR GENRE DETECTION

Vasiliki Simaki, Sofia Stamou, Nikos Kirtsis

2012

Abstract

In this paper, we report on a preliminary study we carried out for identifying patterns that characterize the genre type of Greek texts. In the course of our study, we address four distinct genre types, we record their observable stylistic elements and we indicate their exploitation for automatic genre-based document classi-fication. The findings of our study demonstrate that texts contain lexical features with discriminative power as far as genre is concerned, however modeling those features so that they can be explored by computer-based applications is still in early stages.

Download


Paper Citation


in Harvard Style

Simaki V., Stamou S. and Kirtsis N. (2012). EMPIRICAL TEXT MINING FOR GENRE DETECTION . In Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8565-08-2, pages 733-737. DOI: 10.5220/0003956207330737

in Bibtex Style

@conference{webist12,
author={Vasiliki Simaki and Sofia Stamou and Nikos Kirtsis},
title={EMPIRICAL TEXT MINING FOR GENRE DETECTION},
booktitle={Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,},
year={2012},
pages={733-737},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003956207330737},
isbn={978-989-8565-08-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST,
TI - EMPIRICAL TEXT MINING FOR GENRE DETECTION
SN - 978-989-8565-08-2
AU - Simaki V.
AU - Stamou S.
AU - Kirtsis N.
PY - 2012
SP - 733
EP - 737
DO - 10.5220/0003956207330737