Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem

Ana Paula Silva, Arlindo Silva, Irene Rodrigues

2012

Abstract

In this paper we present an evolutionary approach to the part-of-speech tagging problem. The goal of part-of-speech tagging is to assign each word of a text its part of speech. The task is not straightforward, because a large percentage of words have more than one possible part of speech, and the right choice is determined by the parts of speech of the surrounding words. Solving the problem therefore requires a method for disambiguating a word’s set of possible tags. Traditionally, two groups of methods have been used to tackle this task. The first is based on statistical data about the possible contexts of a word, while the second is based on rules, normally designed by human experts, that capture properties of the language. In this work we present a solution that tries to incorporate both approaches. The proposed system is divided into two components. First, we use an evolutionary algorithm that, for each part-of-speech tag in the training corpus, evolves a set of disambiguation rules. We then use a second evolutionary algorithm, guided by the rules found earlier, to solve the tagging problem. The results obtained on two different corpora are among the best published for those corpora.
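
To make the second component more concrete, the following is a minimal Python sketch of an evolutionary search over tag sequences that is scored by context rules. It is not the authors' implementation: the lexicon, the rule format (previous tag, next tag, candidate tag, weight), the weights, and the function names `fitness` and `evolve_tagging` are all assumptions made for illustration, and the hand-written rules stand in for the rules that the first evolutionary algorithm would produce.

```python
import random

# Hypothetical toy lexicon: each word maps to its possible tags.
LEXICON = {
    "the": ["DET"],
    "can": ["NOUN", "VERB", "AUX"],
    "rusts": ["VERB", "NOUN"],
}

# Hand-written stand-ins for evolved disambiguation rules:
# (previous tag, next tag, candidate tag, weight). None means "any tag";
# "<s>" and "</s>" mark the sentence boundaries.
RULES = [
    ("DET", None, "NOUN", 2.0),
    ("NOUN", None, "VERB", 1.5),
    ("DET", None, "VERB", -2.0),
    (None, "</s>", "AUX", -1.0),
]

def fitness(tags):
    """Sum the weights of every rule whose context matches a position."""
    padded = ["<s>"] + list(tags) + ["</s>"]
    score = 0.0
    for i in range(1, len(padded) - 1):
        for prev, nxt, tag, weight in RULES:
            if (tag == padded[i]
                    and prev in (None, padded[i - 1])
                    and nxt in (None, padded[i + 1])):
                score += weight
    return score

def random_tagging(words, rng):
    """Pick one admissible tag per word, uniformly at random."""
    return [rng.choice(LEXICON[w]) for w in words]

def mutate(tags, words, rng):
    """Re-draw the tag of one randomly chosen word from its lexicon entry."""
    child = list(tags)
    i = rng.randrange(len(words))
    child[i] = rng.choice(LEXICON[words[i]])
    return child

def crossover(a, b, rng):
    """One-point crossover between two taggings of the same sentence."""
    cut = rng.randrange(1, len(a)) if len(a) > 1 else 0
    return a[:cut] + b[cut:]

def evolve_tagging(words, generations=30, pop_size=12, seed=0):
    """Evolve a tag sequence for `words` that maximises the rule score."""
    rng = random.Random(seed)
    population = [random_tagging(words, rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        parents = population[: pop_size // 2]  # truncation selection keeps the best half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, rng), words, rng))
        population = parents + children
    return max(population, key=fitness)

if __name__ == "__main__":
    sentence = ["the", "can", "rusts"]
    best = evolve_tagging(sentence)
    print(list(zip(sentence, best)), fitness(best))
```

Each individual is a complete tagging of the sentence, so mutation and crossover always yield admissible tag sequences, and the rules enter only through the fitness function. How the paper actually encodes individuals, weights its rules, and selects parents is not stated in the abstract.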


Paper Citation


in Harvard Style

Paula Silva A., Silva A. and Rodrigues I. (2012). Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem. In Proceedings of the 4th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2012), ISBN 978-989-8565-33-4, pages 5-14. DOI: 10.5220/0004112000050014

in Bibtex Style

@conference{ecta12,
author={Ana Paula Silva and Arlindo Silva and Irene Rodrigues},
title={Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem},
booktitle={Proceedings of the 4th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2012)},
year={2012},
pages={5-14},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004112000050014},
isbn={978-989-8565-33-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 4th International Joint Conference on Computational Intelligence - Volume 1: ECTA, (IJCCI 2012)
TI - Tagging with Disambiguation Rules - A New Evolutionary Approach to the Part-of-Speech Tagging Problem
SN - 978-989-8565-33-4
AU - Paula Silva A.
AU - Silva A.
AU - Rodrigues I.
PY - 2012
SP - 5
EP - 14
DO - 10.5220/0004112000050014