A Formal Modeling Method to Enrich the Arabic Treebank ATB with Syntactic Properties

Raja Bensalem Bahloul, Kais Haddar, Philippe Blache

2015

Abstract

The enrichment of an Arabic treebank with syntactic properties can facilitate many types of parsing processes. This enrichment allows also the increase of its use in different NLP applications, the acquirement of new linguistic resources and the ease of the probabilistic parsing process by using statistics to limit the properties to the satisfied ones or to the most frequent ones. In this context, our proposed enrichment method is based on a formalization phase, a Property Grammar induction phase from a source treebank and a treebank regeneration phase with a new syntactic property-based representation. Starting with a formalization phase in our enrichment problem may succeed its resolution procedure. In fact, it limits the specification of the data sets and the interactions between them to the used ones, which avoids any duplication. The formalization allows also the anticipation of the constraints to respect in the problem. The implementation of this enrichment method is experimented essentially on the Arabic treebank ATB. This experiment provides us with good and encouraging results and various properties of different types.

Download


Paper Citation


in Harvard Style

Bensalem Bahloul R., Haddar K. and Blache P. (2015). A Formal Modeling Method to Enrich the Arabic Treebank ATB with Syntactic Properties . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KEOD, (IC3K 2015) ISBN 978-989-758-158-8, pages 108-117. DOI: 10.5220/0005617001080117

in Bibtex Style

@conference{keod15,
author={Raja Bensalem Bahloul and Kais Haddar and Philippe Blache},
title={A Formal Modeling Method to Enrich the Arabic Treebank ATB with Syntactic Properties},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KEOD, (IC3K 2015)},
year={2015},
pages={108-117},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005617001080117},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: KEOD, (IC3K 2015)
TI - A Formal Modeling Method to Enrich the Arabic Treebank ATB with Syntactic Properties
SN - 978-989-758-158-8
AU - Bensalem Bahloul R.
AU - Haddar K.
AU - Blache P.
PY - 2015
SP - 108
EP - 117
DO - 10.5220/0005617001080117