COMPARATIVE STUDY OF ARABIC AND FRENCH STATISTICAL LANGUAGE MODELS
Karima Meftouh, Kamel Smaili, Mohamed Tayeb Laskri
2009
Abstract
In this paper, we propose a comparative study of statistical language models of Arabic and French. The objective of this study is to understand how to better model both Arabic and French. Several experiments using different smoothing techniques have been carried out. For French, trigram models are most appropriate whatever the smoothing technique used. For Arabic, the n-gram models of higher order smoothed with Witten Bell method are more efficient. Tests are achieved with comparable corpora and vocabularies in terms of size.
DownloadPaper Citation
in Harvard Style
Meftouh K., Smaili K. and Tayeb Laskri M. (2009). COMPARATIVE STUDY OF ARABIC AND FRENCH STATISTICAL LANGUAGE MODELS . In Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8111-66-1, pages 156-160. DOI: 10.5220/0001537501560160
in Bibtex Style
@conference{icaart09,
author={Karima Meftouh and Kamel Smaili and Mohamed Tayeb Laskri},
title={COMPARATIVE STUDY OF ARABIC AND FRENCH STATISTICAL LANGUAGE MODELS},
booktitle={Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,},
year={2009},
pages={156-160},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001537501560160},
isbn={978-989-8111-66-1},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Agents and Artificial Intelligence - Volume 1: ICAART,
TI - COMPARATIVE STUDY OF ARABIC AND FRENCH STATISTICAL LANGUAGE MODELS
SN - 978-989-8111-66-1
AU - Meftouh K.
AU - Smaili K.
AU - Tayeb Laskri M.
PY - 2009
SP - 156
EP - 160
DO - 10.5220/0001537501560160