PRE-PROCESSING TASKS FOR RULE-BASED ENGLISH-KOREAN MACHINE TRANSLATION SYSTEM

Sung-Dong Kim

2011

Abstract

This paper presents the pre-processing tasks needed for practical English-Korean machine translation. Each pre-processing task consists of a problem that requires pre-processing and a solution to that problem. English and Korean differ in many ways, and these differences are difficult to resolve using parsing and transfer rules. In addition, source sentences often include non-word elements such as parentheses, quotation marks, and list markers. To resolve the differences efficiently and to arrange source sentences into a form suitable for the translation system, we propose pre-processing the source sentences. We study various pre-processing tasks and classify them into several groups according to the point at which they are performed in the English-Korean machine translation system. In experiments, we show that the defined pre-processing tasks help generate better translation results.
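
As a rough illustration of the kind of non-word elements mentioned above, the following Python sketch separates a leading list marker and parenthetical text from a source sentence and normalizes curly quotation marks. It is not the method from the paper: the function name `preprocess`, the regular expressions, and the returned dictionary layout are all assumptions made for this example.

```python
import re

# Hypothetical patterns, chosen for illustration only (not from the paper).
# A leading list marker: "1.", "2)", "(a)", "b.", or a bullet character.
LIST_MARKER = re.compile(r"^\s*(?:\(?\d+[.)]|\(?[a-zA-Z][.)]|[-*\u2022])\s+")
# A non-nested parenthetical, including any whitespace before it.
PARENTHETICAL = re.compile(r"\s*\(([^()]*)\)")


def preprocess(sentence: str) -> dict:
    """Split non-word elements off a source sentence before parsing."""
    marker = ""
    match = LIST_MARKER.match(sentence)
    if match:
        # Keep the marker so it can be re-attached to the translation later.
        marker = match.group(0).strip()
        sentence = sentence[match.end():]

    # Collect parenthetical contents, then remove them from the sentence.
    asides = PARENTHETICAL.findall(sentence)
    core = PARENTHETICAL.sub("", sentence)

    # Normalize curly double quotes to straight quotes.
    core = core.replace("\u201c", '"').replace("\u201d", '"')
    # Collapse any runs of whitespace left behind.
    core = re.sub(r"\s+", " ", core).strip()

    return {"marker": marker, "core": core, "asides": asides}
```

Under these assumptions, only the `core` string would be passed to the parser, while `marker` and `asides` would be handled separately and merged back into the output after translation.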


Paper Citation


in Harvard Style

Kim S. (2011). PRE-PROCESSING TASKS FOR RULE-BASED ENGLISH-KOREAN MACHINE TRANSLATION SYSTEM. In Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART, ISBN 978-989-8425-40-9, pages 257-262. DOI: 10.5220/0003151702570262

in Bibtex Style

@conference{icaart11,
author={Sung-Dong Kim},
title={PRE-PROCESSING TASKS FOR RULE-BASED ENGLISH-KOREAN MACHINE TRANSLATION SYSTEM},
booktitle={Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART},
year={2011},
pages={257-262},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003151702570262},
isbn={978-989-8425-40-9},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence - Volume 1: ICAART
TI - PRE-PROCESSING TASKS FOR RULE-BASED ENGLISH-KOREAN MACHINE TRANSLATION SYSTEM
SN - 978-989-8425-40-9
AU - Kim S.
PY - 2011
SP - 257
EP - 262
DO - 10.5220/0003151702570262