Automated Construction of Arabic-English Parallel Corpus

doi:10.21608/asc.2009.158223

Automated Construction of Arabic-English Parallel Corpus

Document Type : Original Article

10.21608/asc.2009.158223

Abstract

Large-scale parallel corpus has become a reliable resource to cross the
language barriers between the user and the web. These parallel texts provide the
primary training material for statistical translation models and testing machine
translation systems. Arabic-English parallel texts are not available in sufficient
quantities and manual construction is time consuming. Therefore, this paper
presents a technique that aims to construct an Arabic-English corpus automatically
through web mining. The proposed technique is straightforward, automated, and
portable to any pair of languages.

Keywords

Cross language information retrieval
parallel corpus construction
web mining
parallelism matching

Journal of the ACS Advances in Computer Science

Automated Construction of Arabic-English Parallel Corpus

Volume 3, Issue 1 - Serial Number 1 2009Pages 57-69

Files

Share

How to cite

Statistics

Volume 3, Issue 1 - Serial Number 1
2009
Pages 57-69