Automated Construction of Arabic-English Parallel Corpus | ||||
Journal of the ACS Advances in Computer Science | ||||
Article 5, Volume 3, Issue 1, 2009, Page 57-69 PDF (1.4 MB) | ||||
Document Type: Original Article | ||||
DOI: 10.21608/asc.2009.158223 | ||||
View on SCiNiTO | ||||
Abstract | ||||
Large-scale parallel corpus has become a reliable resource to cross the language barriers between the user and the web. These parallel texts provide the primary training material for statistical translation models and testing machine translation systems. Arabic-English parallel texts are not available in sufficient quantities and manual construction is time consuming. Therefore, this paper presents a technique that aims to construct an Arabic-English corpus automatically through web mining. The proposed technique is straightforward, automated, and portable to any pair of languages. | ||||
Keywords | ||||
Cross language information retrieval; parallel corpus construction; web mining; parallelism matching | ||||
Statistics Article View: 144 PDF Download: 158 |
||||