Automated Construction of Arabic-English Parallel Corpus

doi:10.21608/asc.2009.158223

	Automated Construction of Arabic-English Parallel Corpus
Journal of the ACS Advances in Computer Science
Article 5, Volume 3, Issue 1, 2009, Page 57-69 PDF (1.4 MB)
Document Type: Original Article
DOI: 10.21608/asc.2009.158223
View on SCiNiTO
Abstract
Large-scale parallel corpus has become a reliable resource to cross the language barriers between the user and the web. These parallel texts provide the primary training material for statistical translation models and testing machine translation systems. Arabic-English parallel texts are not available in sufficient quantities and manual construction is time consuming. Therefore, this paper presents a technique that aims to construct an Arabic-English corpus automatically through web mining. The proposed technique is straightforward, automated, and portable to any pair of languages.
Keywords
Cross language information retrieval; parallel corpus construction; web mining; parallelism matching


Statistics Article View: 144 PDF Download: 158