BASMA: BibAlex Standard Arabic Morphological Analyzer | ||||
The Egyptian Journal of Language Engineering | ||||
Article 3, Volume 3, Issue 1, April 2016, Page 24-33 PDF (1012.13 K) | ||||
Document Type: Original Article | ||||
DOI: 10.21608/ejle.2016.60166 | ||||
View on SCiNiTO | ||||
Author | ||||
Sameh Alansary | ||||
Phonetics and Linguistics Department, Faculty of Arts, Alexandria University | ||||
Abstract | ||||
Arabic morphology poses special challenges to computational natural language processing systems. Its rich morphology and the highly complex word formation process of roots and patterns make computational approaches to Arabic very challenging. Morphological analyzers are preprocessors for text analysis. This paper sheds the light on BASMA-Tool (BibAlex Standard Arabic Morphological Analyzer) that has been initiated at Bibliotheca Alexandrina (BA). The BASMA tool is based on Buckwalter Arabic Morphological Analyzer (BAMA). It focuses on fixing its problems, adding a set of useful morphological features that BAMA does not provide, and disambiguating its multiple solutions. This is done depending on a well training data and a hybrid system (Rule based and memory based). Precision and Recall are the evaluation measures used to evaluate BASMA tool. At this point, precision measurement was 93.37% while recall measurement was 96.9%. The percentages are expected to rise by implementing the improvements while working on larger amounts of data. | ||||
Keywords | ||||
Buckwalter Arabic Morphological Analyzer; BASMA-Tool; Morphological analyzers | ||||
Statistics Article View: 202 PDF Download: 468 |
||||