Correctness, Strength and Similarity Evaluation of Stemming Algorithms for Arabic | ||
The Egyptian Journal of Language Engineering | ||
Article 2, Volume 1, Issue 1, January 2014, Pages 17-23 PDF (301.11 K) | ||
Document Type: Original Article | ||
DOI: 10.21608/ejle.2014.59847 | ||
Authors | ||
Daoud Daoud* 1; Christian Boitet2 | ||
1Princess Somaya University for Technology | ||
2GETALP, LIG, Université Joseph Fourier, France | ||
Abstract | ||
In this paper, we present a comprehensive evaluation of four Arabic stemmers, based on metrics for correctness, strength and similarity. Two data sets were used in this study. For correctness evaluation, we used a list of 8697 Arabic words grouped into 1606 conceptual classes. For similarity and strength evaluation, we used a list of 72,000 unique Arabic words. Conclusions about correctness, strength and similarity of the four Arabic stemming algorithms are reported. | ||
Statistics Article View: 183 PDF Download: 317 |