PREPROCESSING THE EGYPTIAN ARABIC DIALECT FOR PERSONALITY TRAITS PREDICTION | ||||
International Journal of Intelligent Computing and Information Sciences | ||||
Article 1, Volume 19, Issue 1, June 2019, Page 1-12 PDF (416.98 K) | ||||
Document Type: Original Article | ||||
DOI: 10.21608/ijicis.2019.62603 | ||||
View on SCiNiTO | ||||
Authors | ||||
marwa salim; sally Saad; mostafa aref | ||||
Department of Computer Sciences, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt. | ||||
Abstract | ||||
Each individual has his own distinct character, making his own decisions which is based on his personality. Researchers in computer science field have tried to reach a model for extracting personality traits relying on user’s profiles on social network sites as an input. Content created by users such as text posts, photos and shared activities in social network sites are considered as a huge source of data. Regarding user-created text, it has been proved that text pre-processing has a great impact if was applied to text before using it in research. In this paper, the effect of pre-processing (stemming and stop word removal) and adding numerical features is tested on the performance of Arabic personality prediction using AraPersonality dataset, which yielded 3.0% and 6.7% overall improvement to baseline experiments in binary representation and multiclass representation respectively | ||||
Keywords | ||||
Personality Recognition; social media; AraPersonality Dataset; Stop Word; Stemming | ||||
Statistics Article View: 508 PDF Download: 620 |
||||