SMOTE-RUS : Combined Oversampling and Undersampling Technique to Classify the Imbalanced Autism Spectrum disorder dataset | ||||
International Journal of Intelligent Computing and Information Sciences | ||||
Volume 23, Issue 3, September 2023, Page 83-94 PDF (821.01 K) | ||||
Document Type: Original Article | ||||
DOI: 10.21608/ijicis.2023.216833.1278 | ||||
View on SCiNiTO | ||||
Authors | ||||
Eman ismail 1; Walaa Gad 2; Mohamed Hashem3 | ||||
1Information Systems Department, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt | ||||
2Information Systems Department, Faculty of Computer and Information Sciences, Ain Shams University, Cairo, Egypt | ||||
3Department of Information Systems, Faculty of Computers and Information Sciences, Ain Shams University,Cairo , Egypt | ||||
Abstract | ||||
The imbalanced distribution of classes is a common issue in almost classification problems. Therefore, we must be familiar with class-imbalanced techniques to handle this problem. Autism spectrum disorder(ASD) disease affects the development of the brain. Therefore, patients with autism have some limitations to interact with others on the social level. So, it is necessary to predict the genes related to ASD for early diagnosis and treatment. Recent studies utilize different machine learning techniques to predict ASD genes that suffer from the imbalanced ASD dataset problem. In this paper, recent ASD gene prediction models are utilized to compare different techniques influence using undersampling and oversampling algorithms on the model performance. Moreover, a new combined technique(SMOTE-RUS) is proposed using Synthetic Oversampling Technique(SMOTE) and random undersampling(RUS) technique to solve the imbalanced dataset problem. SMOTE-RUS is used to build an effective model to predict ASD genes. The proposed technique results prove that it is effective to get a more robust gene prediction model. Moreover, it outperforms other models using a single resampling technique. | ||||
Keywords | ||||
Oversampling; Undersampling; SMOTE; Gene prediction; Class imbalance problem | ||||
Statistics Article View: 164 PDF Download: 270 |
||||