A Predictive Framework Based on Students' Academic Performance in Higher Education | ||||
Artificial Intelligence Information Security | ||||
Volume 3, Issue 7, February 2025, Page 241-271 PDF (1.08 MB) | ||||
Document Type: Refereed research papers. | ||||
DOI: 10.21608/aiis.2025.361760.1018 | ||||
![]() | ||||
Authors | ||||
Mohamed Ali Elhayes ![]() | ||||
1Faculty of Business, Economics & Information Systems Misr University for Science& Technology | ||||
2Giza Higher Institute for Managerial Sciences, Tomah | ||||
3Giza Higher Institute of Management Sciences. | ||||
Abstract | ||||
This research proposes a predictive framework for assessing the academic performance of students in higher education, leveraging data mining and machine learning techniques. The study addresses the challenge of imbalanced datasets, which often skew the performance of classification models, by employing the Synthetic Minority Oversampling Technique (SMOTE) to enhance prediction accuracy. The framework integrates various supervised learning algorithms, including J48, Random Forest (RF), and K-Nearest Neighbors (KNN), to predict student performance and recommend suitable academic paths based on historical data. The data set, collected from the Higher Institute for Management Sciences, spans multiple academic years, and includes student records from three departments: Information Systems, Management, and Accounting. The research demonstrates that handling class imbalance through SMOTE significantly improves model performance, with the Random Forest classifier achieving the highest accuracy of 93.70%. The study also highlights the importance of feature selection, data preprocessing, and normalization in optimizing predictive outcomes. The findings underscore the potential of educational data mining to support early academic interventions and personalized guidance, aiding students in making informed decisions about their academic trajectories. Future work will explore additional sampling techniques and expand the dataset to further enhance model accuracy. | ||||
Keywords | ||||
Students performance; SMOTE; Educational data mining; Data mining algorithms | ||||
Statistics Article View: 870 PDF Download: 1,261 |
||||