Feature Selection and Classification in Machine Learning: Methods and Models for Peer-to-Peer Lending | ||||
MSA-Management Sciences Journal | ||||
Volume 4, Issue 4, November 2025, Page 96-113 PDF (657.93 K) | ||||
DOI: 10.21608/msamsj.2025.405941.1121 | ||||
![]() | ||||
Authors | ||||
Markus Atef ![]() ![]() | ||||
1Faculty of Management Sciences, October University for Modern Sciences and Arts (MSA), Giza, Egypt | ||||
2Department of Information Systems, Faculty of Commerce and Business Administration, Helwan University, Cairo, Egypt | ||||
3Department of Business Administration, Faculty of Commerce and Business Administration, Helwan University, Cairo, Egypt | ||||
Abstract | ||||
Feature Selection (FS) is a pivotal technique in machine learning (ML) that improves predictive performance, model interpretability, and computational efficiency by reducing data dimensionality and isolating the most informative variables. In the dynamic environment of peer-to-peer (P2P) lending, FS is crucial for accurate credit risk assessment, borrower profiling, and loan default prediction. P2P platforms generate vast and heterogeneous datasets encompassing demographic, financial, behavioural, and transactional information, where redundant or irrelevant features can degrade model accuracy and scalability. This review provides a comprehensive examination of FS methodologies, including filter, wrapper, and embedded approaches, analysing their trade-offs in accuracy potential, computational cost, and interpretability. The study further explores classification models, supervised learning algorithms designed to predict borrower repayment behaviour, covering linear, non-linear, and tree-based ensembles widely applied in credit scoring. Classification methods address critical challenges in P2P lending, such as class imbalance, explainability, and the need for scalable, high-performing predictive systems. By synthesizing recent advances and practical applications, this review offers a structured guide for researchers and practitioners to select FS techniques and classification models aligned with P2P lending’s requirements. Emphasis is placed on optimizing predictive accuracy, enhancing interpretability, and supporting data-driven decision-making to strengthen credit evaluation processes, mitigate default risk, and promote sustainable growth across P2P lending platforms. | ||||
Keywords | ||||
Feature selection; machine learning; classification models; peer-to-peer lending; loan default analysis | ||||
Statistics Article View: 81 PDF Download: 75 |
||||