A Performance Evaluation of Microscopic Geological Datasets for Mineral Classification Using Machine and Deep Learning Techniques

Amer, Badr G.; Mousa, Hamdy M.; Dawoud, Maher; Youssef, Anas

doi:10.21608/ijci.2025.379424.1195

	A Performance Evaluation of Microscopic Geological Datasets for Mineral Classification Using Machine and Deep Learning Techniques
IJCI. International Journal of Computers and Information
Articles in Press, Accepted Manuscript, Available Online from 30 October 2025
Document Type: Original Article
DOI: 10.21608/ijci.2025.379424.1195
Authors
Badr G. Amer^* ¹; Hamdy M. Mousa²; Maher Dawoud³; Anas Youssef⁴
¹CS Dept., Faculty of Computers and Artificial Intelligence, Matrouh University
²CS Dept., Faculty of Computers and Information, Menoufia University
³Geology dep., faculty of science, Menofia University
⁴Computer Science, Faculty of Computers and Information, Menoufia University
Abstract
Classifying minerals from thin-section microscopic rock images plays a vital role in modern geoscience, supporting tasks ranging from petrographic analysis to mineral exploration and educational applications. While recent advances in machine and deep learning have enabled significant progress in automated image classification, the success of these models depends heavily on the quality and sufficiency of the training datasets. Despite the availability of several geological image datasets, questions remain about whether these datasets are adequate to support robust, generalizable artificial intelligence models in real-world geological contexts. This paper addresses a critical yet underexplored issue in this field, which is the sufficiency of the currently available thin-section image datasets for training reliable classification models. To investigate this issue, a set of widely used machine and deep learning models, namely: KNN, DT, SVM, CNN, and ANN, were applied to selected geological datasets, namely: Igneous and Metamorphic Dataset, Sedimentary Dataset, and Thin Section-1 Dataset, under fair experimental conditions. The results, which ranged from 3.4% to 73.7% accuracy, reveal notable limitations in the generalization performance of these models across different datasets, pointing to issues such as class imbalance, low image quality, and limited class diversity. The findings highlight the urgent need for developing a more comprehensive, balanced, and high-resolution geological dataset that can effectively support automated mineral classification systems. This study contributes to the foundational understanding of dataset sufficiency in geological AI applications and sets the stage for future work in dataset construction and model optimization.
Keywords
Mineral Classification; Microscopic Rock Images; Geological Datasets; Machine Learning; Deep Learning

Statistics Article View: 61