Modified Dwarf Mongoose Optimization for Feature Selection in Imbalanced Student Performance Prediction Tasks
Abstract
Student performance prediction through Educational Data Mining (EDM) methods has become increasingly critical to educational decision-making and intervention. But educational datasets are high-dimensional and imbalanced, presenting serious problems for standard machine learning models. This paper presents an innovative feature selection methodology based on the Modified Dwarf Mongoose Optimization (MDMO), an enhanced version of standard DMO by adding three essential components: adaptive alpha guidance, scout-based diversity, and enhanced babysitter exchange criteria. These modifications boost the exploration-exploitation balance and prevent premature convergence, enabling more efficient search in high-dimensional binary feature spaces. The proposed MDMO is integrated as a wrapper method with five popular classifiers, LogitBoost, linear discriminant analysis, naive bayes, k-nearest neighbors, and decision trees, to form a robust predictive model for student performance. Furthermore, adaptive synthetic oversampling handles data imbalance. Based on real-world student datasets, experimental results indicate that MDMO outperforms other metaheuristic solutions with high classification accuracy and optimal feature reduction. This research illustrates the potential of behavior-inspired role-based swarm intelligence for tackling knotty optimization problems in education.
Full Text:
PDFReferences
S. M. Dol and P. M. Jawandhiya, "Systematic review and analysis of EDM for predicting the academic performance of students," Journal of The Institution of Engineers (India): Series B, vol. 105, no. 4, pp. 1021-1071, 2024.
M. Shoaib, N. Sayed, J. Singh, J. Shafi, S. Khan, and F. Ali, "AI student success predictor: Enhancing personalized learning in campus management systems," Computers in Human Behavior, vol. 158, p. 108301, 2024.
G. Feng and H. Chen, "Educational process mining: A study using a public educational data set from a machine learning repository," Education and Information Technologies, vol. 30, no. 6, pp. 8187-8214, 2025.
M. S. Ahammad, S. A. Sinthia, M. M. Ahmed, F. Y. Mou, M. N. A. Ikram, and M. Ghosh, "Sentiment Analysis of User Generated Reviews of Online Education Applications Using Deep Learning and Transformer Learning Algorithms," in 2024 15th International Conference on Computing Communication and Networking Technologies (ICCCNT), 2024: IEEE, pp. 1-7.
A. Bolívar, V. García, R. Alejo, R. Florencia-Juárez, and J. S. Sánchez, "Data-centric solutions for addressing big data veracity with class imbalance, high dimensionality, and class overlapping," Applied Sciences, vol. 14, no. 13, p. 5845, 2024.
B. Alnasyan, M. Basheri, and M. Alassafi, "A Comprehensive Comparative Analysis of Deep Learning Models for Student Performance Prediction in Virtual Learning Environments: Leveraging the OULA Dataset and Advanced Resampling Techniques," IEEE Access, 2025.
V. Balachandar and K. Venkatesh, "A multi-dimensional student performance prediction model (MSPP): An advanced framework for accurate academic classification and analysis," MethodsX, vol. 14, p. 103148, 2025.
F. Han, Y.-H. Wang, and F.-Y. Li, "A novel feature selection method based on adaptive search particle swarm optimization," Neural Computing and Applications, vol. 37, no. 12, pp. 7767-7783, 2025.
Y. Xue, H. Zhu, J. Liang, and A. Słowik, "Adaptive crossover operator based multi-objective binary genetic algorithm for feature selection in classification," Knowledge-Based Systems, vol. 227, p. 107218, 2021.
W. Ma, X. Zhou, H. Zhu, L. Li, and L. Jiao, "A two-stage hybrid ant colony optimization for high-dimensional feature selection," Pattern Recognition, vol. 116, p. 107933, 2021.
M. H. Nadimi-Shahraki, H. Zamani, and S. Mirjalili, "Enhanced whale optimization algorithm for medical feature selection: A COVID-19 case study," Computers in biology and medicine, vol. 148, p. 105858, 2022.
J. O. Agushaka, A. E. Ezugwu, and L. Abualigah, "Dwarf mongoose optimization algorithm," Computer methods in applied mechanics and engineering, vol. 391, p. 114570, 2022.
H. Turabieh et al., "Enhanced Harris Hawks optimization as a feature selection for the prediction of student performance," Computing, vol. 103, no. 7, pp. 1417-1438, 2021.
M. Kamal et al., "Metaheuristics method for classification and prediction of student performance using machine learning predictors," Mathematical Problems in Engineering, vol. 2022, no. 1, p. 2581951, 2022.
M. R. Apriyadi and D. P. Rini, "Hyperparameter optimization of support vector regression algorithm using metaheuristic algorithm for student performance prediction," International Journal of Advanced Computer Science and Applications, vol. 14, no. 2, 2023.
X. Song, "Student performance prediction employing k-nearest neighbor classification model and meta-heuristic algorithms," Multiscale and Multidisciplinary Modeling, Experiments and Design, vol. 7, no. 4, pp. 4397-4412, 2024.
C. Ma, "Improving the Prediction of Student Performance by Integrating a Random Forest Classifier with Meta-Heuristic Optimization Algorithms," International Journal of Advanced Computer Science & Applications, vol. 15, no. 6, 2024.
H. Shou and Y. Lu, "Student Performance Evaluation Technique By Applying Support Vector Classification And Metaheuristic Algorithms On The SVC Model’s Reliability," Journal of Applied Science and Engineering, vol. 28, no. 3, pp. 653-666, 2025.
Z. Ye, Y. Yang, Y. Chen, and H. Chen, "Predicting Academic Performance Levels in Higher Education: A Data-Driven Enhanced Fruit Fly Optimizer Kernel Extreme Learning Machine Model," Journal of Bionic Engineering, pp. 1-23, 2025.
M. Li, C. Huang, D. Wang, Q. Hu, J. Zhu, and Y. Tang, "Improved randomized learning algorithms for imbalanced and noisy educational data classification," Computing, vol. 101, pp. 571-585, 2019.
S. Alraddadi, S. Alseady, and S. Almotiri, "Prediction of students academic performance utilizing hybrid teaching-learning based feature selection and machine learning models," in 2021 International Conference of Women in Data Science at Taif University (WiDSTaif), 2021: IEEE, pp. 1-6.
T. Thaher and R. Jayousi, "Prediction of student’s academic performance using feedforward neural network augmented with stochastic trainers," in 2020 IEEE 14th International Conference on Application of Information and Communication Technologies (AICT), 2020: IEEE, pp. 1-7.
DOI: https://doi.org/10.31449/inf.v49i22.10148
This work is licensed under a Creative Commons Attribution 3.0 License.








