Ensemble Learning for Software Defect Prediction: Performance, Practicality and Future Directions

  • Bassey Isong North-West University, South Africa
  • Ekoro Igo University of Education and Entrepreneurship, Nigeria
Keywords: Software Defect Prediction, Ensemble Methods, Class Imbalance, Machine Learning

Abstract

Ensemble learning is a leading approach in software defect prediction (SDP), offering improved predictive performance on imbalanced and high-dimensional datasets. Despite growing research interest, persistent gaps remain in model interpretability, generalizability, and reproducibility, limiting its practical adoption. This paper presents a comprehensive analysis of 56 peer-reviewed studies published between 2020 and 2025, spanning both journal and conference venues. Findings show that ensemble methods, especially when combined with sampling, feature selection, or optimisation, consistently outperform single classifiers on important metrics such as F1-score, area under the curve, and Matthew correlation coefficient. Nonetheless, few studies incorporate explainability frameworks, effort-aware evaluation, or cross-project validation. Additionally, most models are static, rely on within-project testing, and depend on legacy datasets such as PROMISE and NASA, which limit external validity. Building on this synthesis, the review highlights future research priorities, including interpretable ensemble architectures, adaptive modelling, dynamic imbalance handling, semantic feature integration, and real-time prediction. Standardised benchmarks, transparent, scalable designs are recommended to bridge the gap between experimental performance and deployment-ready SDP solutions.

Downloads

Download data is not yet available.
Published
2025-09-25
Abstract views: 22 times
Download PDF: 19 times
How to Cite
Isong, B., & Igo, E. (2025). Ensemble Learning for Software Defect Prediction: Performance, Practicality and Future Directions. Journal of Information Systems and Informatics, 7(3), 2245-2291. https://doi.org/10.51519/journalisi.v7i3.1171
Section
Articles

Most read articles by the same author(s)