A Comparative Analysis of Machine Learning Techniques and Explainable AI on Voice Biomarkers for Effective Parkinson’s Disease Prediction
Abstract
Parkinson's disease (PD) is a neurological movement disorder that affects millions of people globally yet remains difficult to diagnose. Early diagnosis can lead to more effective treatment and improved patient outcomes. Diagnosis through traditional methods is subjective and often lacks transparency, raising concerns about reliability. In this study, the CRISP-DM framework was applied to compare eight machine learning (ML) algorithms, including Random Forest and Support Vector Machine (SVM). The data were preprocessed and balanced, and Recursive Feature Elimination (RFE) was used to identify the eight most predictive vocal features. On the UCI Parkinson’s Speech Dataset, which contains 195 voice recordings from 31 individuals (23 with PD and 8 healthy controls), Random Forest (Entropy) achieved the best performance (F₁ = 96.6%, ROC AUC = 0.98). Explainable AI tools (SHAP and LIME) were integrated, allowing both global and instance-level understanding of model predictions and identifying measures of pitch variability (MDVP:RAP, spread1, PPE) as key predictors of PD. This research contributes to the practical deployment of reliable, transparent PD prediction tools in real-world medical settings, supporting early diagnosis and improved patient care. The findings also underline the urgent need to detect PD early among Africa's aging populations, to help protect the cultural heritage carried in the voices of the elders. Future work should validate these findings on more varied cohorts, integrate additional data modalities (e.g., gait, imaging), and enhance model robustness. Ultimately, tools based on real-time speech analysis could enable remote screening, early intervention, and tailored care.
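The two headline metrics reported above, the F₁ score and ROC AUC, can be illustrated with a minimal, standard-library Python sketch. This is not the study's evaluation code: the function names `f1_score` and `roc_auc`, the 0.5 decision threshold, and the six toy labels and scores are assumptions made purely for illustration, and the numbers are unrelated to the UCI dataset.

```python
from typing import Sequence


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 for the positive class (label 1): 2*TP / (2*TP + FP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the fraction of (positive, negative) pairs in which the
    positive sample receives the higher score; ties count as half a win."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = PD, 0 = healthy control (not from the study).
y_true = [1, 1, 1, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.5, 0.2]           # assumed model probabilities
y_pred = [1 if s >= 0.5 else 0 for s in scores]   # assumed 0.5 threshold

print(f"F1 = {f1_score(y_true, y_pred):.3f}")
print(f"ROC AUC = {roc_auc(y_true, scores):.3f}")
```

F₁ depends on the chosen decision threshold, whereas ROC AUC summarises ranking quality across all thresholds, which is one reason the two figures are usually reported together for an imbalanced sample such as 23 PD participants versus 8 controls.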


Copyright (c) 2025 Journal of Information Systems and Informatics

This work is licensed under a Creative Commons Attribution 4.0 International License.