Optimized Ensemble Machine-Learning-Driven Transparent Thyroid Cancer Prediction Using Explainable Artificial Intelligence

Authors

  • Syed Younus Ali Department of Computer Sciences, Green International University, Pakistan
  • Bilal Shoaib Khan Department of Computer Sciences, Green International University, Pakistan
  • Abdul Hanan Khan Department of Computer Sciences, Green International University, Pakistan
  • Muhammad Adnan Khan School of Computing, Horizon University College, United Arab Emirates and Applied Science Research Center, Applied Science Private University, Jordan
  • Asghar Ali Shah Department of Computer Science, Kateb University, Afghanistan
  • Sagheer Abbas Department of Computer Science, Prince Mohammad Bin Fahd University, Saudi Arabia
  • Khan Muhammad Adnan Department of Software, Gachon University, Republic of Korea

DOI:

https://doi.org/10.47852/bonviewJCCE52026503

Keywords:

thyroid cancer prediction, Explainable Artificial Intelligence (XAI), ensemble learning, soft-voting, machine learning, computational intelligence

Abstract

Thyroid cancer disease diagnosis is a critical medical challenge, requiring accurate and reliable predictions to support clinical decision-making. Patients may suffer from an incomplete diagnosis while using traditional machine learning models. In this study, we present an optimized ensemble machine-learning framework for predicting thyroid cancer disease. The methodology integrates multiple classifiers, including support vector machine, random forest, Naïve Bayes, K-nearest neighbors, and decision tree. The final classification decision is determined with the help of soft voting by a total predictive support vector machine, which selects the classifier with the highest confidence score among the ensemble models. The ensemble strategy enhances predictive accuracy and robustness by combining the strengths of individual classifiers. The model was trained and evaluated, achieving an impressive accuracy of 0.9633 and an area under the receiver operating characteristic curve of 0.9914. The proposed method of this study is very accurate, but there is still a black box problem. To overcome this issue and to ensure interpretability, Explainable Artificial Intelligence techniques, including Shapley Additive Explanations and Local Interpretable Model-agnostic Explanations, are implemented, providing insights into feature contributions towards the performance of the proposed method and model decisions. The dataset contains a total of 30 features with 3,772 different cases consisting of two classes.

 

Received: 16 June 2025 | Revised: 14 August 2025 | Accepted: 24 August 2025

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/bidemiayinde/thyroid-sickness-determination. The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/bhargavchirumamilla/thyroid-cancer-risk-dataset. The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/sikandaraidev/thyroid-dataset.

 

Author Contribution Statement

Syed Younus Ali: Conceptualization, Software, Data curation, Writing – original draft, Writing – review & editing. Bilal Shoaib Khan: Conceptualization, Methodology, Resources, Writing – original draft, Writing – review & editing, Visualization, Supervision. Abdul Hanan Khan: Software, Validation. Muhammad Adnan Khan: Formal analysis, Investigation. Asghar Ali Shah: Validation, Investigation. Sagheer Abbas: Formal analysis, Project administration. Khan Muhammad Adnan: Methodology, Resources.


Downloads

Published

2025-10-17

Issue

Section

Research Articles

How to Cite

Ali, S. Y., Khan, B. S., Khan, A. H., Khan, M. A., Shah, A. A., Abbas, S., & Adnan, K. M. (2025). Optimized Ensemble Machine-Learning-Driven Transparent Thyroid Cancer Prediction Using Explainable Artificial Intelligence. Journal of Computational and Cognitive Engineering. https://doi.org/10.47852/bonviewJCCE52026503