A Multi-class Obesity Risk Prediction Using Machine Learning and Explainable Artificial Intelligence

Authors

  • Tarequl Hasan Sakib CSE Department, Chittagong University of Engineering and Technology, Bangladesh
  • Mahfuzulhoq Chowdhury CSE Department, Chittagong University of Engineering and Technology, Bangladesh https://orcid.org/0000-0002-3006-4596

DOI:

https://doi.org/10.47852/bonviewAIA62027134

Keywords:

obesity risk prediction, ensemble feature selection, hyperparameter tuning, XGBoost, explainable AI (XAI)

Abstract

Because of the intricate relationships between dietary practices, physical characteristics, and lifestyle, weight-related health issues have grown to be a global concern. The shortcomings of current machine learning (ML) techniques include inefficient feature selection, imbalanced datasets, a binary classification focus, decreased accuracy, and inadequate hyperparameter tweaking. This paper uses a clinically validated dataset of 1,638 patients from Bangladeshi healthcare facilities to provide a complete framework for ML-based multiclass obesity risk prediction in order to fill these gaps. The proposed approach combines 5-fold cross-validation with GridSearchCV for systematic hyperparameter tuning and ensemble feature selection. The Synthetic Minority Over-sampling Technique was used just on the training set to address class imbalance and guarantee balanced learning across the seven weight categories. The proposed XGBoost outperformed the other ML algorithms that were assessed for obesity risk prediction due to their high accuracy score of 95.4%. According to the results, the proposed method outperformed previous works by at least 5.18% in accuracy increase and 10.30% in F1-score gain. Furthermore, explainable Artificial Intelligence methods based on SHAP offer insights into the decision-making process through feature contributions.

 

Received: 8 August 2025 | Revised: 25 December 2025 | Accepted: 30 January 2026

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available in GitHub at  https://github.com/pymche/Machine-LearningObesity-Classification.

 

Author Contribution Statement

Tarequl Hasan Sakib: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation. Mahfuzulhoq Chowdhury: Conceptualization, Methodology, Validation, Writing – original draft, Writing – review & editing, Visualization, Supervision, Project administration.


Downloads

Published

2026-02-11

Issue

Section

Research Article

How to Cite

Sakib, T. H., & Chowdhury, M. (2026). A Multi-class Obesity Risk Prediction Using Machine Learning and Explainable Artificial Intelligence. Artificial Intelligence and Applications. https://doi.org/10.47852/bonviewAIA62027134