Advancing Interpretable AI for CardiovascularRisk Assessment: A Stacking RegressionApproach in Clinical Data from Bangladesh

Authors

  • Suhana Tasnim Department of Life Sciences, Independent University Bangladesh, Bangladesh
  • Mohammad Mamun Department of Computer Science and Engineering, Jahangirnagar University and Department of Computer Science and Engineering, Bangladesh University, Bangladesh
  • Safiul Haque Chowdhury Department of Computer Science and Engineering, Jahangirnagar University, Bangladesh https://orcid.org/0009-0003-7098-2476
  • Mohammed Ibrahim Hussain Department of Computer Science and Engineering, Bangladesh University, Bangladesh
  • Muhammad Minoar Hossain Department of Computer Science and Engineering, Bangladesh University, Bangladesh

DOI:

https://doi.org/10.47852/bonviewMEDIN52027812

Keywords:

cardiovascular diseases, machine learning, ensemble model, Ridge Regressor, Theil-Sen Regressor, Gradient Boosting Regressor, explainable AI

Abstract

Cardiovascular diseases (CVDs) are complex conditions affecting a large portion of the global population, and their early, accurate, and timely prediction remains a significant challenge. Conventional CVD risk assessment often relies on limited parameters and fails to capture the complex interactions among genetic, lifestyle, and environmental factors. Recent machine learning studies have improved predictive performance; however, they often rely on small or retrospective datasets, lack real-time or external validation, and offer limited interpretability for clinical use. This study introduces a novel stacking ensemble framework that integrates Ridge Regression, Theil-Sen Regressor, and Gradient Boosting Regressor. To our knowledge, this is the first application of a regression-based stacking approach for CVD risk prediction that embeds explainable artificial intelligence as a core component, a combination rarely explored in low-resource healthcare contexts. Using a real-world dataset of 1,529 patients from Jamalpur Medical College Hospital, Bangladesh, the proposed model achieved 96% predictive accuracy, outperforming most existing methods. The dataset itself represents a rare contribution, as most prior studies rely on UCI, Framingham, or other benchmark repositories rather than contemporary hospital data from underrepresented populations. Through SHapley Additive exPlanations analysis, our model identifies BMI, diabetes, and blood pressure as the most influential factors, aligning with established medical knowledge and providing clinically actionable insights. Unlike prior black-box models, our framework not only improves prediction accuracy but also delivers transparent explanations that enhance trust and support public health decision-making. This integration of accuracy, explainability, and context-specific clinical insight underscores the novelty and practical relevance of our approach for advancing interpretable AI in CVD prediction, particularly in resource-limited healthcare settings. 

 

Received: 1 October 2025 | Revised: 10 November 2025 | Accepted: 30 November 2025

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

 

Author Contribution Statement

Suhana Tasnim: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization, Project administration. Mohammad Mamun: Supervision. Safiul Haque Chowdhury: Writing – review & editing, Supervision. Mohammed Ibrahim Hussain: Supervision. Muhammad Minoar Hossain: Supervision. 

 

 


Author Biography

  • Safiul Haque Chowdhury, Department of Computer Science and Engineering, Jahangirnagar University, Bangladesh
    Safiul Haque Chowdhury is a Bangladeshi computer researcher and engineer, renowned for his pioneering work in health care automated systems, including newborn weight prediction and liver disease diagnosis. He was born and raised in Dhaka, Bangladesh, and is currently working as a Machine Learning Coordinator at NuArca. Safiul's early interest in technology and programming led him to a career in computer science, where he has focused on Machine Learning, Deep Learning, Feature Engineering, Explainable AI, and Quantum Computing. He began his career as a Machine Learning Coordinator at Orion Informatics, where he leads the Machine Learning Content team working on the GPT system for NuArca USA. He is dedicated to advancing the field through his research and has authored multiple publications that contribute to the understanding and application of artificial intelligence. Passionate about leveraging AI to address real-world challenges, particularly in health care, he aims to create predictive models and automated systems to improve health outcomes. Through his work, he continues to bridge the gap between theoretical research and practical applications, making significant contributions to data science and artificial intelligence.

Downloads

Published

2025-12-12

Issue

Section

Research Articles

How to Cite

Tasnim, S., Mamun, M., Chowdhury, S. H., Hussain, M. I., & Hossain, M. M. (2025). Advancing Interpretable AI for CardiovascularRisk Assessment: A Stacking RegressionApproach in Clinical Data from Bangladesh. Medinformatics. https://doi.org/10.47852/bonviewMEDIN52027812