Advancing Clinical Diagnosis Through Comparative Analysis of Machine Learning and Transformer-Based Models for Hepatitis B Detection

Authors

  • Akinyemi Omololu Akinrotimi Department of Information Systems and Technology, Kings University, Nigeria https://orcid.org/0000-0002-0907-9769
  • Israel Oluwabusayo Omotosho Department of Management Information Systems, Bowie State University, USA
  • Olugbenga Olayinka Owolabi Department of Electrical and Electronics Engineering, Adeleke University, Nigeria
  • Oluwaseun Adewale Olubunmi Department of Computer Engineering, Federal University Oye-Ekiti, Nigeria https://orcid.org/0009-0005-8482-1029
  • Ibrahim Garba Department of Biological Sciences, Njala University, Sierra Leone https://orcid.org/0009-0005-4654-6126
  • Ndie Ngalame Dionysius Department of Biochemistry, University of Buea, Cameroon

DOI:

https://doi.org/10.47852/bonviewMEDIN62027119

Keywords:

clinical diagnosis, gradient boosted trees, hepatitis B, machine learning, tabular transformers

Abstract

Early and accurate diagnosis of hepatitis B remains a critical issue in low-resource healthcare settings, where reliance on straightforward laboratory tests might limit the sensitivity of diagnosis. This research explores the prediction of hepatitis B infection based on conventional biochemical markers and demographic data by four machine learning models: XGBoost, LightGBM, TabPFN, and TabKANet. The goals are twofold: first, to compare which model type is best, the Gradient Boosted Decision Trees (GBDTs) or transformer-based models, and second, to compare the feasibility of such models being included in clinical workflows. The models were evaluated against a real-world dataset of a Ghanaian hospital-based screening program. Performance was calculated in terms of accuracy, precision, recall, F1-score, and area under the curve (AUC). Results showed that transformer models, particularly TabKANet, outperformed traditional GBDT models in recall (97.0%), F1-score (65.5%), and AUC (0.92), with high ability to detect true positive cases. Confusion matrix analyses also confirmed the validity of TabKANet in minimizing false negatives, which is very much required in clinical diagnosis. These findings support the application of artificial intelligence-based technology in clinical laboratories as a second confirmatory system for disease diagnosis. The study also encourages researching the use of transformer models even more in healthcare, especially in settings where confirmatory testing is not accessible.


Received: 7 August 2025 | Revised: 23 December 2025 | Accepted: 20 January 2026


Conflicts of Interest
The authors declare that they have no conflicts of interest to this work.


Data Availability Statement
The dataset used in this study is publicly available and was accessed in compliance with its original ethical approvals and usage terms. The data can be accessed via the bioRxiv preprint server at https://doi.org/10.1101/2024.05.01.24306678. Access and use of the dataset were in accordance with bioRxiv's policies for public preprint content, ensuring ethical and permitted usage for research purposes.


Author Contribution Statement
Akinrotimi Akinyemi Omololu: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization, Supervision, Project administration. Omotosho Israel Oluwabusayo: Resources, Data curation. Owolabi Olugbenga Olayinka: Resources, Data curation. Olubunmi Oluwaseun Adewale: Validation, Resources, Data curation. Garba Ibrahim: Validation, Formal analysis, Investigation, Writing – review & editing. Dionysius Ndie Ngalame: Validation, Formal analysis, Investigation, Writing – review & editing.

Downloads

Published

2026-02-09

Issue

Section

Research Articles

How to Cite

Akinrotimi, A. O., Omotosho, I. O., Owolabi, O. O., Olubunmi, O. A., Garba, I., & Dionysius, N. N. (2026). Advancing Clinical Diagnosis Through Comparative Analysis of Machine Learning and Transformer-Based Models for Hepatitis B Detection. Medinformatics. https://doi.org/10.47852/bonviewMEDIN62027119