Hybrid AI Ensemble and Blockchain-Based Chatbot for Decentralized Toddler Nutritional Status Classification
DOI:
https://doi.org/10.47852/bonviewJCCE52026415Keywords:
chatbot, AI ensemble, Ethereum, healthcare, IPFS, CID, SHA-256Abstract
The accurate and timely classification of toddlers' nutritional status is critical for early intervention, particularly in remote or underserved communities with limited access to healthcare professionals. However, data security, especially for children's health data, is equally essential to ensure safe storage and access. To address these challenges, this study proposes a hybrid AI-powered chatbot that integrates ensemble learning, blockchain, and decentralized storage to support both nutritional status classification and educational interaction. The system combines a random forest model for classification with GPT-3.5 Turbo for bilingual (Indonesian–English) stunting education deployed via Telegram. Preprocessing includes standardizing, normalizing, and encoding Indonesian-language nutrition data to ensure machine learning readiness. Six ensemble algorithms are evaluated using stratified five-fold cross-validation, with classification results hashed using SHA-256 and immutably stored on the Interplanetary File System (IPFS) and a local Ethereum blockchain. The chatbot effectively manages both structured inputs and natural language queries, ensuring secure, transparent, and real-time nutritional assessments. Results demonstrate high classification performance, with the random forest model achieving the highest mean F1-score (0.9987) and the lowest deviation. Its robustness was validated by a 20% hold-out test set and stratified five-fold cross-validation, which obtained excellent balanced performance across nutritional status categories (F1-macro, precision, recall, accuracy ≈ 0.99; ROC AUC = 1.00). External validation also yielded robust and consistent results (F1-macro = 0.97, precision = 0.97, recall = 0.96, ROC AUC = 0.98, and accuracy = 0.97), demonstrating the model's generalization ability and mitigating concerns regarding overfitting. Blockchain evaluation confirmed stable and linear CID transaction throughput (blocks 29–46) with no observed latency, ensuring reliable and continuous data recording. Furthermore, gas prices decreased by ~87.5%, highlighting significant improvements in cost efficiency and scalability, which reinforces blockchain's feasibility for decentralized, AI-driven health data management.
Received: 9 June 2025 | Revised: 29 September 2025 | Accepted: 31 October 2025
Conflicts of Interest
The authors declare that they have no conflicts of interest to this work.
Data Availability Statement
The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/rendiputra/stunting-balita-detection-121k-rows and https://www.kaggle.com/datasets/jabirmuktabir/stunting-wasting-dataset.
Author Contribution Statement
Wa Ode Siti Nur Alam: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization, Project administration. Riri Fitri Sari: Conceptualization, Writing – review & editing, Supervision, Funding acquisition.
Metrics
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Authors

This work is licensed under a Creative Commons Attribution 4.0 International License.
How to Cite
Funding data
-
Universitas Indonesia
Grant numbers PKS-243/UN2.RST/HKP.05.00/2025