Hybrid AI Ensemble and Blockchain-Based Chatbot for Decentralized Toddler Nutritional Status Classification

Authors

DOI:

https://doi.org/10.47852/bonviewJCCE52026415

Keywords:

chatbot, AI ensemble, Ethereum, healthcare, IPFS, CID, SHA-256

Abstract

The accurate and timely classification of toddlers' nutritional status is critical for early intervention, particularly in remote or underserved communities with limited access to healthcare professionals. However, data security, especially for children's health data, is equally essential to ensure safe storage and access. To address these challenges, this study proposes a hybrid AI-powered chatbot that integrates ensemble learning, blockchain, and decentralized storage to support both nutritional status classification and educational interaction. The system combines a random forest model for classification with GPT-3.5 Turbo for bilingual (Indonesian–English) stunting education deployed via Telegram. Preprocessing includes standardizing, normalizing, and encoding Indonesian-language nutrition data to ensure machine learning readiness. Six ensemble algorithms are evaluated using stratified five-fold cross-validation, with classification results hashed using SHA-256 and immutably stored on the Interplanetary File System (IPFS) and a local Ethereum blockchain. The chatbot effectively manages both structured inputs and natural language queries, ensuring secure, transparent, and real-time nutritional assessments. Results demonstrate high classification performance, with the random forest model achieving the highest mean F1-score (0.9987) and the lowest deviation. Its robustness was validated by a 20% hold-out test set and stratified five-fold cross-validation, which obtained excellent balanced performance across nutritional status categories (F1-macro, precision, recall, accuracy ≈ 0.99; ROC AUC = 1.00). External validation also yielded robust and consistent results (F1-macro = 0.97, precision = 0.97, recall = 0.96, ROC AUC = 0.98, and accuracy = 0.97), demonstrating the model's generalization ability and mitigating concerns regarding overfitting. Blockchain evaluation confirmed stable and linear CID transaction throughput (blocks 29–46) with no observed latency, ensuring reliable and continuous data recording. Furthermore, gas prices decreased by ~87.5%, highlighting significant improvements in cost efficiency and scalability, which reinforces blockchain's feasibility for decentralized, AI-driven health data management.

 

Received: 9 June 2025 | Revised: 29 September 2025 | Accepted: 31 October 2025

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available in Kaggle at https://www.kaggle.com/datasets/rendiputra/stunting-balita-detection-121k-rows and https://www.kaggle.com/datasets/jabirmuktabir/stunting-wasting-dataset.

 

Author Contribution Statement

Wa Ode Siti Nur Alam: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization, Project administration. Riri Fitri Sari: Conceptualization, Writing – review & editing, Supervision, Funding acquisition.


Metrics

Metrics Loading ...

Downloads

Published

2025-12-16

Issue

Section

Research Articles

How to Cite

Alam, W. O. S. N., & Sari, R. F. . (2025). Hybrid AI Ensemble and Blockchain-Based Chatbot for Decentralized Toddler Nutritional Status Classification. Journal of Computational and Cognitive Engineering. https://doi.org/10.47852/bonviewJCCE52026415

Funding data