Leveraging a Hybrid Fuzzy C-Means-PCA Model for Identifying Speech Therapy Needs in Children
DOI:
https://doi.org/10.47852/bonviewJDSIS62028311Keywords:
Fuzzy C-Means, principal component analysis, acoustic–prosodic feature, speech therapy, developmental language disorderAbstract
This study proposes a hybrid unsupervised learning framework that integrates principal component analysis (PCA) and Fuzzy C-Means (FCM) to support early identification of speech therapy needs in children. The model utilizes secondary structured datasets containing acoustic–prosodic features such as MFCC, jitter, shimmer, harmonic-to-noise ratio, pitch, duration, fluency, and temporal indicators. PCA was applied to reduce dimensional redundancy and address the high-dimensional, low-sample-size characteristics of pediatric speech data, producing four principal components that retained 83.27% of the total variance. These components were subsequently clustered using FCM to capture partial membership patterns that reflect the continuous nature of children's speech deviations. The proposed PCA-FCM model achieved the best cluster compactness and separation, with a 25.6% improvement in the XieBeni (XB) index compared to the baseline FCM model (XB = 0.421). Three interpretable clusters including Normal, Mild Deviation, and Severe Deviation were identified, each associated with distinct acoustic–prosodic profiles. These findings demonstrate the potential of hybrid unsupervised learning to provide an objective, interpretable, and efficient early-screening mechanism for guiding personalized speech therapy interventions in children.
Received: 19 November 2025 | Revised: 16 April 2026 | Accepted: 11 May 2026
Conflicts of Interest
The authors declare that they have no conflicts of interest to this work.
Data Availability Statement
The analytical dataset supporting this study is publicly available in Figshare at https://doi.org/10.6084/M9.FIGSHARE.31410669.
Author Contribution Statement
Muhammad Rizal Haris: Conceptualization, Methodology, Validation, Investigation, Resources, Data curation, Writing – original draft, Writing – review & editing, Visualization. Muhammad Faisal: Conceptualization, Methodology, Software, Formal analysis, Investigation, Resources, Data curation, Writing – review & editing, Supervision. Rizki Yusliana Bakti: Methodology, Software, Validation, Data curation, Project administration. Muhyiddin AM Hayat: Validation, Formal analysis, Investigation, Resources, Project administration. Titik Khawa Abd Rahman: Conceptualization, Methodology, Software, Validation, Writing – review & editing, Project administration. Muhammad Syafaat S. Kuba: Validation, Writing – review & editing, Supervision. Titin Wahyuni: Data curation, Software, Project administration.Downloads
Published
Issue
Section
License
Copyright (c) 2026 Authors

This work is licensed under a Creative Commons Attribution 4.0 International License.