Weakly Supervised Detection of Baby Cry
DOI:
https://doi.org/10.47852/bonviewAIA42022164Keywords:
baby cry, multiple instance learning, audio classification, anomaly detectionAbstract
Detection of baby cry is an important part of baby monitoring. Almost all existing methods use supervised SVM, CNN, or their varieties. In this work, we propose to use weakly supervised anomaly detection to detect baby cry. In this weak supervision framework, we only need weak annotation if there is a cry in an audiofile. We design a data mining technique using the pre-trained VGGish feature extractor and an anomaly detection network on long untrimmed audiofiles. The obtained datasets are used to train a delicately designed super lightweight CNN for cry/non-cry classification. This CNN is then used as a feature extractor in an anomaly detection framework to achieve better cry detection performance.
Received: 27 November 2023 | Revised: 18 February 2024 | Accepted: 10 May 2024
Conflicts of Interest
The authors declare that they have no conflicts of interest to this work.
Data Availability Statement
The data that support the findings of this study are openly available at 1) https://github.com/giulbia/baby cry detection, reference number [11]; 2) audioSet at https://research.google.com/audioset/dataset/index.html, reference [10]; 3) ESC-50 at https://github.com/karolpiczak/ESC-50, reference [23]; 4) https://github.com/gveres/donateacry-corpus, reference [27].
Author Contribution Statement
Weijun Tan: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing - original draft, Writing - review & editing, Visualization, Supervision, Project administration. Qi Yao: Resources, Data curation. Jingfeng Liu: Resources, Data curation, Project administration.
Metrics
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Authors
This work is licensed under a Creative Commons Attribution 4.0 International License.