Weakly Supervised Detection of Baby Cry

Authors

  • Weijun Tan LinkSprite Technologies, USA and Deepcam Information Technologies, China https://orcid.org/0000-0003-2344-0773
  • Qi Yao Deepcam Information Technologies, China
  • Jingfeng Liu Deepcam Information Technologies, China

DOI:

https://doi.org/10.47852/bonviewAIA42022164

Keywords:

baby cry, multiple instance learning, audio classification, anomaly detection

Abstract

Detection of baby cry is an important task of baby monitoring application. Effective and real-time detection of baby detection makes the baby well cared for while releasing the care giver’s pressure. Almost all existing methods for detection of baby cry use supervised support vector machines, CNN, or their varieties. In this work, we propose to use weakly supervised anomaly detection to detect baby cry in which baby cry is detected as an anomalous audio event. In this weak supervision framework, we only need weak annotation of if there is a cry in an audio file. We design a data mining technique using the pre-trained VGGish feature extractor and an anomaly detection network to obtain short audio files from long untrimmed audio files. The obtained dataset is used to train a delicately designed super lightweight CNN for cry/non-cry classification. This CNN is then used as a feature extractor in an anomaly detection framework to achieve better cry detection performance on untrimmed audio files or streams.

 

Received: 27 November 2023 | Revised: 18 February 2024 | Accepted: 10 May 2024

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available in (1) GitHub at GitHub - giulbia/baby_cry_detection: Recognition of baby cry audio signal, https://github.com/gveres/donateacry-corpus;(2) audioset at https://doi.org/10.1109/ICASSP.2017.7952261, reference [11]; (3) ESC 50 at https://doi.org/10.1145/2733373.2806390, reference [10].

 

Author Contribution Statement

Weijun Tan: Conceptualization, Methodology, Software, Validation, Formal analysis, Investigation, Data curation, Writing - original draft, Writing - review & editing, Visualization, Supervision, Project administration. Qi Yao: Resources, Data curation. Jingfeng Liu: Resources, Data curation, Project administration.


Metrics

Metrics Loading ...

Downloads

Published

2024-06-10

Issue

Section

Research Article

How to Cite

Tan, W., Yao, Q., & Liu, J. (2024). Weakly Supervised Detection of Baby Cry. Artificial Intelligence and Applications. https://doi.org/10.47852/bonviewAIA42022164