Weakly Supervised Detection of Baby Cry

Authors

  • Weijun Tan LinkSprite Technologies, USA and Deepcam Information Technologies, China https://orcid.org/0000-0003-2344-0773
  • Qi Yao Deepcam Information Technologies, China
  • Jingfeng Liu Deepcam Information Technologies, China

DOI:

https://doi.org/10.47852/bonviewAIA42022164

Keywords:

baby cry, multiple instance learning, audio classification, anomaly detection

Abstract

Detection of baby cry is an important part of baby monitoring. Almost all existing methods use supervised SVM, CNN, or their varieties. In this work, we propose to use weakly supervised anomaly detection to detect baby cry. In this weak supervision framework, we only need weak annotation if there is a cry in an audiofile. We design a data mining technique using the pre-trained VGGish feature extractor and an anomaly detection network on long untrimmed audiofiles. The obtained datasets are used to train a delicately designed super lightweight CNN for cry/non-cry classification. This CNN is then used as a feature extractor in an anomaly detection framework to achieve better cry detection performance.

 

Received: 27 November 2023 | Revised: 18 February 2024 | Accepted: 10 May 2024

 

Conflicts of Interest

The authors declare that they have no conflicts of interest to this work.

 

Data Availability Statement

The data that support the findings of this study are openly available at 1) https://github.com/giulbia/baby cry detection, reference number [11]; 2) audioSet at https://research.google.com/audioset/dataset/index.html, reference [10]; 3) ESC-50 at https://github.com/karolpiczak/ESC-50, reference [23]; 4) https://github.com/gveres/donateacry-corpus, reference [27].

 

 

Metrics

Metrics Loading ...

Downloads

Published

2024-06-10

How to Cite

Tan, W., Yao, Q., & Liu, J. (2024). Weakly Supervised Detection of Baby Cry. Artificial Intelligence and Applications. https://doi.org/10.47852/bonviewAIA42022164

Issue

Section

Research Article