Infant Cry Classification with Graph Convolutional Networks

We propose an approach of graph convolutional networks for robust infant cry classification. We construct non-fully connected graphs with weighted edges based on the similarities among the relevant nodes and feed them into convolutional neural networks to consider the short-term and long-term effects of infant cry signals related to inner-class and inter-class effects. The approach captures the diversity of variations within infant cries and gives encouraging results in both supervised and semi-supervised node classification. The effectiveness of this approach is evaluated on Baby Chillanto database and Baby2020 database. With limited 20% of labeled training data, our model outperforms the CNN model with 80% of labeled training data and the accuracy stably improves as the number of labeled training samples increases. The best results give significant improvements of 7.36% and 3.59% compared with the results of the CNN models on Baby Chillanto database and Baby2020 database, respectively.

[1]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[2]  Wei Zhao,et al.  The Methods of Realizing Baby Crying Recognition and Intelligent Monitoring Based on DNN-GMM-HMM , 2018, 2018 2nd IEEE Advanced Information Management,Communicates,Electronic and Automation Control Conference (IMCEC).

[3]  Chuan-Yu Chang,et al.  Application of deep learning for recognizing infant cries , 2016, 2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW).

[4]  Aurelio Uncini,et al.  Deep Region of Interest and Feature Extraction Models for Palmprint Verification Using Convolutional Neural Networks Transfer Learning , 2018, Applied Sciences.

[5]  Yoshua Bengio,et al.  How Transferable Are Features in Convolutional Neural Network Acoustic Models across Languages? , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Vasudev Singh,et al.  Graph Representation learning for Audio & Music genre Classification , 2019, ArXiv.

[7]  Sergio Daniel Cano-Ortiz,et al.  Evolutionary-Neural System to Classify Infant Cry Units for Pathologies Identification in Recently Born Babies , 2008, 2008 Seventh Mexican International Conference on Artificial Intelligence.

[8]  Chakib Tadj,et al.  A fully automated approach for baby cry signal segmentation and boundary detection of expiratory and inspiratory episodes , 2017, The Journal of the Acoustical Society of America.

[9]  Rong Tong,et al.  Transfer learning for children's speech recognition , 2017, 2017 International Conference on Asian Language Processing (IALP).

[10]  M. Hariharan,et al.  Automatic Infant Cry Pattern Classification for a Multiclass Problem , 2016 .

[11]  Oscar Beijbom,et al.  Transfer Learning and Deep Feature Extraction for Planktonic Image Data Sets , 2017, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[12]  Lin Zhong,et al.  Distinguishing infant cry from adult voice based on spectrum analysis , 2012, 2012 International Conference on Audio, Language and Image Processing.

[13]  Yong Qin,et al.  Few-Shot Audio Classification with Attentional Graph Neural Networks , 2019, INTERSPEECH.

[14]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Monica Dascalu,et al.  Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs , 2018, 2018 41st International Conference on Telecommunications and Signal Processing (TSP).

[16]  Yi Pan,et al.  Using Transfer Learning, SVM, and Ensemble Classification to Classify Baby Cries Based on Their Spectrogram Images , 2019, 2019 IEEE 16th International Conference on Mobile Ad Hoc and Sensor Systems Workshops (MASSW).

[17]  Horia Cucu,et al.  Automatic methods for infant cry classification , 2016, 2016 International Conference on Communications (COMM).

[18]  Zhiyuan Liu,et al.  Graph Neural Networks: A Review of Methods and Applications , 2018, AI Open.