Infant cry classification using CNN – RNN

The study of infant cry recognition aims to identify what an infant needs through her cry. Different crying sound can give a clue to caregivers about how to response to the infant's needs. Appropriate responses on infant cry may influence emotional, behavioral, and relational development of infant while growing up. From a pattern recognition perspective, recognizing particular needs or emotions from an infant cry is much more difficult than recognizing emotions from an adult's speech because infant cry usually does not contain verbal information. In this paper, we study the problem of classifying five different types emotion or needs expressed by infant cry, namely hunger, sleepiness, discomfort, stomachache, and indications that the infant wants to burp. We propose a novel approach using a combination of Convolutional Neural Networks (CNN) and Recurrent Neural Networks (RNN) that acts as feature extraction and classifier method at once. Particularly, CNN learns salient features from raw spectrogram information and RNN learns temporal information of CNN obtained features. We also apply 5-folds cross-validation on 200 training data set and 50 validation data set. The model with the best weight is tested on 65 test set. Evaluation in Dunstan Baby Language dataset shows that our CNN-RNN model outperforms the previous method by average classification accuracy up to 94.97%. The encouraging result demonstrates that the application of CNN-RNN and 5-folds cross-validation offers accurate and robust result.

[1]  Yuexian Zou,et al.  Investigation on Joint Representation Learning for Robust Feature Extraction in Speech Emotion Recognition , 2018, INTERSPEECH.

[2]  Lichuan Liu,et al.  Infant cry language analysis and recognition: an experimental approach , 2019, IEEE/CAA Journal of Automatica Sinica.

[3]  Narissara Eiamkanitchat,et al.  Application of neuro-fuzzy approaches to recognition and classification of infant cry , 2014, TENCON 2014 - 2014 IEEE Region 10 Conference.

[4]  Sharifah Mumtazah Syed Ahmad,et al.  An accurate infant cry classification system based on continuos Hidden Markov Model , 2010, 2010 International Symposium on Information Technology.

[5]  Fillia Makedon,et al.  Deep Visual Attributes vs. Hand-Crafted Audio Features on Multidomain Speech Emotion Recognition , 2017, Comput..

[6]  Wootaek Lim,et al.  Speech emotion recognition using convolutional and Recurrent Neural Networks , 2016, 2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA).

[7]  V. M. Sardar An Automatic Infants Cry Detection Using Linear Frequency Cepstrum Coefficients(LFCC) , 2015 .

[8]  Sameena Bano,et al.  Decoding baby talk: A novel approach for normal infant cry signal classification , 2015, 2015 International Conference on Soft-Computing and Networks Security (ICSNS).

[9]  Chastine Fatichah,et al.  Application development for recognizing type of infant's cry sound , 2016, 2016 International Conference on Information & Communication Technology and Systems (ICTS).

[10]  Xuetian Wang,et al.  Speech Emotion Recognition Using Convolutional- Recurrent Neural Networks with Attention Model , 2017 .

[11]  A. Murray Infant crying as an elicitor of parental behavior: an examination of two models. , 1979, Psychological bulletin.

[12]  Che-Wei Huang,et al.  Deep convolutional recurrent neural network with attention mechanism for robust speech emotion recognition , 2017, 2017 IEEE International Conference on Multimedia and Expo (ICME).

[13]  Rick Caulfield Social and emotional development in the first two years , 1996 .

[14]  Monica Dascalu,et al.  Testing the Universal Baby Language Hypothesis - Automatic Infant Speech Recognition with CNNs , 2018, 2018 41st International Conference on Telecommunications and Signal Processing (TSP).

[15]  Ma Ning,et al.  Pitch Analysis of Infant Crying , 2013 .

[16]  Nattawoot Suwannata,et al.  The Features Extraction of Infants Cries by Using Discrete Wavelet Transform Techniques , 2016 .

[17]  Lichuan Liu,et al.  Infant cry signal detection, pattern extraction and recognition , 2018, 2018 International Conference on Information and Computer Technologies (ICICT).

[18]  Wan Khairunizam,et al.  A review: survey on automatic infant cry analysis and classification , 2018, Health and Technology.

[19]  Premanand K. Kadbe,et al.  System propose for Be acquainted with newborn cry emotion using linear frequency cepstral coefficient , 2016, 2016 International Conference on Electrical, Electronics, and Optimization Techniques (ICEEOT).

[20]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.