Decoding Baby Talk: Basic Approach for Normal Classification of Infant Cry Signal

The analysis of infant cry has become more prevalent due to advances in areas such as digital signal processing, pattern recognition and soft computing, and it has improved the ability of physicians to diagnose newborns correctly. This work presents an approach to decoding baby talk by classifying the infant cry signal. We use cry signals from normal (healthy) infants aged one day to six months. A healthy infant's cry has fixed attributes that allow it to be classified into five groups: Neh, Eh, Owh, Eairh and Heh. The cry signal is segmented using pitch frequency, and features are extracted as mel-frequency cepstral coefficients (MFCCs) in MATLAB. Statistical properties of the extracted MFCC features are calculated, and a k-nearest-neighbour (KNN) classifier is used to classify the cry signal. KNN is one of the most successful classifiers for audio data when temporal structure is not important. The study is based on five databases, one for each cry type: Neh, Eh, Owh, Eairh and Heh. Each database has 50 samples, of which 40 are used for training and 10 for testing. The classification results are Neh 80%, Eh 90%, Owh 80%, Eairh 90% and Heh 90%. Decoding baby talk supports a mother's built-in intuition about knowing and responding to her baby's needs, and helps physicians to treat infants early.

General Terms: Digital signal processing, pattern recognition, soft computing
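The classification stage described in the abstract (statistical summaries of MFCC frames, a KNN classifier, and a 40/10 train/test split per cry type) can be illustrated with a minimal sketch. It is written in Python using only the standard library, whereas the original work used MATLAB, so it is not the authors' implementation. The helper names (`summarize`, `knn_predict`, `fake_cry`, `run_experiment`), the choice of mean and standard deviation as the statistics, the value k = 3, the Euclidean distance, and the 13-coefficient frame size are all assumptions made for illustration. The function `fake_cry` produces synthetic noise in place of real MFCC frames, so the numbers it yields say nothing about real cries.

```python
import math
import random
from collections import Counter
from statistics import mean, pstdev

CRY_CLASSES = ["Neh", "Eh", "Owh", "Eairh", "Heh"]


def summarize(frames):
    """Collapse per-frame coefficient vectors into one feature vector:
    the per-coefficient means followed by the per-coefficient standard deviations."""
    columns = list(zip(*frames))
    return [mean(c) for c in columns] + [pstdev(c) for c in columns]


def knn_predict(train_x, train_y, query, k=3):
    """Return the majority label among the k training vectors closest to query
    (Euclidean distance; k=3 is an assumed value, not taken from the paper)."""
    ranked = sorted(zip(train_x, train_y), key=lambda pair: math.dist(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def fake_cry(class_index, rng, n_frames=20, n_coeffs=13):
    """Hypothetical stand-in for MFCC extraction: Gaussian noise centred on a
    class-specific offset. Replace with real MFCC frames in practice."""
    centre = 3.0 * class_index
    return [[rng.gauss(centre, 1.0) for _ in range(n_coeffs)] for _ in range(n_frames)]


def run_experiment(seed=0):
    """Build 50 samples per class, train on 40, test on 10, and return the
    percentage of correctly classified test samples for each class."""
    rng = random.Random(seed)
    train_x, train_y, test_x, test_y = [], [], [], []
    for index, label in enumerate(CRY_CLASSES):
        samples = [summarize(fake_cry(index, rng)) for _ in range(50)]
        train_x += samples[:40]
        train_y += [label] * 40
        test_x += samples[40:]
        test_y += [label] * 10

    per_class = {}
    for label in CRY_CLASSES:
        queries = [x for x, y in zip(test_x, test_y) if y == label]
        hits = sum(knn_predict(train_x, train_y, q) == label for q in queries)
        per_class[label] = 100.0 * hits / len(queries)
    return per_class


if __name__ == "__main__":
    for label, percent in run_experiment().items():
        print(f"{label}: {percent:.0f}%")
```

With 10 test samples per class, each per-class percentage moves in steps of 10 points, so the reported 80% and 90% correspond to 8 and 9 correctly classified test cries out of 10.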
