Significance of Higher-Order Spectral Analysis in Infant Cry Classification

In this paper, higher-order spectral analysis is applied to infant cry signals for classification of normal infant cries from pathological infant cries. From the family of higher-order spectra, bispectrum is considered for the proposed task. Bispectrum is the Fourier transform of the third-order cumulant function. To extract features from the bispectrum, application of higher-order singular value decomposition theorem is proposed. Experimental results show the average classification accuracy of $${82.44} \pm {4.03}{ \%}$$82.44±4.03% and Matthew’s correlation coefficient (MCC) of 0.62 with proposed bispectrum features. In all of the experiments reported in this paper, support vector machine with radial basis function kernel is used as the pattern classifier. Performance of the proposed features is also compared with the state-of-the-art methods such as linear frequency cepstral coefficients, Mel frequency cepstral coefficients, perceptual linear prediction coefficients, linear prediction coefficients, linear prediction cepstral coefficients and perceptual linear prediction cepstral coefficients, and is found to be better than that given by these feature sets. The proposed bispectrum-based features are shown to be robust under signal degradation or noisy conditions at various SNR levels. Performance in the presence of noise is compared with the state-of-the-art spectral feature sets using MCC scores. In addition, effectiveness of cryunit segmentation in normal and pathological infant cry classification task is reported.

[1]  Luis Carlos Altamirano,et al.  On the implementation of a method for automatic detection of infant cry units , 2012 .

[2]  H. E. Baeck,et al.  Study of acoustic features of newborn cries that correlate with the context , 2001, 2001 Conference Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[3]  Carlos A. Reyes García,et al.  Identifying Pain and Hunger in Infant Cry with Classifiers Ensembles , 2005, International Conference on Computational Intelligence for Modelling, Control and Automation and International Conference on Intelligent Agents, Web Technologies and Internet Commerce (CIMCA-IAWTIC'06).

[4]  Hemant A. Patil,et al.  Classification of normal and pathological infant cries using bispectrum features , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[5]  Alfred S. Malowany,et al.  Classification of infant cry vocalizations using artificial neural networks (ANNs) , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Peter F. Ostwald,et al.  The Communicative and Diagnostic Significance of Infant Sounds , 1985 .

[7]  C. L. Nikias,et al.  Signal processing with higher-order spectra , 1993, IEEE Signal Processing Magazine.

[8]  Wei Liu,et al.  Feature Extraction and Classification Based on Bispectrum for Underwater Targets , 2010, 2010 International Conference on Intelligent System Design and Engineering Application.

[9]  Dror Lederman,et al.  Classification of cries of infants with cleft-palate using parallel hidden Markov models , 2008, Medical & Biological Engineering & Computing.

[10]  A. S. Malowany,et al.  A crosscorrelation-based method for improved visualization of infant cry vocalizations , 1994, 1994 Proceedings of Canadian Conference on Electrical and Computer Engineering.

[11]  Marc Medale,et al.  The Very First Cry: A Multidisciplinary Approach toward a Model , 2012, The Annals of otology, rhinology, and laryngology.

[12]  M.R. Raghuveer,et al.  Bispectrum estimation: A digital signal processing framework , 1987, Proceedings of the IEEE.

[13]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[14]  Zheng Rong Yang,et al.  RONN: the bio-basis function neural network technique applied to the detection of natively disordered regions in proteins , 2005, Bioinform..

[15]  Joos Vandewalle,et al.  A Multilinear Singular Value Decomposition , 2000, SIAM J. Matrix Anal. Appl..

[16]  Rabab K. Ward,et al.  Automatic infant cry analysis and recognition , 1993 .

[17]  Rabab Kreidieh Ward,et al.  Automatic Assessment of Infants' Levels-of-Distress from the Cry Signals , 1996, IEEE Trans. Speech Audio Process..

[18]  Hemant A. Patil,et al.  Data collection and corpus design for analysis of nonnal and pathological infant cry , 2013, 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE).

[19]  Yi Liu,et al.  Feature Extraction of Lung Sounds Based on Bispectrum Analysis , 2010, 2010 Third International Symposium on Information Processing.

[20]  Rohilah Sahak,et al.  Mel-frequency cepstrum coefficient analysis of infant cry with hypothyroidism , 2009, 2009 5th International Colloquium on Signal Processing & Its Applications.

[21]  H. A. Patil,et al.  Analysis of normal and pathological infant cries using bispectrum features derived using HOSVD , 2015, 2015 International Conference on BioSignal Analysis, Processing and Systems (ICBAPS).

[22]  Joseph Soltis,et al.  The signal functions of early infant crying , 2004, Behavioral and Brain Sciences.

[23]  Cynthia A. Stifter,et al.  Crying Behaviour and its Impact on Psychosocial Child Development , 2005 .

[24]  Rahul Gupta,et al.  Pathological speech processing: State-of-the-art, current challenges, and future directions , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[25]  A. S. Malowany,et al.  A robust and accurate cross-correlation-based fundamental frequency (F/sub 0/) determination method for the improved analysis of infant cries , 1995, Proceedings of 17th International Conference of the Engineering in Medicine and Biology Society.

[26]  Sazali Yaacob,et al.  Analysis of Infant Cry Through Weighted Linear Prediction Cepstral Coefficients and Probabilistic Neural Network , 2012, Journal of Medical Systems.

[27]  H. A. Patil,et al.  “Cry Baby”: Using Spectrographic Analysis to Assess Neonatal Health Status from an Infant’s Cry , 2010 .