Classification of Emotional Speech of Children Using Probabilistic Neural Network

Child emotions are highly flexible and overlapping. The recognition is a difficult task when single emotion conveys multiple informations. We analyze the relevance and importance of these features and use that information to design classifier architecture. Designing of a system for recognition of children emotions with reasonable accuracy is still a challenge specifically with reduced feature set. In this paper, Probabilistic neural network (PNN) has been designed for such task of classification. PNN has faster training ability with continuous class probability density functions. It provides better classification even with reduced feature set. LP_VQC and pH vectors are used as the features for the classifier. It has been attempted to design the PNN classifier with these features. Various emotions like angry, bore, sad and happy have been considered for this piece of work. All these emotions have been collected from children in three different languages as English, Hindi, and Odia. Result shows remarkable classification accuracy for these classes of emotions. It has been verified in standard databse EMO-DB to validate the result.

[1]  Chung-Hsien Wu,et al.  Emotion recognition using acoustic features and textual content , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[2]  M. Omidvari,et al.  Journal of mathematics and computer science , 2014 .

[3]  K. Fischer,et al.  DESPERATELY SEEKING EMOTIONS OR: ACTORS, WIZARDS, AND HUMAN BEINGS , 2000 .

[4]  Rosângela Coelho,et al.  Time-Frequency Feature and AMS-GMM Mask for Acoustic Emotion Classification , 2014, IEEE Signal Processing Letters.

[5]  Juan Carlos,et al.  Review of "Discrete-Time Speech Signal Processing - Principles and Practice", by Thomas Quatieri, Prentice-Hall, 2001 , 2003 .

[6]  L. Padma Suresh,et al.  Artificial Intelligence and Evolutionary Algorithms in Engineering Systems , 2015 .

[7]  Aurobinda Routray,et al.  Classification of Power Quality Disturbances Using Parzen Kernels , 2010 .

[8]  Jiucang Hao,et al.  Emotion recognition by speech signals , 2003, INTERSPEECH.

[9]  Rosângela Coelho,et al.  Text-independent speaker recognition based on the Hurst parameter and the multidimensional fractional Brownian motion model , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[10]  Constantine Kotropoulos,et al.  Emotional speech recognition: Resources, features, and methods , 2006, Speech Commun..

[11]  H. E. Hurst,et al.  Long-Term Storage Capacity of Reservoirs , 1951 .

[12]  T. Higuchi Approach to an irregular time series on the basis of the fractal theory , 1988 .

[13]  Werner Verhelst,et al.  Automatic Classification of Expressiveness in Speech: A Multi-corpus Study , 2007, Speaker Classification.

[14]  George N. Votsis,et al.  Emotion recognition in human-computer interaction , 2001, IEEE Signal Process. Mag..

[15]  Ashok Kumar,et al.  Neural Networks for Fast Estimation of Social Network Centrality Measures , 2015 .

[16]  Aurobinda Routray,et al.  Machine Learning Approach for Emotional Speech Classification , 2014, SEMCCO.

[17]  Mohammad Masoud Javidi,et al.  Speech Emotion Recognition by Using Combinations of C5.0, Neural Network (NN), and Support Vector Machines (SVM) Classification Methods , 2013 .

[18]  Wee Ser,et al.  Probabilistic neural-network structure determination for pattern classification , 2000, IEEE Trans. Neural Networks Learn. Syst..

[19]  Roddy Cowie,et al.  Automatic recognition of emotion from voice: a rough benchmark , 2000 .

[20]  Fakhri Karray,et al.  Survey on speech emotion recognition: Features, classification schemes, and databases , 2011, Pattern Recognit..

[21]  Mahesh Chandra,et al.  Use of Different Features for Emotion Recognition Using MLP Network , 2015 .

[22]  Oh-Wook Kwon,et al.  EMOTION RECOGNITION BY SPEECH SIGNAL , 2003 .

[23]  Hania Farag,et al.  Emotion Recognition Using Neural Network: A Comparative Study , 2013 .

[24]  Mahesh Chandra,et al.  Design of Neural Network Model for Emotional Speech Recognition , 2015 .

[25]  Björn W. Schuller,et al.  Timing levels in segment-based speech emotion recognition , 2006, INTERSPEECH.