Binary Particle Swarm Optimization and F-Ratio for Selection of Features in the Recognition of Asphyxiated Infant Cry

In the infant cry classification for detecting pathological conditions using Artificial Neural Network, a common feature extraction technique employed is Mel Frequency Cepstrum Coefficient (MFCC) analysis due to its good representation properties. However, not all MFCC features are significant for classification. If irrelevant features are retained, the performance of the classifier will be degraded. This paper examines the performance of F-ratio and BPSO in selecting infant cry features obtained from MFCC analysis. The performance of both methods was evaluated based on the classification accuracy produced when the selected features were passed to Multi-Layer Perceptron (MLP) classifier. It was found that the BPSO managed to produce better result compared to F-Ratio technique. The classification accuracy achieved using F-Ratio was 93.38%, which was obtained when 29 MFCC filter banks, 8 selected MFC coefficients and 45 hidden nodes were used. The BPSO managed to obtain classification accuracy of 96.03% using 34 MFCC filter banks, 16 selected MFC coefficients and 5 hidden nodes of MLP.

[1]  N.S. Nehe,et al.  Auditory Perception Based Admissible Wavelet Packet Trees For Speech Recognition , 2008, 2008 IEEE Region 10 and the Third international Conference on Industrial and Information Systems.

[2]  D. Lederman AUTOMATIC CLASSIFICATION OF INFANTS' CRY , 2002 .

[3]  Alfred S. Malowany,et al.  Classification of infant cry vocalizations using artificial neural networks (ANNs) , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[5]  A. Zabidi,et al.  Particle Swarm Optimisation of Mel-frequency Cepstral Coefficients computation for the classification of asphyxiated infant cry , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[6]  A. S. Malowany,et al.  A comparison of neural network architectures for the classification of three types of infant cry vocalizations , 1995, Proceedings of 17th International Conference of the Engineering in Medicine and Biology Society.

[7]  Rohilah Sahak,et al.  Binary Particle Swarm Optimization for selection of features in the recognition of infants cries with asphyxia , 2011, 2011 IEEE 7th International Colloquium on Signal Processing and its Applications.

[8]  K. Anitha Sheela,et al.  Linear Discriminant Analysis F-Ratio for Optimization of TESPAR & MFCC Features for Speaker Recongnition , 2007, J. Multim..

[9]  Aiguo Li,et al.  Feature Subset Selection Based on Binary Particle Swarm Optimization and Overlap Information Entropy , 2009, 2009 International Conference on Computational Intelligence and Software Engineering.

[10]  Orion F. Reyes-Galaviz,et al.  A System for the Processing of Infant Cry to Recognize Pathologies in Recently Born Babies with Neural Networks , 2004 .