Comparison of techniques for environmental sound recognition

This paper presents a comprehensive comparative study of artificial neural networks, learning vector quantization and dynamic time warping classification techniques combined with stationary/non-stationary feature extraction for environmental sound recognition. Results show 70% recognition using mel frequency cepstral coefficients or continuous wavelet transform with dynamic time warping.

[1]  Li Liu Ground Vehicle Acoustic Signal Processing Based on Biological Hearing Models , 1999 .

[2]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[3]  Daniel P. W. Ellis,et al.  Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition , 1999 .

[4]  Richard S. Goldhor,et al.  Recognition of environmental sounds , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Robert Mahony,et al.  Speech perception based algorithm for the separation of overlapping speech signal , 2001, The Seventh Australian and New Zealand Intelligent Information Systems Conference, 2001.

[6]  Robert D. Rodman,et al.  Computer Speech Technology , 1999 .

[7]  Kuldip K. Paliwal,et al.  Automatic Speech and Speaker Recognition: Advanced Topics , 1999 .

[8]  Biing-Hwang Juang,et al.  An Overview of Automatic Speech Recognition , 1996 .

[9]  Renate Sitte,et al.  Analysis of Speech Recognition Techniques for use in a Non-Speech Sound Recognition System , 2002 .

[10]  Barbara Burke Hubbard The World According to Wavelets: The Story of a Mathematical Technique in the Making, Second Edition , 1996 .

[11]  Keith Dana Martin,et al.  Sound-source recognition: a theory and computational model , 1999 .

[12]  Sadaoki Furui,et al.  An Overview of Speaker Recognition Technology , 1996 .

[13]  Somkiat Sampan,et al.  Neural Fuzzy Techniques In Vehicle Acoustic Signal Classification , 1997 .

[14]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[15]  Paul Scheunders,et al.  Wavelet-FILVQ classifier for speech analysis , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[16]  Khaled H. Hamed,et al.  Time-frequency analysis , 2003 .

[17]  J C Brown Computer identification of musical instruments using pattern recognition with cepstral coefficients as features. , 1999, The Journal of the Acoustical Society of America.

[18]  Ian T. Nabney,et al.  Netlab: Algorithms for Pattern Recognition , 2002 .

[19]  Renate Sitte,et al.  Recognition of Environmental Sounds Using Speech Recognition Techniques , 2002 .

[20]  Renate Sitte,et al.  Sound identification and direction detection for surveillance applications , 2000 .