Beamforming Alternatives for Multi-Channel Transient Acoustic Event Classification

Signals acquired through a microphone array are typically beamformed to combine channels and improve the signal-to-noise ratio (SNR). However, it has been previously shown that alternative methods for handling multi-channel systems can outperform beamforming for speech recognition applications. In this paper, we implemented a comprehensive set of classification tests using multiple classifiers and feature extraction techniques to determine whether the alternative methods generalize beyond speech recognition applications. We show that applying the alternative methods (in a slightly simpler form) outperforms beamforming when used for classifying transient acoustic projectile weapon signals. Furthermore, an additional technique is introduced which outperforms both beamforming and previously proposed alternatives in certain classification scenarios. For the majority of classification tests, the improvements seen through the use of these alternative methods are statistically significant.

[1]  Longbiao Wang,et al.  Robust Distant Speech Recognition by Combining Multiple Microphone-Array Processing with Position-Dependent CMN , 2006, EURASIP J. Adv. Signal Process..

[2]  Maya R. Gupta,et al.  Nonparametric supervised learning by linear interpolation with maximum entropy , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Michael S. Brandstein,et al.  Microphone Arrays - Signal Processing Techniques and Applications , 2001, Microphone Arrays.

[4]  Peter Kabal,et al.  Room speech dereverberation via minimum-phase and all-pass component processing of multi-microphone signals , 1995, IEEE Pacific Rim Conference on Communications, Computers, and Signal Processing. Proceedings.

[5]  Michael P. Friedlander,et al.  On minimizing distortion and relative entropy , 2006, IEEE Transactions on Information Theory.

[6]  Javier Ortega-Garcia,et al.  A new approach to dereverberation and noise reduction with microphone arrays , 2000, 2000 10th European Signal Processing Conference.

[7]  Steven Salzberg,et al.  On Comparing Classifiers: Pitfalls to Avoid and a Recommended Approach , 1997, Data Mining and Knowledge Discovery.

[8]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[9]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  篠田 浩一 私のすすめるこの一冊 ; Spoken Launguage Processing: A Guide to Theory, Algorithm, and System Development, Xuedong Huang, Alex Acero and Hsiao-Wuen Hon, Prentice Hall, 2001 年 , 2003 .

[11]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .