Speech dereverberation in multisource environment using LCMV filter

In this paper, a novel multi channel speech dereverberation method in a multisource environment is proposed. The primary contribution of this work is the utilization of a multi channel linearly constrained minimum variance (LCMV) filter for speech dereverberation in the presence of multiple sources. An initial estimate of the desired speaker signal is obtained using a sparse matrix structure obtained herein. A multi channel LCMV filter is then used to obtain the final estimates of the desired signal by applying constraints on both early and late part of individual frames of the short term speech signal. Experiments on speech dereverberation and distant speech recognition are conducted at various direct to reverberant ratios to evaluate the performance of the proposed method. Performance improvements are noticable when compared to other multi channel speech dereverberation methods in the literature.

[1]  Carlos Hernandez Matas and Diana Gomez Olmedilla Image Theory Applied to virtual Microphones , 2008 .

[2]  S. R. Mahadeva Prasanna,et al.  Speech enhancement using excitation source information , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Y. Haneda,et al.  Estimating Direct-to-Reverberant Energy Ratio Using D/R Spatial Correlation Matrix Model , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Andries P. Hekstra,et al.  Perceptual evaluation of speech quality (PESQ)-a new method for speech quality assessment of telephone networks and codecs , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[6]  E.A.P. Habets,et al.  Towards multi-microphone speech dereverberation using spectral enhancement and statistical reverberation models , 2008, 2008 42nd Asilomar Conference on Signals, Systems and Computers.

[7]  Ken'ichi Furuya,et al.  Speech Dereverberation by Combining Mint-Based Blind Deconvolution and Modified Spectral Subtraction , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[8]  T. Houtgast,et al.  A review of the MTF concept in room acoustics and its use for estimating speech intelligibility in auditoria , 1985 .

[9]  Carla Teixeira Lopes,et al.  TIMIT Acoustic-Phonetic Continuous Speech Corpus , 2012 .

[10]  Pavel Zahorik,et al.  Direct-to-reverberant energy ratio sensitivity. , 2002, The Journal of the Acoustical Society of America.

[11]  Jessica Koehler,et al.  Advanced Digital Signal Processing And Noise Reduction , 2016 .

[12]  Patrick A. Naylor,et al.  Acoustic Signal Processing in Noise: It's Not Getting Any Quieter , 2012, IWAENC.

[13]  Yi Hu,et al.  Evaluation of Objective Quality Measures for Speech Enhancement , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[14]  Jacob Benesty,et al.  A Single-Channel MVDR Filter for Acoustic Echo Suppression , 2013, IEEE Signal Processing Letters.

[15]  R. Balan,et al.  PERFORMANCE ASSESSMENT METHOD FOR SPEECH ENHANCEMENT SYSTEMS , 2005 .

[16]  P.A. Naylor,et al.  Spatiotemporal Averagingmethod for Enhancement of Reverberant Speech , 2007, 2007 15th International Conference on Digital Signal Processing.

[17]  Rajesh M. Hegde,et al.  Joint noise cancellation and dereverberation using multi-channel linearly constrained minimum variance filter , 2013, INTERSPEECH.

[18]  Jacob Benesty,et al.  Single-channel noise reduction using optimal rectangular filtering matrices. , 2013, The Journal of the Acoustical Society of America.