On acoustic channel identification in multi-microphone systems via adaptive blind signal enhancement techniques

Among the different configurations of multi-microphone systems, e.g., in applications of speech dereverberation or denoising, we consider the case without a priori information of the microphone-array geometry. This naturally invokes explicit or implicit identification of source-receiver transfer functions as an indirect description of the microphone-array configuration. However, this blind channel identification (BCI) has been difficult due to the lack of unique identifiability in the presence of observation noise or near-common channel zeros. In this paper, we study the implicit BCI performance of blind signal enhancement techniques such as the adaptive principal component analysis (PCA) or the iterative blind equalization and channel identification (BENCH). To this end, we make use of a recently proposed metric, the normalized filter-projection misalignment (NFPM), which is tailored for BCI evaluation in ill-conditioned (e.g., noisy) scenarios. The resulting understanding of implicit BCI performance can help to judge the behavior of multi-microphone speech enhancement systems and the suitability of implicit BCI to serve channel-based (i.e., channel-informed) enhancement.

[1]  L. J. Griffiths,et al.  An alternative approach to linearly constrained adaptive beamforming , 1982 .

[2]  D.G. Tzikas,et al.  The variational approximation for Bayesian inference , 2008, IEEE Signal Processing Magazine.

[3]  Ehud Weinstein,et al.  Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..

[4]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[5]  Bin Yang,et al.  Projection approximation subspace tracking , 1995, IEEE Trans. Signal Process..

[6]  Masato Miyoshi,et al.  Inverse filtering of room acoustics , 1988, IEEE Trans. Acoust. Speech Signal Process..

[7]  Dominic Schmid,et al.  Cross-Relation-Based Blind SIMO Identifiability in the Presence of Near-Common Zeros and Noise , 2012, IEEE Transactions on Signal Processing.

[8]  Gerald Enzner,et al.  Online maximum-likelihood learning of time-varying dynamical models in block-frequency-domain , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[9]  Patrick A. Naylor,et al.  Noise Robust Adaptive Blind Channel Identification Using Spectral Constraints , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[10]  Benesty Adaptive eigenvalue decomposition algorithm for passive acoustic source localization , 2000, The Journal of the Acoustical Society of America.

[11]  Reinhold Häb-Umbach,et al.  Speech Enhancement With a GSC-Like Structure Employing Eigenvector-Based Transfer Function Ratios Estimation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Biing-Hwang Juang,et al.  Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Dominic Schmid,et al.  An expectation-maximization algorithm for multichannel adaptive speech dereverberation in the frequency-domain , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Jacob Benesty,et al.  Adaptive multi-channel least mean square and Newton algorithms for blind channel identification , 2002, Signal Process..

[15]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[16]  Marc Moonen,et al.  Subspace Methods for Multimicrophone Speech Dereverberation , 2003, EURASIP J. Adv. Signal Process..

[17]  O. L. Frost,et al.  An algorithm for linearly constrained adaptive array processing , 1972 .

[18]  John G. Proakis,et al.  Digital Signal Processing: Principles, Algorithms, and Applications , 1992 .

[19]  Peter Vary,et al.  Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones , 2006, Signal Process..

[20]  Jacob Benesty,et al.  Adaptive common root estimation and the common zeros problem in blind channel identification , 2005, 2005 13th European Signal Processing Conference.

[21]  T. Kailath,et al.  A least-squares approach to blind channel identification , 1995, IEEE Trans. Signal Process..

[22]  Dominic Schmid,et al.  A Maximum A Posteriori Approach to Multichannel Speech Dereverberation and Denoising , 2012, IWAENC.

[23]  Reinhold Häb-Umbach,et al.  Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Eric Moulines,et al.  Subspace methods for the blind identification of multichannel FIR filters , 1995, IEEE Trans. Signal Process..

[25]  E. Oja Simplified neuron model as a principal component analyzer , 1982, Journal of mathematical biology.

[26]  Juha Karhunen,et al.  Adaptive algorithms for estimating eigenvectors of correlation type matrices , 1984, ICASSP.

[27]  Patrick A. Naylor,et al.  An extended normalized multichannel FLMS algorithm for blind channel identification , 2006, 2006 14th European Signal Processing Conference.