Trends in adaptive MISO system identification for multichannel audio reproduction and speech communication

Online identification of multiple-input/single-output (MISO) acoustic systems is a long-standing and continuing challenge in multichannel speech and audio applications. Fast and robust estimation of the impulse response of an acoustic system is a key requirement for several adaptive solutions in time-varying scenarios, such as stereophonic acoustic echo cancellation, room equalization, and crosstalk cancellation. However, the cross-correlated loudspeaker signals that are inevitable in multichannel applications give rise to the well-known non-uniqueness problem of MISO system identification. Apart from this fundamental issue, a more practical problem is the lack of techniques for properly evaluating the estimated impulse responses. Since well-established measures often cannot account for all aspects of online MISO system identification, we resort to the recently proposed spectral-importance weighted misalignment (SIWM) to assess MISO identification. In this contribution, we review SIWM and its relation to well-established evaluation tools. On this basis, we provide insight into the problem of MISO system identification in applications driven by real stereo data. We also analyze and compare a traditional and a very recent approach to dealing with the non-uniqueness problem.
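
The non-uniqueness problem mentioned above can be illustrated with a minimal sketch. It assumes that both loudspeaker signals are filtered versions of a single common source, which is the classic stereophonic situation. All filter coefficients, the constant `c`, and the helper names are hypothetical choices made for illustration only. The evaluation uses the conventional normalized misalignment, not SIWM, whose definition is not given here.

```python
import math
import random


def convolve(a, b):
    """Full linear convolution of two finite sequences."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def add_scaled(a, b, c):
    """Return a + c*b, zero-padding the shorter sequence."""
    n = max(len(a), len(b))
    a = list(a) + [0.0] * (n - len(a))
    b = list(b) + [0.0] * (n - len(b))
    return [ai + c * bi for ai, bi in zip(a, b)]


def misalignment_db(h_true, h_est):
    """Conventional normalized misalignment, 10*log10(||h - h_est||^2 / ||h||^2)."""
    num = sum((t - e) ** 2 for t, e in zip(h_true, h_est))
    den = sum(t ** 2 for t in h_true)
    return 10.0 * math.log10(num / den)


random.seed(0)
s = [random.gauss(0.0, 1.0) for _ in range(200)]  # common source signal

# Hypothetical paths that generate the two (fully cross-correlated) input signals.
g1 = [1.0, 0.5, 0.25]
g2 = [0.8, -0.3, 0.1]
x1, x2 = convolve(s, g1), convolve(s, g2)

# Hypothetical true impulse responses of the MISO system to be identified.
h1 = [0.9, 0.4, -0.2]
h2 = [0.7, -0.1, 0.3]

# A wrong "estimate": because g2*g1 - g1*g2 = 0 under convolution,
# adding c*g2 to h1 and subtracting c*g1 from h2 leaves the output unchanged.
c = 0.5  # arbitrary nonzero constant
h1_alt = add_scaled(h1, g2, c)
h2_alt = add_scaled(h2, g1, -c)

y_true = add_scaled(convolve(x1, h1), convolve(x2, h2), 1.0)
y_alt = add_scaled(convolve(x1, h1_alt), convolve(x2, h2_alt), 1.0)

max_output_error = max(abs(a - b) for a, b in zip(y_true, y_alt))
nm_db = misalignment_db(h1 + h2, h1_alt + h2_alt)  # stacked two-channel vectors

print(f"max output error: {max_output_error:.2e}")
print(f"normalized misalignment: {nm_db:.2f} dB")
```

In this toy setup the two filter pairs produce the same output up to rounding error, although the second pair is far from the true responses. An output-error criterion alone therefore cannot tell them apart, which is why a misalignment-type measure is needed to assess the identification.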
