Room Reverberation Reconstruction: Interpolation of the Early Part Using Compressed Sensing

This paper deals with the interpolation of the Room Impulse Responses (RIRs) within a whole volume, from as few measurements as possible, and without the knowledge of the geometry of the room. We focus on the early reflections of the RIRs, that have the key property of being sparse in the time domain: this can be exploited in a framework of model-based Compressed Sensing. Starting from a set of RIRs randomly sampled in the spatial domain of interest by a 3D microphone array, we propose a modified Matching Pursuit algorithm to estimate the position of a small set of virtual sources. Then, the reconstruction of the RIRs at interpolated positions is performed using a projection onto a basis of monopoles, which correspond to the estimated virtual sources. An extension of the proposed algorithm allows the interpolation of the positions of both source and receiver, using the acquisition of four different source positions. This approach is validated both by numerical examples, and by experimental measurements using a 3D array with up to 120 microphones.

[1]  Dmitry M. Malioutov,et al.  A sparse signal reconstruction perspective for source localization with sensor arrays , 2005, IEEE Transactions on Signal Processing.

[2]  Zhifeng Zhang,et al.  Adaptive time-frequency decompositions with matching pursuit , 1994, Defense, Security, and Sensing.

[3]  Emmanuel J. Candès,et al.  Robust uncertainty principles: exact signal reconstruction from highly incomplete frequency information , 2004, IEEE Transactions on Information Theory.

[4]  Terence Betlehem,et al.  Theory and design of sound field reproduction in reverberant rooms. , 2005, The Journal of the Acoustical Society of America.

[5]  Michael Elad,et al.  Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing , 2010 .

[6]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[7]  S. Mallat,et al.  Adaptive time-frequency decomposition with matching pursuits , 1992, [1992] Proceedings of the IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis.

[8]  Jeffrey C. Lagarias,et al.  Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions , 1998, SIAM J. Optim..

[9]  Emmanuel J. Candès,et al.  Decoding by linear programming , 2005, IEEE Transactions on Information Theory.

[10]  U. Svensson,et al.  An analytic secondary source model of edge diffraction impulse responses , 1999 .

[11]  Frank Boland,et al.  Acoustic Impulse Response Interpolation for Multichannel Systems Using Dynamic Time Warping , 2009 .

[12]  Nobuhiko Kitawaki,et al.  Common-acoustical-pole and residue model and its application to spatial interpolation and extrapolation of a room transfer function , 1999, IEEE Trans. Speech Audio Process..

[13]  E.J. Candes Compressive Sampling , 2022 .

[14]  Zhifeng Zhang,et al.  Adaptive time-frequency decompositions , 1994 .

[15]  Rémi Gribonval,et al.  Blind calibration for compressed sensing by convex optimization , 2011, 1111.7248.

[16]  Stéphane Mallat,et al.  Matching pursuits with time-frequency dictionaries , 1993, IEEE Trans. Signal Process..

[17]  T. Ajdler,et al.  The Plenacoustic Function and Its Sampling , 2006, IEEE Transactions on Signal Processing.

[18]  P. Peterson Simulating the response of multiple microphones to a single acoustic source in a reverberant room. , 1986, The Journal of the Acoustical Society of America.

[19]  J. Borish Extension of the image model to arbitrary polyhedra , 1984 .

[20]  R.G. Baraniuk,et al.  Compressive Sensing [Lecture Notes] , 2007, IEEE Signal Processing Magazine.

[21]  Manfred R. Schroeder,et al.  Statistical parameters of the frequency response curves of large rooms , 1987 .

[22]  Laurent Daudet,et al.  Compressively sampling the plenacoustic function , 2011, Optical Engineering + Applications.

[23]  Jean-Marc Jot,et al.  Analysis and synthesis of room reverberation based on a statistical time-frequency model , 1997 .

[24]  Y. C. Pati,et al.  Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[25]  Laurent Daudet,et al.  Compressed sensing for acoustic response reconstruction: Interpolation of the early part , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[26]  Dale Borowiak,et al.  Linear Models, Least Squares and Alternatives , 2001, Technometrics.

[27]  J. Polack Playing billiards in the concert hall: The mathematical foundations of geometrical room acoustics , 1993 .

[28]  Massimo Fornasier,et al.  Compressive Sensing , 2015, Handbook of Mathematical Methods in Imaging.

[29]  Jont B. Allen,et al.  Image method for efficiently simulating small‐room acoustics , 1976 .

[30]  Nobutaka Ito,et al.  Blind alignment of asynchronously recorded signals for distributed microphone array , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.