Source, sensor and reflector position estimation from acoustical room impulse responses

The acoustic environment affects the properties of the audio signals recorded. Generally, given room impulse responses (RIRs), three sets of parameters have to be extracted in order to create an acoustic model of the environment: sources, sensors and reflector positions. In this paper, the cross-correlation based iterative sensor position estimation (CISPE) algorithm is presented, a new method to estimate a microphone configuration, together with source and reflector position estimators. A rough measurement of the microphone positions initializes the process; then a recursive algorithm is applied to improve the estimates, exploiting a delay-and-sum beamformer. Knowing where the microphones lie in the space, the dynamic programming projected phase slope algorithm (DYPSA) extracts the times of arrival (TOAs) of the direct sounds from the RIRs, and multiple signal classification (MUSIC) extracts the directions of arrival (DOAs). A triangulation technique is then applied to estimate the source positions. Finally, exploiting properties of 3D quadratic surfaces (namely, ellipsoids), reflecting planes are localized via a technique ported from image processing, by random sample consensus (RANSAC). Simulation tests were performed on measured RIR datasets acquired from three different rooms located at the University of Surrey, using either a uniform circular array (UCA) or uniform rectangular array (URA) of microphones. Results showed small improvements with CISPE pre-processing in almost every case.

[1]  Harry L. Van Trees,et al.  Optimum Array Processing: Part IV of Detection, Estimation, and Modulation Theory , 2002 .

[2]  Sakari Tervo,et al.  Estimation of reflective surfaces from continuous signals , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Harvey F. Silverman,et al.  Microphone position and gain calibration for a large-aperture microphone array , 2005, IEEE Transactions on Speech and Audio Processing.

[4]  Minghua Chen,et al.  Energy-Based Position Estimation of Microphones and Speakers for Ad Hoc Microphone Arrays , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[5]  Rainer Lienhart,et al.  Position calibration of microphones and loudspeakers in distributed computing platforms , 2005, IEEE Transactions on Speech and Audio Processing.

[6]  Wenwu Wang,et al.  Room boundary estimation from acoustic room impulse responses , 2014, 2014 Sensor Signal Processing for Defence (SSPD).

[7]  Martin Vetterli,et al.  Can one hear the shape of a room: The 2-D polygonal case , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  P. Jackson,et al.  Personal audio with a planar bright zone. , 2014, The Journal of the Acoustical Society of America.

[9]  Stanley T. Birchfield,et al.  Microphone array position calibration by basis-point classical multidimensional scaling , 2005, IEEE Transactions on Speech and Audio Processing.

[10]  Augusto Sarti,et al.  Estimation of room dimensions from a single impulse response , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[11]  Martin Vetterli,et al.  Single-channel indoor microphone localization , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[12]  Yubin Kuang,et al.  A complete characterization and solution to the microphone position self-calibration problem , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Walter Kellermann,et al.  Joint DOA and TDOA estimation for 3D localization of reflective surfaces using eigenbeam MVDR and spherical microphone arrays , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[14]  Jonathon A. Chambers,et al.  A 3D model for room boundary estimation , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[15]  Augusto Sarti,et al.  Localization of planar acoustic reflectors from the combination of linear estimates , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[16]  Sakari Tervo,et al.  3D room geometry estimation from measured impulse responses , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Emanuel A. P. Habets,et al.  Inference of Room Geometry From Acoustic Impulse Responses , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[18]  Philip J. B. Jackson,et al.  Estimation of Room Reflection Parameters for a Reverberant Spatial Audio Object , 2015 .

[19]  Mike Brookes,et al.  Estimation of Glottal Closure Instants in Voiced Speech Using the DYPSA Algorithm , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Mike Brookes,et al.  Room geometry estimation from a single channel acoustic impulse response , 2013, 21st European Signal Processing Conference (EUSIPCO 2013).

[21]  Alessio Del Bue,et al.  A closed form solution to the microphone position self-calibration problem , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Augusto Sarti,et al.  From direction of arrival estimates to localization of planar reflectors in a two dimensional geometry , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).