Spatial accuracy of binaural synthesis from rigid spherical microphone array recordings

Binaural systems are a promising class of three-dimensional (3D) auditory displays for high-definition personal 3D audio devices. They properly synthesize the sound pressure signals at the ears of a listener, namely binaural signals, by means of the head-related transfer functions (HRTFs). Rigid spherical microphone arrays (RSMAs) are widely used to capture sound pressure fields for binaural presentation to multiple listeners. However, the spatial resolution needed in the RSMAs to allow for accurate binaural reproduction has not been studied in detail. The aim of this paper is to objectively address this question. We evaluated the spatial accuracy in binaural signals synthesized from the recordings of RSMAs with different number of microphones using the model of a human head. We find that the synthesis of spectral cues is accurate up to a maximum frequency determined by the number of microphones. Nevertheless, we also identify a limit beyond which adding more microphones does not improve overall accuracy. Said limit is higher for the interaural spectral cues than for the monaural ones.

[1]  Larry S. Davis,et al.  High Order Spatial Audio Capture and Its Binaural Head-Tracked Playback Over Headphones with HRTF Cues , 2005 .

[2]  Richard M. Stern,et al.  Efficient Real Spherical Harmonic Representation of Head-Related Transfer Functions , 2015, IEEE Journal of Selected Topics in Signal Processing.

[3]  W. M. Rabinowitz,et al.  Auditory localization of nearby sources. Head-related transfer functions. , 1999, The Journal of the Acoustical Society of America.

[4]  Yoichi Ando,et al.  On the simulation of sound localization , 1980 .

[5]  Yôiti Suzuki,et al.  Accuracy of head-related transfer functions synthesized with spherical microphone arrays , 2013 .

[6]  Shuichi Sakamoto,et al.  Sound-space recording and binaural presentation system based on a 252-channel microphone array , 2015 .

[7]  William A. Yost,et al.  Spatial hearing: The psychophysics of human sound localization, revised edition , 1998 .

[8]  Shuichi Sakamoto,et al.  3D Sound-Space Sensing Method Based on Numerous Symmetrically Arranged Microphones , 2014, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..

[9]  Ramani Duraiswami,et al.  Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation. , 2010, The Journal of the Acoustical Society of America.

[10]  Henrik Møller Fundamentals of binaural technology , 1991 .

[11]  Yukio Iwaya,et al.  3D Spatial Sound Systems Compatible with Human's Active Listening to Realize Rich High-Level kansei Information , 2012 .

[12]  Noam R. Shabtai,et al.  Optimization of the directivity in binaural sound reproduction beamforming. , 2015, The Journal of the Acoustical Society of America.

[13]  Yukio Iwaya,et al.  Dataset of head-related transfer functions measured with a circular loudspeaker array , 2014 .

[14]  Jin Cheng,et al.  Regularization Methods for Ill-Posed Problems , 2015, Handbook of Mathematical Methods in Imaging.

[15]  Steven van de Par,et al.  Least squares versus non-linear cost functions for a vitual artificial head , 2013 .

[16]  Ramani Duraiswami,et al.  Extracting the frequencies of the pinna spectral notches in measured head related impulse responses. , 2004, The Journal of the Acoustical Society of America.

[17]  Wolfgang Ellermeier,et al.  Using beamforming and binaural synthesis for the psychoacoustical evaluation of target sources in noise. , 2008, The Journal of the Acoustical Society of America.

[18]  Shuichi Sakamoto,et al.  SENZI and ASURA: New High-Precision Sound-Space Sensing Systems based on Symmetrically Arranged Numerous Microphones , 2008, 2008 Second International Symposium on Universal Communication.

[19]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[20]  Yôiti Suzuki,et al.  Sound localization in headphone reproduction by simulating transfer functions from the sound source to the external ear , 1991 .

[21]  R. Duda,et al.  Range dependence of the response of a spherical head model , 1998 .

[22]  Holography Book,et al.  Fourier Acoustics Sound Radiation And Nearfield Acoustical Holography , 2016 .

[23]  Seok-Pil Lee,et al.  A Relevant Distance Criterion for Interpolation of Head-Related Transfer Functions , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  H. Colburn,et al.  Sensitivity of human subjects to head-related transfer-function phase spectra. , 1999, Journal of the Acoustical Society of America.

[25]  Makoto Otani,et al.  Fast calculation system specialized for head-related transfer function based on boundary element method. , 2006, The Journal of the Acoustical Society of America.

[26]  Boaz Rafaely,et al.  Generalized Spherical Array Beamforming for Binaural Speech Reproduction , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[27]  Boaz Rafaely,et al.  Spatial perception of sound fields recorded by spherical microphone arrays with varying spatial resolution. , 2013, The Journal of the Acoustical Society of America.

[28]  Rodney A. Kennedy,et al.  Novel Sampling Scheme on the Sphere for Head-Related Transfer Function Measurements , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[29]  Christoph Pörschmann,et al.  Binaural Reproduction of Plane Waves With Reduced Modal Order , 2014 .