Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

Non-interactive, linear experiences such as cinema offer high-quality surround sound to enhance immersion; however, the listener's experience is usually fixed to a single acoustic perspective. With the rise of virtual reality, there is demand for recording and recreating real-world experiences in a way that allows the user to interact and move within the reproduction. Conventional sound field translation techniques take a recording and expand it into an equivalent environment of virtual sources. However, the finite sampling of a commercial higher order microphone produces an acoustic sweet-spot in the virtual reproduction. As a result, the technique still restricts the listener's navigable region. In this paper, we propose a method for listener translation in an acoustic reproduction that incorporates a mixture of near-field and far-field sources in a sparsely expanded virtual environment. We perceptually validate the method through a Multiple Stimulus with Hidden Reference and Anchor (MUSHRA) experiment. Compared to the plane-wave benchmark, the proposed method offers both improved source localizability and robustness to spectral distortions at translated positions. A cross-examination with numerical simulations demonstrates that the sparse expansion relaxes the inherent sweet-spot constraint, leading to the improved localizability for sparse environments. Additionally, the proposed method is seen to better reproduce the intensity and binaural room impulse response spectra of near-field environments, further supporting the strong perceptual results.
