Virtual Navigation of Ambisonics-Encoded Sound Fields Containing Near-Field Sources

State-of-the-art methods for virtual navigation of ambisonics-encoded sound fields (i.e., sound fields represented by spherical harmonics) are evaluated, and the perceptual penalties incurred by navigating beyond near-field sources are identified. Virtual navigation enables a listener to explore an acoustic space and, ideally, experience a tonallyand spatially-accurate perception of the sound field. For many methods, navigation is theoretically restricted to the so-called region of validity, which extends spherically from the recording ambisonics microphone to its nearest source, although the precise consequences of violating that restriction have not been previously established. Several auditory metrics are presented for objectively evaluating navigational methods. In particular, models for spectral coloration and perceived localization are developed and shown, through subjective listening experiments, to predict those attributes directly from ambisonics impulse responses with comparable, if not superior, accuracy to existing binaural models. Numerical simulations of simple incident sound fields are implemented and experimentally validated for characterizing navigational methods in terms of the presented metrics. Existing navigational methods are critically reviewed and several linear methods are characterized. Results show that violating the region of validity restriction introduces significant errors in the reproduced levels and localization of near-field sources due to the drastic geometric changes required as the listener navigates. A parametric interpolation method is proposed which ensures, based on known or inferred positions of near-field sources, that the region of validity restriction is not violated. This method is shown to significantly improve coloration and localization performance over a benchmark linear interpolation method due to the exclusion of any invalid microphones from the interpolation calculation. An existing parametric interpolation method is also characterized, and practical guidelines are identified for iii choosing between the methods considered here depending on the sparsity of the microphone array, the intimacy of the sources, and the complexity of the sound field. Results show that, for applications with intimate sources and sparsely distributed microphones, only the proposed method yields high tonal fidelity, whereas the existing parametric method yields accurate and superior localization. For applications with distant sources and sparsely distributed microphones, the proposed method achieves both high tonal fidelity and accurate localization.

[1]  Michele Geronazzo,et al.  Coloration metrics for headphone equalization , 2015, ICAD.

[2]  Satoru Emura,et al.  Sound field estimation using two spherical microphone arrays , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Emanuel A. P. Habets,et al.  Geometry-Based Spatial Sound Acquisition Using Distributed Microphone Arrays , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Shuichi Sakamoto,et al.  Extended sound field recording using position information of directional sound sources , 2017, 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[5]  David G. Malham,et al.  3-D Sound Spatialization using Ambisonic Techniques , 1995 .

[6]  Maarten van Walstijn,et al.  Extended Energy Vector Prediction of Ambisonically Reproduced Image Direction at Off-Center Listening Positions , 2016 .

[7]  Joseph G. Tylka,et al.  A Toolkit for Customizing the ambiX Ambisonics-to-Binaural Renderer , 2017 .

[8]  Allan D. Pierce,et al.  Acoustics , 1989 .

[9]  Piotr Majdak,et al.  The Auditory Modeling Toolbox , 2013 .

[10]  Alexander Lindau,et al.  Evaluation of equalization methods for binaural signals , 2009 .

[11]  Ramani Duraiswami,et al.  Computation of the head-related transfer function via the fast multipole accelerated boundary element method and its spherical harmonic representation. , 2010, The Journal of the Acoustical Society of America.

[12]  Xiguang Zheng,et al.  Soundfield navigation: Separation, compression and transmission , 2013 .

[13]  Ville Pulkki,et al.  Spatial Sound Reproduction with Directional Audio Coding , 2007 .

[14]  Larry S. Davis,et al.  High Order Spatial Audio Capture and Its Binaural Head-Tracked Playback Over Headphones with HRTF Cues , 2005 .

[15]  Hyunkook Lee Subjective Evaluations of Perspective Control Microphone Array (PCMA) , 2012 .

[16]  Joseph G. Tylka,et al.  Models for evaluating navigational techniques for higher-order ambisonics , 2017 .

[17]  Balázs Bank Audio Equalization with Fixed-Pole Parallel Filters: An Efficient Alternative to Complex Smoothing , 2010 .

[18]  Thushara D. Abhayapala,et al.  Reproduction of a plane-wave sound field using an array of loudspeakers , 2001, IEEE Trans. Speech Audio Process..

[19]  Edgar Y. Choueiri,et al.  Comparison of Techniques for Binaural Navigation of Higher-Order Ambisonic Soundfields , 2015 .

[20]  P. Hansen Rank-Deficient and Discrete Ill-Posed Problems: Numerical Aspects of Linear Inversion , 1987 .

[21]  Alois Sontacchi,et al.  Localization Experiments Using Different 2 D Ambisonics Decoders , 2008 .

[22]  E. Williams,et al.  Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography , 1999 .

[23]  Angelo Farina,et al.  Advancements in Impulse Response Measurements by Sine Sweeps , 2007 .

[24]  Enda Bates,et al.  A Recording Technique for 6 Degrees of Freedom VR , 2018 .

[25]  Roland Bücklein,et al.  The Audibility of Frequency Response Irregularities , 1981 .

[26]  Boaz Rafaely,et al.  Interaural cross correlation in a sound field represented by spherical harmonics. , 2009, The Journal of the Acoustical Society of America.

[27]  Damian T. Murphy,et al.  Rendering walk-through auralisations using wave-based acoustical models , 2009, 2009 17th European Signal Processing Conference.

[28]  André van Schaik,et al.  Room acoustics simulation for multichannel microphone arrays , 2010 .

[29]  Jörg Fliege,et al.  The distribution of points on the sphere and corresponding cubature formulae , 1999 .

[30]  C. Avendano,et al.  The CIPIC HRTF database , 2001, Proceedings of the 2001 IEEE Workshop on the Applications of Signal Processing to Audio and Acoustics (Cat. No.01TH8575).

[31]  Thushara D. Abhayapala,et al.  3D sound field analysis using circular higher-order microphone array , 2015, 2015 23rd European Signal Processing Conference (EUSIPCO).

[32]  Sascha Spors,et al.  Localization Properties of Data-based Binaural Synthesis including Translatory Head-Movements , 2014 .

[33]  F. Jacobsen,et al.  A new interpretation of distortion artifacts in sweep measurements , 2011 .

[34]  Christof Faller,et al.  Linear Simulation of Spaced Microphone Arrays Using B-Format Recordings , 2010 .

[35]  Natasha Barrett,et al.  A New Method for B-Format to Binaural Transcoding , 2010 .

[37]  Christoph Pörschmann,et al.  Binaural Reproduction of Plane Waves With Reduced Modal Order , 2014 .

[38]  Edgar Y. Choueiri,et al.  Soundfield Navigation using an Array of Higher-Order Ambisonics Microphones , 2016 .

[39]  Juha Merimaa,et al.  Spatial Impulse Response Rendering I: Analysis and Synthesis , 2005 .

[40]  Jerome Daniel,et al.  Spatial Sound Encoding Including Near Field Effect: Introducing Distance Coding Filters and a Viable, New Ambisonic Format , 2003 .

[41]  Joseph G. Tylka,et al.  Performance of Linear Extrapolation Methods for Virtual Sound Field Navigation , 2020 .

[42]  V. Ralph Algazi,et al.  Motion-Tracked Binaural Sound , 2004 .

[43]  Efren Fernandez-Grande,et al.  Sound field reconstruction using a spherical microphone array. , 2016, The Journal of the Acoustical Society of America.

[44]  Matti Karjalainen,et al.  Analyzing Virtual Sound Source Attributes Using a Binaural Auditory Model , 1999 .

[45]  Swen Müller,et al.  Transfer-Function Measurement with Sweeps , 2001 .

[46]  Francis F. Li,et al.  Localisation Performance of Higher-Order Ambisonics for Off-Centre Listening , 2013 .

[47]  Francis M. Boland,et al.  Dynamic Time Warping for acoustic response interpolation: Possibilities and limitations , 2009, 2009 17th European Signal Processing Conference.

[48]  Yan Wang,et al.  Translations of spherical harmonics expansion coefficients for a sound field using plane wave expansions. , 2018, The Journal of the Acoustical Society of America.

[49]  John C. Platt,et al.  HRTF magnitude synthesis via sparse representation of anthropometric features , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[50]  Peter Balazs,et al.  Multiple Exponential Sweep Method for Fast Measurement of Head-Related Transfer Functions , 2007 .

[51]  Rudolf Rabenstein,et al.  Limitations in the extrapolation of wave fields from circular measurements , 2007, 2007 15th European Signal Processing Conference.

[52]  Simon Doclo,et al.  Smoothing individual head-related transfer functions in the frequency and spatial domains. , 2014, The Journal of the Acoustical Society of America.

[53]  Hugo Fastl,et al.  Auditory adapted exponential transfer function smoothing (AAS) , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[54]  Marwan Al-Akaidi,et al.  Nearfield binaural synthesis and ambisonics. , 2007, The Journal of the Acoustical Society of America.

[55]  Mark A. Poletti,et al.  Three-Dimensional Surround Sound Systems Based on Spherical Harmonics , 2005 .

[56]  Sascha Spors,et al.  Comparison of Modal Versus Delay-and-Sum Beamforming in the Context of Data-Based Binaural Synthesis , 2012 .

[57]  John Vanderkooy,et al.  Increasing the audio measurement capability of FFT analyzers by microcomputer postprocessing , 1985 .

[58]  Bosun Xie,et al.  The Audibility of Spectral Detail of Head-Related Transfer Functions at High Frequency , 2010 .

[59]  Edgar Y. Choueiri,et al.  A Generalized Method for Fractional-Octave Smoothing of Transfer Functions that Preserves Log-Frequency Symmetry , 2017 .

[60]  Dylan Menzies,et al.  Ambisonic Synthesis of Complex Sources , 2007 .

[61]  Robert Höldrich,et al.  A 3D Ambisonic Based Binaural Sound Reproduction System , 2003 .

[62]  Sascha Spors,et al.  An analytical approach to sound field reproduction with a movable sweet spot using circular distributions of loudspeakers , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[63]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[64]  Volker Hohmann,et al.  Auditory model based direction estimation of concurrent speakers from binaural signals , 2011, Speech Commun..

[65]  Rahulram Sridhar,et al.  A Database of Head-Related Transfer Functions and Morphological Measurements , 2017 .

[66]  Hyunkook Lee,et al.  A New Multichannel Microphone Technique for Effective Perspective Control , 2011 .

[67]  Alois Sontacchi,et al.  AMBIX - A SUGGESTED AMBISONICS FORMAT , 2011 .

[68]  Sascha Spors,et al.  Binaural Assessment of Multichannel Reproduction , 2013 .

[69]  Emanuel A. P. Habets,et al.  Six-Degrees-of-Freedom Binaural Audio Reproduction of First-Order Ambisonics with Distance Information , 2018 .

[70]  Method for the subjective assessment of intermediate quality level of , 2014 .

[71]  Francis Rumsey,et al.  On the Sound Color Properties of Wavefield Synthesis and Stereo , 2007 .

[72]  David Stanley Mcgrath,et al.  Sound Field Format to Binaural Decoder with Head Tracking , 1996 .

[73]  John Mourjopoulos,et al.  Addendum to 'Generalized Fractional-Octave Smoothing of Audio and Acoustic Responses' , 2000 .

[74]  Hans-Joachim Maempel,et al.  On the Audibility of Comb Filter Distortions , 2007 .

[75]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[76]  A. Bronkhorst,et al.  Auditory distance perception in humans : A summary of past and present research , 2005 .

[77]  Thibaut Carpentier,et al.  Normalization Schemes in Ambisonic: Does it Matter? , 2017 .

[78]  W. Lindemann Extension of a binaural cross-correlation model by contralateral inhibition. II. The law of the first wave front. , 1986, The Journal of the Acoustical Society of America.

[79]  Thushara D. Abhayapala,et al.  Practical implementation and analysis of spatial soundfield capture by higher order microphones , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.

[80]  Sascha Spors,et al.  Physical Properties of Modal Beamforming in the Context of Data-Based Sound Reproduction , 2015 .

[81]  Tomasz Zernicki,et al.  Toward Six Degrees of Freedom Audio Recording and Playback Using Multiple Ambisonics Sound Fields , 2019 .

[82]  Maarten van Walstijn,et al.  Off-Center Listening with Third-Order Ambisonics: Dependence of Perceived Source Direction on Signal Type , 2017 .

[83]  Khaled Boussetta,et al.  SoundDelta: A Study of Audio Augmented Reality Using WiFi-Distributed Ambisonic Cell Rendering , 2010 .

[84]  Katja Vetter,et al.  ExpoChirpToolbox: a Pure Data implementation of ESS impulse response measurement , 2011 .

[85]  Matthias KRONLACHNER Ambisonics plug-in suite for production and performance usage , 2013 .

[86]  Jose J. Lopez,et al.  Binaural Room Impulse Responses Interpolation for Multimedia Real-Time Applications , 2018 .

[87]  Edgar Y. Choueiri,et al.  A Method for Efficiently Calculating Head-Related Transfer Functions Directly from Head Scan Point Clouds , 2017 .

[88]  Yukio Iwaya,et al.  Dataset of head-related transfer functions measured with a circular loudspeaker array , 2014 .

[89]  James M. Kates A Perceptual Criterion for Loudspeaker Evaluation , 1984 .

[90]  Mikko-Ville Laitinen,et al.  Binaural reproduction for Directional Audio Coding , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[91]  Edgar Y. Choueiri Binaural Audio Through Loudspeakers , 2017 .

[92]  Hisashi Kihara,et al.  digital audio signal processing , 1990 .

[93]  A. M. Salomons,et al.  Coloration and Binaural Decoloration of Sound due to Reflections , 1995 .

[94]  W. A. Mvnso,et al.  Loudness , Its Definition , Measurement and Calculation , 2004 .

[95]  Hiroshi Saruwatari,et al.  Sound Field Recording Using Distributed Microphones Based on Harmonic Analysis of Infinite Order , 2018, IEEE Signal Processing Letters.

[96]  Joseph G. Tylka,et al.  Domains of Practical Applicability for Parametric Interpolation Methods for Virtual Sound Field Navigation , 2019 .

[97]  Frank Boland,et al.  Acoustic Impulse Response Interpolation for Multichannel Systems Using Dynamic Time Warping , 2009 .

[98]  Sascha Spors,et al.  Data-Based Binaural Synthesis Including Rotational and Translatory Head-Movements , 2013 .

[99]  André van Schaik,et al.  The application of compressive sampling to the analysis and synthesis of spatial sound fields , 2009 .

[100]  E. Owens,et al.  An Introduction to the Psychology of Hearing , 1997 .

[101]  Prasanga N. Samarasinghe,et al.  Wavefield Analysis Over Large Areas Using Distributed Higher Order Microphones , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[102]  Aaron Heller,et al.  A Toolkit for the Design of Ambisonic Decoders , 2012 .

[103]  鈴木 陽一,et al.  頭部伝達関数データセット記述の標準化を目指したSOFA: Spetially Oriented format for Acoustics , 2014 .

[104]  Boaz Rafaely,et al.  Analysis and design of spherical microphone arrays , 2005, IEEE Transactions on Speech and Audio Processing.

[105]  R. Duraiswami,et al.  Fast Multipole Methods for the Helmholtz Equation in Three Dimensions , 2005 .

[106]  Yutaka Kaneda,et al.  A Recursive Adaptive Method of Impulse Response Measurement with Constant SNR over Target Frequency Band , 2013 .

[107]  Mark S. Gordon,et al.  Rapid and stable determination of rotation matrices between spherical harmonics by direct recursion , 1999 .

[108]  Brian F G Katz,et al.  A comparative study of Interaural Time Delay estimation methods. , 2014, The Journal of the Acoustical Society of America.

[109]  Joseph G. Tylka,et al.  A New Approach to Impulse Response Measurements at High Sampling Rates , 2014 .

[110]  Michael A. Gerzon,et al.  General Metatheory of Auditory Localisation , 1992 .

[111]  Joseph G. Tylka,et al.  Fundamentals of a parametric method for virtual navigation within an array of ambisonics microphones , 2020 .

[112]  Edgar Y. Choueiri,et al.  3 D Audio and Applied Acoustics Laboratory · Princeton University Algorithms for Computing Ambisonics Translation Filters , 2019 .

[113]  Prasanga N. Samarasinghe,et al.  An Efficient Parameterization of the Room Transfer Function , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.