Spatial Sound With Loudspeakers and Its Perception: A Review of the Current State

This paper reviews the current state of loudspeaker-based spatial sound reproduction methods from technical perspective as well as perceptual perspective. A nomenclature is developed that allows for a strict separation between these two perspectives. The physical fundamentals, practical realization, and results from perceptual studies are discussed for a number of well-established and emerging reproduction techniques. Further, the paper outlines novel approaches to spatial sound evaluation in terms of perceived quality and provides a comparison of current approaches.

[1]  Johann-Markus Batke,et al.  USING VBAP-DERIVED PANNING FUNCTIONS FOR 3 D AMBISONICS DECODING , 2010 .

[2]  Audun Solvang Spectral Impairment for Two-Dimensional Higher Order Ambisonics , 2008 .

[3]  Jerome Daniel,et al.  Ambisonics Encoding of Other Audio Formats for Multiple Listening Conditions , 1998 .

[4]  W. Marsden I and J , 2012 .

[5]  Scott G. Norcross,et al.  Subjective Investigations of Inverse Filtering , 2004 .

[6]  E. Start Direct sound enhancement by wave field synthesis , 1997 .

[7]  Philippe-Aubert Gauthier,et al.  Sound-field reproduction in-room using optimal control techniques: simulations in the frequency domain. , 2005, The Journal of the Acoustical Society of America.

[8]  Craig T. Jin,et al.  Time domain reconstruction of spatial sound fields using compressed sensing , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  A. H. Marshall,et al.  Spatial impression due to early lateral reflections in concert halls: The derivation of a physical measure , 1981 .

[10]  S. Bech,et al.  Timbral aspects of reproduced sound in small rooms. I. , 1995, The Journal of the Acoustical Society of America.

[11]  Sascha Spors,et al.  Spatial Aliasing Artifacts Produced by Linear and Circular Loudspeaker Arrays used for Wave Field Synthesis , 2006 .

[12]  Methods for the subjective assessment of small impairments in audio systems , 2015 .

[13]  Terence Betlehem,et al.  Theory and design of sound field reproduction in reverberant rooms. , 2005, The Journal of the Acoustical Society of America.

[14]  Thushara D. Abhayapala,et al.  Theory and Design of Soundfield Reproduction Using Continuous Loudspeaker Concept , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Bobby Owsinski The Recording Engineer's Handbook , 2004 .

[16]  Francis Rumsey,et al.  QESTRAL (Part 2): Calibrating the QESTRAL model using listening test data , 2008 .

[17]  Dylan Menzies Nearfield synthesis of complex sources with high-order ambisonics, and binaural rendering. , 2007 .

[18]  Sascha Spors,et al.  Efficient realization of model-based rendering for 2.5-dimensional near-field compensated higher order Ambisonics , 2011, 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).

[19]  Sascha Spors,et al.  Local Sound Field Synthesis by Virtual Acoustic Scattering and Time-Reversal , 2011 .

[20]  Franz Zotter,et al.  All-Round Ambisonic Panning and Decoding , 2012 .

[21]  ON THE LOCALISATION IN THE SUPERIMPOSED SOUNDFIELD , 2004 .

[22]  W. Hartmann Localization of sound in rooms. , 1983, The Journal of the Acoustical Society of America.

[23]  Ute Jekosch,et al.  Voice and Speech Quality Perception: Assessment and Evaluation , 2005 .

[24]  Jan-Jakob Sonke,et al.  Variable Acoustics by Wavefield Synthesis: A Closer Look at Amplitude Effects , 1998 .

[25]  David G. Malham,et al.  3-D Sound Spatialization using Ambisonic Techniques , 1995 .

[26]  Marije A. J. Baalman Reproduction of Arbitrarily Shaped Sound Sources with Wave Field Synthesis - Discretisation and Diffraction Effects , 2007 .

[27]  C. Micheyl,et al.  Toward a Theory of Information Processing in Auditory Cortex , 2012 .

[28]  Michael A. Gerzon Signal Processing for Simulating Realistic Stereo Images , 1992 .

[29]  Alexander Raake,et al.  Localization in Wave Field Synthesis and higher order Ambisonics at different positions within the listening area , 2013 .

[30]  Nick Zacharov,et al.  Audio descriptive analysis & mapping of spatial sound displays , 2001 .

[31]  Sascha Spors,et al.  Comparison of Higher Order Ambisonics and Wave Field Synthesis with Respect to Spatial Discretization Artifacts in Time Domain , 2010 .

[32]  Francis Rumsey,et al.  On the relative importance of spatial and timbral fidelities in judgments of degraded multichannel audio quality. , 2005, The Journal of the Acoustical Society of America.

[33]  Duane H. Cooper,et al.  Discrete-Matrix Multichannel Stereo , 1972 .

[34]  Bobby Owsinski The Mixing Engineer's Handbook , 1999 .

[35]  Brian C. J. Moore,et al.  Development and Validation of a Method for Predicting the Perceived Naturalness of Sounds Subjected to Spectral Distortion , 2004 .

[36]  J. Ahrens,et al.  An Analytical Approach to Sound Field Reproduction Using Circular and Spherical Loudspeaker Distributions , 2008 .

[37]  William B. Snow Basic Principles of Stereophonic Sound , 1953 .

[38]  A. Berkhout,et al.  Acoustic control by wave field synthesis , 1993 .

[39]  D. Leakey Some Measurements on the Effects of Interchannel Intensity and Time Differences in Two Channel Sound Systems , 1959 .

[40]  Michael A. Gerzon,et al.  Ambisonics. Part two: Studio techniques , 1975 .

[41]  Thushara D. Abhayapala,et al.  Higher Order Loudspeakers for Improved Surround Sound Reproduction in Rooms , 2012 .

[42]  E. Williams,et al.  Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography , 1999 .

[43]  Stefan Weinzierl,et al.  Binaural Resynthesis for Comparative Studies of Acoustical Environments , 2007 .

[44]  Tomasz Letowski,et al.  Sound Quality Assessment: Concepts and Criteria , 1989 .

[45]  Sascha Spors,et al.  Sound Field Reproduction Using Planar and Linear Arrays of Loudspeakers , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[46]  Kevin D. Donohue,et al.  Virtual Sound Source Rendering Using a Multipole-Expansion and Method-of-Moments Approach , 2008 .

[47]  S. Spors,et al.  Reproduction of a plane-wave sound field using planar and linear arrays of loudspeakers , 2008, 2008 3rd International Symposium on Communications, Control and Signal Processing.

[48]  Guillaume Potard,et al.  3D-AUDIO OBJECT ORIENTED CODING , 2006 .

[49]  Sylvain Choisel,et al.  Evaluation of multichannel reproduced sound: scaling auditory attributes underlying listener preference. , 2007, The Journal of the Acoustical Society of America.

[50]  Francis Rumsey,et al.  Spatial Attribute Identification and Scaling by Repertory Grid Technique and Other Methods , 1999 .

[51]  Georgios B. Giannakis,et al.  Sound Field Reproduction using the Lasso , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[52]  Koichiro Hiyama,et al.  The 22.2 Multichannel Sound System and Its Application , 2005 .

[53]  Jean Giroire Integral equation methods for the Helmholtz equation , 1982 .

[54]  Michael J. Gerzon Periphony: With-Height Sound Reproduction , 1973 .

[55]  Franz Zotter,et al.  Energy-Preserving Ambisonic Decoding , 2012 .

[56]  Michael A. Gerzon Hierarchical Transmission System for Multispeaker Stereo , 1992 .

[57]  Francis Rumsey,et al.  QESTRAL (Part 4): Test Signals, Combining Metrics, and the Prediction of Overall Spatial Quality , 2008 .

[58]  Alois Sontacchi,et al.  Localization Experiments Using Different 2 D Ambisonics Decoders , 2008 .

[59]  Alan D. Blumlein,et al.  British Patent Specification 394,325 (Improvements in and relating to Sound-transmission, Sound-recording and Sound-reproducing Systems) , 1958 .

[60]  S. Bech,et al.  Spatial aspects of reproduced sound in small rooms. , 1998, The Journal of the Acoustical Society of America.

[61]  Matti Karjalainen,et al.  Localization, Coloration, and Enhancement of Amplitude-Panned Virtual Sources , 1999 .

[62]  Stanley P. Lipshitz,et al.  Stereo Microphone Techniques: Are the Purists Wrong? , 1985 .

[63]  Ville Pulkki,et al.  Coloration of Amplitude-Panned Virtual Sources , 2001 .

[64]  Sascha Spors,et al.  A Comparison of Wave Field Synthesis and Higher-Order Ambisonics with Respect to Physical Properties and Spatial Sampling , 2008 .

[65]  Helmut. Wittek,et al.  Perceptual Differences Between Wavefield Synthesis and Stereophony. , 2007 .

[66]  Sascha Spors,et al.  Analysis and Improvement of Pre-Equalization in 2.5-Dimensional Wave Field Synthesis , 2010 .

[67]  R. Kress,et al.  Integral equation methods in scattering theory , 1983 .

[68]  R. Rabenstein,et al.  The Theory of Wave Field Synthesis Revisited , 2008 .

[69]  J. Daniel,et al.  Représentation de champs acoustiques, application à la transmission et à la reproduction de scènes sonores complexes dans un contexte multimédia , 2000 .

[70]  Francis Rumsey,et al.  QESTRAL (Part 1): Quality Evaluation of Spatial Transmission and Reproduction Using an Artificial Listener , 2008 .

[71]  Peter Fellgett,et al.  Ambisonics. Part one: General system description , 1975 .

[72]  Olivier Warusfel,et al.  Investigation of the perceived spatial resolution of higher order ambisonic sound fields : a subjective evaluation involving virtual and real 3D microphones , 2007 .

[73]  Frank Melchior,et al.  Perceptual Evaluation of a Spatial Audio Algorithm Based on Wave Field Synthesis Using a Reduced Number of Loudspeakers , 2011 .

[74]  A. Gabrielsson,et al.  Perceived sound quality of sound-reproducing systems. , 1979, The Journal of the Acoustical Society of America.

[75]  Mark A. Poletti,et al.  The Design of Encoding Functions for Stereophonic and Polyphonic Sound Systems , 1996 .

[76]  Mark A. Poletti,et al.  An Investigation of 2-D Multizone Surround Sound Systems , 2008 .

[77]  Sandra Brix,et al.  Wave Field Synthesis , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[78]  W.P.J. de Bruijn Application of wave field synthesis in videoconferencing , 2004 .

[79]  Chan-Hui Lee,et al.  A realization of sound focused personal audio system using acoustic contrast control. , 2009, The Journal of the Acoustical Society of America.

[80]  Thushara D. Abhayapala,et al.  Reproduction of a plane-wave sound field using an array of loudspeakers , 2001, IEEE Trans. Speech Audio Process..

[81]  Sascha Spors,et al.  Active listening room compensation for massive multichannel sound reproduction systems using wave-domain adaptive filtering. , 2007, The Journal of the Acoustical Society of America.

[82]  Sascha Spors,et al.  Towards a theory for arbitrarily shaped sound field reproduction systems , 2008 .

[83]  Peter G. Craven,et al.  Continuous Surround Panning for 5-Speaker Reproduction , 2003 .

[84]  Francis Rumsey,et al.  Spatial quality evaluation for reproduced sound: terminology, meaning and a scene-based paradigm , 2002 .

[85]  Thushara D. Abhayapala,et al.  Spatial multizone soundfield reproduction , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[86]  Mark A. Poletti Robust Two-Dimensional Surround Sound Reproduction for Nonuniform Loudspeaker Layouts , 2007 .

[87]  M. Kubovy,et al.  Auditory and visual objects , 2001, Cognition.

[88]  O. C. Trimble LOCALIZATION OF SOUND IN THE ANTERIOR‐POSTERIOR AND VERTICAL DIMENSIONS OF “AUDITORY” SPACE1 , 1934 .

[89]  Robert Alexander The Inventor of Stereo: The Life and Works of Alan Dower Blumlein , 2000 .

[90]  Michael Williams,et al.  Microphone Array Analysis for Multichannel Sound Recording , 1999 .

[91]  P. E. Doak,et al.  A subjective rating scale for timbre , 1976 .

[92]  O. Kirkeby,et al.  Reproduction of plane wave sound fields , 1993 .

[93]  Sascha Spors,et al.  Localization of a Virtual Point Source within the Listening Area for Wave Field Synthesis , 2012 .

[94]  E. Verheijen Sound reproduction by wave field synthesis , 1998 .

[95]  Rodney A. Kennedy,et al.  Intrinsic Limits of Dimensionality and Richness in Random Multipath Fields , 2007, IEEE Transactions on Signal Processing.

[96]  Yan Jennifer Wu,et al.  Spatial Soundfield Reproduction with Zones of Quiet , 2009 .

[97]  F. Melchior Investigations on spatial sound design based on measured room impulse responses , 2011 .

[98]  Adrian Bahne,et al.  Compensation of Loudspeaker–Room Responses in a Robust MIMO Control Framework , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[99]  Alois Sontacchi,et al.  PHANTOM SOURCE WIDENING WITH DETERMINISTIC FREQUENCY DEPENDENT TIME DELAYS , 2011 .

[100]  Wieslaw Woszczyk,et al.  Controlling Phantom Image Focus in a Multichannel Reproduction System , 1999 .

[101]  Jens Blauert,et al.  Concepts Behind Sound Quality: Some Basic Considerations , 2003 .

[102]  Sascha Spors,et al.  The SoundScape Renderer: A Unified Spatial Audio Reproduction Framework for Arbitrary Rendering Methods , 2008 .

[103]  Gary S. Kendall,et al.  The Decorrelation of Audio Signals and Its Impact on Spatial Imagery , 1995 .

[104]  Georg Plenge,et al.  Localization of Lateral Phantom Sources , 1976 .

[105]  Andreas Franck,et al.  Reproduction of Moving Sound Sources by Wave Field Synthesis: An Analysis of Artifacts , 2007 .

[106]  T. Wu,et al.  A New Look at the High Frequency Boundary Element and Rayleigh Integral Approximations , 2003 .

[107]  Michael A. Gerzon,et al.  General Metatheory of Auditory Localisation , 1992 .

[108]  H. Gaskell The precedence effect , 1983, Hearing Research.

[109]  Sascha Spors,et al.  Sound Field Synthesis Toolbox , 2012 .

[110]  Christof Faller,et al.  Binaural cue coding-Part II: Schemes and applications , 2003, IEEE Trans. Speech Audio Process..

[111]  Ville Pulkki,et al.  Spatial sound generation and perception by amplitude panning techniques , 2001 .

[112]  William G. Gardner,et al.  Efficient Convolution without Input/Output Delay , 1995 .

[113]  Ville Pulkki,et al.  Spatial Sound Reproduction with Directional Audio Coding , 2007 .

[114]  A. J. Berkhout,et al.  A Holographic Approach to Acoustic Control , 1988 .

[115]  Cumhur Erkut,et al.  Parametric time-frequency representation of spatial sound in virtual worlds , 2012, TAP.

[116]  Christof Faller,et al.  Binaural cue coding-Part I: psychoacoustic fundamentals and design principles , 2003, IEEE Trans. Speech Audio Process..

[117]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[118]  Sascha Spors,et al.  Analytical driving functions for higher order Ambisonics , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[119]  Maximo Cobos,et al.  On the Use of Small Microphone Arrays for Wave Field Synthesis Auralization , 2012 .

[120]  J. Blauert Spatial Hearing: The Psychophysics of Human Sound Localization , 1983 .

[121]  F. Wightman,et al.  The dominant role of low-frequency interaural time differences in sound localization. , 1992, The Journal of the Acoustical Society of America.

[122]  Catherine Guastavino,et al.  Perceptual evaluation of multi-dimensional spatial audio reproduction. , 2004, The Journal of the Acoustical Society of America.

[123]  Francis Rumsey,et al.  Localization Curves for a Regularly-Spaced Octagon Loudspeaker Array , 2009 .

[124]  Martin Schneider,et al.  Adaptive listening room equalization using a scalable filtering structure in thewave domain , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[125]  Jerome Daniel,et al.  Spatial Sound Encoding Including Near Field Effect: Introducing Distance Coding Filters and a Viable, New Ambisonic Format , 2003 .

[126]  Philip A. Nelson,et al.  The Ill-Conditioning Problem in Sound Field Reconstruction , 2007 .

[127]  Paul Troughton,et al.  Convenient Multi-Channel Sound in the Home , 2002 .

[128]  Francis Rumsey,et al.  QESTRAL (Part 3): System and Metrics for Spatial Quality Prediction , 2008 .

[129]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[130]  Philip A. Nelson,et al.  Surround System Based on Three-Dimensional Sound Field Reconstruction , 2008 .

[131]  Sascha Spors,et al.  Perceptual Evaluation of Focused Sources in Wave Field Synthesis , 2010 .

[132]  Mark A. Poletti,et al.  Three-Dimensional Surround Sound Systems Based on Spherical Harmonics , 2005 .