Headphone-Based Spatial Sound

With its power to transport the listener to a distant real or virtual world, realistic spatial audio has a significant role to II play for immersive communications. Headphone-based rendering is particularly attractive for mobile communications systems. Augmented realism and versatility in applications can be achieved when the headphone signals respond dynamically to the motion of the listener. The timely development of miniature lowpower motion sensors is making this technology possible. This article reviews the physical and psychoacoustic foundations, practical methods, and engineering challenges to the realization of motion-tracked sound over headphones. Some new applications that are enabled by this technology are outlined.

[1]  Tomlinson Holman,et al.  Surrounded by sound , 1999 .

[2]  D. Brungart,et al.  The effects of production and presentation level on the auditory distance perception of speech. , 2001, The Journal of the Acoustical Society of America.

[3]  Gary S. Kendall,et al.  The Decorrelation of Audio Signals and Its Impact on Spatial Imagery , 1995 .

[4]  F. Wightman,et al.  A model of head-related transfer functions based on principal components analysis and minimum-phase reconstruction. , 1992, The Journal of the Acoustical Society of America.

[5]  D. M. Green,et al.  Directional dependence of interaural envelope delays. , 1990, The Journal of the Acoustical Society of America.

[6]  Jean-Marc Jot,et al.  Real-time spatial processing of sounds for music, multimedia and interactive human-computer interfaces , 1999, Multimedia Systems.

[7]  Michael A. Gerzon,et al.  Ambisonics in Multichannel Broadcasting and Video , 1985 .

[8]  F L Wightman,et al.  Headphone simulation of free-field listening. I: Stimulus synthesis. , 1989, The Journal of the Acoustical Society of America.

[9]  J. C. Middlebrooks,et al.  Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. , 2002, The Journal of the Acoustical Society of America.

[10]  P. Scheiber History of Spatial Coding , 2003 .

[11]  Sandra Brix,et al.  Wave Field Synthesis , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[12]  Ramani Duraiswami,et al.  INTERPOLATION AND RANGE EXTRAPOLATION OF HRTFS , 2004 .

[13]  V R Algazi,et al.  Elevation localization and head-related transfer function analysis at low frequencies. , 2001, The Journal of the Acoustical Society of America.

[14]  Mendel Kleiner,et al.  Auralization-An Overview , 1993 .

[15]  William A. Yost,et al.  Spatial hearing: The psychophysics of human sound localization, revised edition , 1998 .

[16]  V. Ralph Algazi,et al.  Customization for Personalized Rendering of Motion-Tracked Binaural Sound , 2004 .

[17]  Rick L. Jenison,et al.  A Spherical Basis Function Neural Network for Modeling Auditory Space , 1996, Neural Computation.

[18]  F L Wightman,et al.  Resolution of front-back ambiguity in spatial hearing by listener and source movement. , 1999, The Journal of the Acoustical Society of America.

[19]  V. Ralph Algazi,et al.  Immersive spatial sound for mobile multimedia , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[20]  David Griesinger,et al.  Binaural Techniques for Music Reproduction , 1990 .

[21]  Nobuhiko Kitawaki,et al.  Common-acoustical-pole and zero modeling of head-related transfer functions , 1999, IEEE Trans. Speech Audio Process..

[22]  Mark B. Gardner,et al.  Distance Estimation of 0° or Apparent 0°‐Oriented Speech Signals in Anechoic Space , 1969 .

[23]  Günther Theile,et al.  Binaural room scanning—A new tool for acoustic and psychoacoustic research , 1999 .

[24]  Tammo Houtgast,et al.  Auditory distance perception in rooms , 1999, Nature.

[25]  H. Steven Colburn,et al.  Role of spectral detail in sound-source localization , 1998, Nature.

[26]  F. Wightman,et al.  The dominant role of low-frequency interaural time differences in sound localization. , 1992, The Journal of the Acoustical Society of America.

[27]  Larry S. Davis,et al.  High Order Spatial Audio Capture and Its Binaural Head-Tracked Playback Over Headphones with HRTF Cues , 2005 .

[28]  J. C. Middlebrooks Narrow-band sound localization related to external ear acoustics. , 1992, The Journal of the Acoustical Society of America.

[29]  D Pralong,et al.  The role of individualized headphone calibration for the generation of high fidelity virtual auditory space. , 1996, The Journal of the Acoustical Society of America.

[30]  V. Ralph Algazi,et al.  Motion-Tracked Binaural Sound , 2004 .

[31]  Richard O. Duda,et al.  A structural model for binaural sound synthesis , 1998, IEEE Trans. Speech Audio Process..

[32]  Jyri Huopaniemi Future of Personal Audio: Smart Applications and Immersive Communication , 2007 .

[33]  Durand R. Begault,et al.  Auditory and Non-Auditory Factors that Potentially Influence Virtual Acoustic Imagery , 1999 .

[34]  Christof Faller,et al.  Sound Field Analysis along a Circle and Its Applications to HRTF Interpolation , 2008 .

[35]  D. Brungart Auditory localization of nearby sources. III. Stimulus effects. , 1999, The Journal of the Acoustical Society of America.

[36]  Dorte Hammershøi,et al.  Binaural Technique: Do We Need Individual Recordings? , 1996 .

[37]  Fiona Harvey,et al.  Surrounded by sound. , 2002, Scientific American.