Parametric Spatial Sound Processing: A flexible and efficient solution to sound scene acquisition, modification, and reproduction

Flexible and efficient spatial sound acquisition and subsequent processing are of paramount importance in communication and assisted listening devices such as mobile phones, hearing aids, smart TVs, and emerging wearable devices (e.g., smart watches and glasses). In application scenarios where the number of sound sources quickly varies, sources move, and nonstationary noise and reverberation are commonly encountered, it remains a challenge to capture sounds in such a way that they can be reproduced with a high and invariable sound quality. In addition, the objective in terms of what needs to be captured, and how it should be reproduced, depends on the application and on the user?s preferences. Parametric spatial sound processing has been around for two decades and provides a flexible and efficient solution to capture, code, and transmit, as well as manipulate and reproduce spatial sounds.

[1]  Ville Pulkki,et al.  Spatial Sound Reproduction with Directional Audio Coding , 2007 .

[2]  Mikko-Ville Laitinen,et al.  Binaural reproduction for Directional Audio Coding , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[3]  Maja Taseska,et al.  The diffuse sound field in energetic analysis. , 2012, The Journal of the Acoustical Society of America.

[4]  Ivan Tashev,et al.  Microphone Array for Headset with Spatial Noise Suppressor , 2005 .

[5]  Emanuel A. P. Habets,et al.  An Informed Parametric Spatial Filter Based on Instantaneous Direction-of-Arrival Estimates , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[6]  Gary W. Elko,et al.  Spatial Coherence Functions for Differential Microphones in Isotropic Noise Fields , 2001, Microphone Arrays.

[7]  Emanuel A. P. Habets,et al.  Extracting Reverberant Sound Using a Linearly Constrained Minimum Variance Spatial Filter , 2014, IEEE Signal Processing Letters.

[8]  Giovanni Del Galdo,et al.  Generating virtual microphone signals using geometrical information gathered by distributed arrays , 2011, 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays.

[9]  Boaz Rafaely,et al.  Microphone Array Signal Processing , 2008 .

[10]  Jürgen Herre,et al.  Interactive Teleconferencing Combining Spatial Audio Object Coding and DirAC Technology , 2010 .

[11]  Özgür Yilmaz,et al.  On the approximate W-disjoint orthogonality of speech , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  A. Gualtierotti H. L. Van Trees, Detection, Estimation, and Modulation Theory, , 1976 .

[13]  Ville Pulkki,et al.  Virtual Sound Source Positioning Using Vector Base Amplitude Panning , 1997 .

[14]  Christof Faller Microphone Front-Ends for Spatial Audio Coders , 2008 .

[15]  Sharon Gannot,et al.  Adaptive Beamforming and Postfiltering , 2008 .

[16]  Ville Pekka Sivonen,et al.  Parametric Spatial Sound Processing Applied to Bilateral Hearing Aids , 2012 .

[17]  Christof Faller,et al.  PARAMETRIC CODING OF SPATIAL AUDIO , 2004 .

[18]  Giovanni Del Galdo,et al.  Dereverberation in the Spatial Audio Coding Domain , 2011 .

[19]  Michael M. Goodwin,et al.  Spatial Audio Scene Coding , 2008 .

[20]  Svein Berge,et al.  HIGH ANGULAR RESOLUTION PLANEWAVE EXPANSION , 2010 .

[21]  Emanuel A. P. Habets,et al.  Sound acquisition in noisy and reverberant environments using virtual microphones , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[22]  Richard Schultz-Amling,et al.  Acoustical Zooming Based on a Parametric Sound Field Representation , 2010 .

[23]  Emanuel A. P. Habets,et al.  Geometry-Based Spatial Sound Acquisition Using Distributed Microphone Arrays , 2013, IEEE Transactions on Audio, Speech, and Language Processing.

[24]  Juha Merimaa,et al.  Spatial Impulse Response Rendering II: Reproduction of Diffuse Sound and Listening Tests , 2006 .

[25]  Volker Hohmann,et al.  Strategy-selective noise reduction for binaural digital hearing aids , 2003, Speech Commun..

[26]  Jean-Marc Jot,et al.  Beyond Coding: Reproduction of Direct and Diffuse Sound in Multiple Environments , 2010 .

[27]  Giovanni Del Galdo,et al.  On the spatial coherence in mixed sound fields and its application to signal-to-diffuse ratio estimation. , 2012, The Journal of the Acoustical Society of America.

[28]  Giovanni Del Galdo,et al.  Spatial filtering using directional audio coding parameters , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.