Microphone Array Geometries for Horizontal Spatial Audio Object Capture With Beamforming

Microphone array beamforming can be used to enhance and separate sound sources, with applications in the capture of object-based audio. Many beamforming methods have been proposed and assessed against each other. However, the effects of compact microphone array design on beamforming performance have not been studied for this kind of application. This study investigates how to maximize the quality of audio objects extracted from a horizontal sound field by filter-and-sum beamforming, through appropriate choice of microphone array design. Eight uniform geometries with practical constraints of a limited number of microphones and maximum array size are evaluated over a range of physical metrics. Results show that baffled circular arrays outperform the other geometries in terms of perceptually relevant frequency range, spatial resolution, directivity and robustness. Moreover, a subjective evaluation of microphone arrays and beamformers is conducted with regards to the quality of the target sound, interference suppression and overall quality of simulated music performance recordings. Baffled circular arrays achieve higher target quality and interference suppression than alternative geometries with wideband signals. Furthermore, subjective scores of beamformers regarding target quality and interference suppression agree well with beamformer onaxis and off-axis responses; with wideband signals the superdirective beamformer achieves the highest overall quality.

[1]  Shoichi Koyama,et al.  Mixed-order Ambisonics encoding of cylindrical microphone array signals , 2014 .

[2]  O. Hoshuyama,et al.  A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Gary W. Elko,et al.  Constant beamwidth beamforming , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Angelo Farina,et al.  SPATIAL SOUND RECORDING WITH DENSE MICROPHONE ARRAYS , 2014 .

[5]  J. Capon High-resolution frequency-wavenumber spectrum analysis , 1969 .

[6]  Jacob Benesty,et al.  On the Design of Frequency-Invariant Beampatterns With Uniform Circular Microphone Arrays , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[7]  Jacob Benesty,et al.  Study and Design of Differential Microphone Arrays , 2012 .

[8]  Boaz Rafaely,et al.  On the Consistent Use of Space and Time Conventions in Array Processing , 2015 .

[9]  Walter Kellermann,et al.  Acoustic source detection and localization based on wavefield decomposition using circular microphone arrays , 2006 .

[10]  Yiteng Huang Immersive audio schemes , 2011, IEEE Signal Processing Magazine.

[11]  Emmanuel Vincent,et al.  Subjective and Objective Quality Assessment of Audio Source Separation , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Boaz Rafaely,et al.  Open-Sphere Designs for Spherical Microphone Arrays , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Angelo Farina,et al.  Spatial PCM Sampling: A New Method for Sound Recording and Playback , 2013 .

[14]  Klaus Uwe Simmer,et al.  Superdirective Microphone Arrays , 2001, Microphone Arrays.

[15]  Gary W. Elko,et al.  Differential Microphone Arrays , 2004 .

[16]  Henry Cox,et al.  Practical supergain , 1986, IEEE Trans. Acoust. Speech Signal Process..

[17]  Jacob Benesty,et al.  Audio Signal Processing for Next-Generation Multimedia Communication Systems , 2004 .

[18]  Jens Meyer,et al.  Beamforming for a circular microphone array mounted on spherically shaped objects , 2001 .

[19]  P. Stoica,et al.  Robust Adaptive Beamforming , 2013 .

[20]  Method for the subjective assessment of intermediate quality level of , 2014 .

[21]  Jacob Benesty,et al.  Reduced-Order Robust Superdirective Beamforming With Uniform Linear Microphone Arrays , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[22]  Boaz Rafaely The Spherical-Shell Microphone Array , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[23]  Antoine Liutkus,et al.  The 2018 Signal Separation Evaluation Campaign , 2018, LVA/ICA.

[24]  Algorithms to measure audio programme loudness and true-peak audio level , 2011 .

[25]  Franz Pernkopf,et al.  A multi-channel postfilter based on the diffuse noise sound field , 2014, 2014 22nd European Signal Processing Conference (EUSIPCO).

[26]  Philip J. B. Jackson,et al.  Audio Object Separation Using Microphone Array Beamforming , 2015 .

[27]  Rudolf Rabenstein,et al.  CARDIOID PATTERN OPTIMIZATION FOR A VIRTUAL CIRCULAR MICRO PHONE ARRAY , 2009 .

[29]  Filippo Maria Fazi,et al.  A System Architecture for Semantically Informed Rendering of Object-Based Audio , 2019 .

[30]  Miguel Blanco Galindo Microphone array beamforming for spatial audio object capture. , 2020 .

[31]  Philip Coleman,et al.  Robust Hypercardioid Synthesis for Spatial Audio Capture: Microphone Geometry, Directivity and Regularization , 2019 .

[32]  Sridha Sridharan,et al.  Robust speaker recognition using microphone arrays , 2001, Odyssey.

[33]  Ben Shirley,et al.  Personalization in Object-based Audio for Accessibility: A Review of Advancements for Hearing Impaired Listeners , 2019, Journal of the Audio Engineering Society.

[34]  G. F. Kuhn Model for the interaural time differences in the azimuthal plane , 1977 .

[35]  B.D. Van Veen,et al.  Beamforming: a versatile approach to spatial filtering , 1988, IEEE ASSP Magazine.

[36]  Jürgen Herre,et al.  Source Separation for Enabling Dialogue Enhancement in Object-based Broadcast with MPEG-H , 2019, Journal of the Audio Engineering Society.

[37]  Boaz Rafaely,et al.  Analysis and design of spherical microphone arrays , 2005, IEEE Transactions on Speech and Audio Processing.

[38]  Frank Melchior,et al.  An Audio-Visual System for Object-Based Audio: From Recording to Listening , 2018, IEEE Transactions on Multimedia.

[39]  Wen Zhang,et al.  Theory and design of compact hybrid microphone arrays on two-dimensional planes for three-dimensional soundfield analysis. , 2015, The Journal of the Acoustical Society of America.

[40]  Angelo Farina,et al.  A Spherical Microphone Array for Synthesizing Virtual Directive Microphones in Live Broadcasting and in Post Production , 2010 .

[41]  Heinz Teutsch,et al.  Modal Array Signal Processing: Principles and Applications of Acoustic Wavefield Decomposition , 2007 .

[42]  B. Rafaely Plane-wave decomposition of the sound field on a sphere by spherical convolution , 2004 .

[43]  Franz Pernkopf,et al.  Blind source extraction based on a direction-dependent a-priori SNR , 2014, INTERSPEECH.

[44]  Gary W. Elko,et al.  A highly scalable spherical microphone array based on an orthonormal decomposition of the soundfield , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[45]  Frank Melchior,et al.  Object-Based Reverberation for Spatial Audio , 2017 .

[46]  D P Jarrett,et al.  Rigid sphere room impulse response simulation: algorithm and applications. , 2012, The Journal of the Acoustical Society of America.

[47]  Rudolf Rabenstein,et al.  Wave field analysis using multiple radii measurements , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.