Spatial release from masking for binaural reproduction of speech in noise with varying spherical harmonics order

Abstract Spatial release from masking (SRM) in the context of speech perception denotes the improvement of speech intelligibility in noise when the speech and the noise sources are spatially separated in space. A similar effect has been reported with binaural sound reproduction (BSR), in which speech and noise are simulated and reproduced using headphones. The effect of SRM on speech intelligibility using BSR has not yet been thoroughly studied for signals represented by different spherical harmonic (SH) orders. This may be important when speech corrupted by noise is delivered in spatial audio channels, such as Ambisonics and higher order Ambisonics. In this work, a hearing in noise test (HINT) is performed in which a female speaker and noise are binaurally reproduced using headphones under different SH orders and different signal to noise ratio (SNR) values. The results show that a first SH order is sufficient to achieve intelligibility that is as good as the highest SH order used in this work.

[1]  Sean Wallis,et al.  Binomial Confidence Intervals and Contingency Tests: Mathematical Fundamentals and the Evaluation of Alternative Methods , 2013, J. Quant. Linguistics.

[2]  Boaz Rafaely,et al.  Joint sampling theory and subjective investigation of plane-wave and spherical harmonics formulations for binaural reproduction , 2018 .

[3]  Noam R. Shabtai,et al.  Optimization of the directivity in binaural sound reproduction beamforming. , 2015, The Journal of the Acoustical Society of America.

[4]  Boaz Rafaely,et al.  Spatial perception of sound fields recorded by spherical microphone arrays with varying spatial resolution. , 2013, The Journal of the Acoustical Society of America.

[5]  R Plomp,et al.  Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. , 1992, The Journal of the Acoustical Society of America.

[6]  D. Wolfe,et al.  Nonparametric Statistical Methods. , 1974 .

[7]  Boaz Rafaely,et al.  Generalized Spherical Array Beamforming for Binaural Speech Reproduction , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[8]  Ruth Y. Litovsky,et al.  Spatial Release from Masking , 2012 .

[9]  Boaz Rafaely,et al.  Intelligibility of speech in noise under diotic and dichotic binaural listening , 2017 .

[10]  Boaz Rafaely,et al.  Interaural cross correlation in a sound field represented by spherical harmonics. , 2009, The Journal of the Acoustical Society of America.

[11]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[12]  Tammo Houtgast,et al.  Development and validation of an automatic speech-in-noise screening test by telephone , 2004, International journal of audiology.

[13]  Boaz Rafaely,et al.  Fundamentals of Spherical Array Processing , 2015, Springer Topics in Signal Processing.

[14]  Cas Smits,et al.  The interpretation of speech reception threshold data in normal-hearing and hearing-impaired listeners: steady-state noise. , 2011, The Journal of the Acoustical Society of America.

[15]  G. Arfken Mathematical Methods for Physicists , 1967 .

[16]  Robert Höldrich,et al.  A 3D Ambisonic Based Binaural Sound Reproduction System , 2003 .