Extension and evaluation of a near-end listening enhancement algorithm for listeners with normal and impaired hearing.

In many applications in which speech is played back via a sound reinforcement system such as public address systems and mobile phones, speech intelligibility is degraded by additive environmental noise. A possible solution to maintain high intelligibility in noise is to pre-process the speech signal based on the estimated noise power at the position of the listener. The previously proposed AdaptDRC algorithm [Schepker, Rennies, and Doclo (2015). J. Acoust. Soc. Am. 138, 2692-2706] applies both frequency shaping and dynamic range compression under an equal-power constraint, where the processing is adaptively controlled by short-term estimates of the speech intelligibility index. Previous evaluations of the algorithm have focused on normal-hearing listeners. In this study, the algorithm was extended with an adaptive gain stage under an equal-peak-power constraint, and evaluated with eleven normal-hearing and ten mildly to moderately hearing-impaired listeners. For normal-hearing listeners, average improvements in speech reception thresholds of about 4 and 8 dB compared to the unprocessed reference condition were measured for the original algorithm and its extension, respectively. For hearing-impaired listeners, the average improvements were about 2 and 6 dB, indicating that the relative improvement due to the proposed adaptive gain stage was larger for these listeners than the benefit of the original processing stages.

[1]  Birger Kollmeier,et al.  Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests. , 2002, The Journal of the Acoustical Society of America.

[2]  Jan Rennies,et al.  Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index. , 2015, The Journal of the Acoustical Society of America.

[3]  Yannis Stylianou,et al.  Evaluating the intelligibility benefit of speech modifications in known noise conditions , 2013, Speech Commun..

[4]  P. Van cauwenberge,et al.  Sensitivity of transient evoked and distortion product otoacoustic emissions to the direct effects of noise on the human cochlea. , 1999, Audiology : official organ of the International Society of Audiology.

[5]  Richard C. Hendriks,et al.  Optimizing Speech Intelligibility in a Noisy Environment: A unified view , 2015, IEEE Signal Processing Magazine.

[6]  Takayuki Arai,et al.  Using Steady-State Suppression to Improve Speech Intelligibility in Reverberant Environments for Elderly Listeners , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Richard Heusdens,et al.  Speech energy redistribution for intelligibility improvement in noise based on a perceptual distortion measure , 2014, Comput. Speech Lang..

[8]  Jan Rennies,et al.  Improving speech intelligibility in noise by SII-dependent preprocessing using frequency-dependent amplification and dynamic range compression , 2013, INTERSPEECH.

[9]  Elias Azarov,et al.  General-Purpose Listening Enhancement Based on Subband Non-Linear Amplification with Psychoacoustic Criterion , 2015 .

[10]  Yan Tang,et al.  Subjective and Objective Evaluation of Speech Intelligibility Enhancement Under Constant Energy and Duration Constraints , 2011, INTERSPEECH.

[11]  Yannis Stylianou,et al.  Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression , 2012, INTERSPEECH.

[12]  Peter Vary,et al.  Near-End Listening Enhancement in the Presence of Bandpass Noises , 2012, ITG Conference on Speech Communication.

[13]  Jesper Jensen,et al.  SII-based speech preprocessing for intelligibility improvement in noise , 2013, INTERSPEECH.

[14]  J T Goodman,et al.  The intelligibility of whitened and peak clipped speech. , 1978, Journal of the American Audiology Society.

[15]  Marc Moonen,et al.  Real-Time Perception-Based Clipping of Audio Signals Using Convex Optimization , 2012, IEEE Transactions on Audio, Speech, and Language Processing.

[16]  Birger Kollmeier,et al.  Listening effort and speech intelligibility in listening situations affected by noise and reverberation. , 2014, The Journal of the Acoustical Society of America.

[17]  Sergei Kochkin,et al.  MarkeTrak VIII: The Key Influencing Factors in Hearing Aid Purchase Intent , 2012 .

[18]  Gerhard Schmidt,et al.  Speech and Audio Processing in Adverse Environments , 2008 .

[19]  K. S. Rhebergen,et al.  A Speech Intelligibility Index-based approach to predict the speech reception threshold for sentences in fluctuating noise for normal-hearing listeners. , 2005, The Journal of the Acoustical Society of America.

[20]  Hervé Lissek,et al.  Intelligibility enhancement of vocal announcements for public address systems: a design for all through a presbycusis pre-compensation filter , 2015, INTERSPEECH.

[21]  Rafik Goubran,et al.  NONLINEAR LOUDSPEAKER COMPENSATION FOR HANDS FREE ACOUSTIC ECHO CANCELLATION , 1996 .

[22]  Hiroshi Sato,et al.  Using listening difficulty ratings of conditions for speech communication in rooms. , 2005, The Journal of the Acoustical Society of America.

[23]  Cassia Valentini-Botinhao,et al.  Intelligibility-enhancing speech modifications: the hurricane challenge , 2020, INTERSPEECH.

[24]  Jesper Jensen,et al.  An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[25]  Peter Vary,et al.  Recursive Closed-Form Optimization of Spectral Audio Power Allocation for Near End Listening Enhancement , 2010, Sprachkommunikation.