Use of a Deep Recurrent Neural Network to Reduce Wind Noise: Effects on Judged Speech Intelligibility and Sound Quality

Despite great advances in hearing-aid technology, users still experience problems with noise in windy environments. The potential benefits of using a deep recurrent neural network (RNN) for reducing wind noise were assessed. The RNN was trained using recordings of the output of the two microphones of a behind-the-ear hearing aid in response to male and female speech at various azimuths in the presence of noise produced by wind from various azimuths with a velocity of 3 m/s, using the “clean” speech as a reference. A paired-comparison procedure was used to compare all possible combinations of three conditions for subjective intelligibility and for sound quality or comfort. The conditions were unprocessed noisy speech, noisy speech processed using the RNN, and noisy speech that was high-pass filtered (which also reduced wind noise). Eighteen native English-speaking participants were tested, nine with normal hearing and nine with mild-to-moderate hearing impairment. Frequency-dependent linear amplification was provided for the latter. Processing using the RNN was significantly preferred over no processing by both subject groups for both subjective intelligibility and sound quality, although the magnitude of the preferences was small. High-pass filtering (HPF) was not significantly preferred over no processing. Although RNN was significantly preferred over HPF only for sound quality for the hearing-impaired participants, for the results as a whole, there was a preference for RNN over HPF. Overall, the results suggest that reduction of wind noise using an RNN is possible and might have beneficial effects when used in hearing aids.

[1]  B. Moore,et al.  Perceived naturalness of spectrally distorted speech and music. , 2003, The Journal of the Acoustical Society of America.

[2]  DeLiang Wang,et al.  Features for Masking-Based Monaural Speech Separation in Reverberant Conditions , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[3]  Yang Lu,et al.  An algorithm that improves speech intelligibility in noise for normal-hearing listeners. , 2009, The Journal of the Acoustical Society of America.

[4]  Francis Kuk,et al.  Evaluation of a Wind Noise Attenuation Algorithm on Subjective Annoyance and Speech‐in‐Wind Performance , 2017, Journal of the American Academy of Audiology.

[5]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[6]  Björn W. Schuller,et al.  Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR , 2015, LVA/ICA.

[7]  R. Patterson,et al.  Time-domain modeling of peripheral auditory processing: a modular architecture and a software platform. , 1995, The Journal of the Acoustical Society of America.

[8]  Sergei Kochkin,et al.  MarkeTrak VIII: Consumer satisfaction with hearing aids is slowly increasing , 2010 .

[9]  Yuan Tang,et al.  TF.Learn: TensorFlow's High-level Module for Distributed Machine Learning , 2016, ArXiv.

[10]  R. Raspet,et al.  Framework for wind noise studies , 2006 .

[11]  Dong Yu,et al.  Multitalker Speech Separation With Utterance-Level Permutation Invariant Training of Deep Recurrent Neural Networks , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[12]  DeLiang Wang,et al.  Long short-term memory for speaker generalization in supervised speech separation. , 2017, The Journal of the Acoustical Society of America.

[13]  Justin A. Zakis Wind noise at microphones within and across hearing aids at wind speeds below and above microphone saturation. , 2011, The Journal of the Acoustical Society of America.

[14]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[15]  Elias Nemer,et al.  Single-microphone wind noise reduction by adaptive postfiltering , 2009, 2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[16]  DeLiang Wang,et al.  An algorithm to increase speech intelligibility for hearing-impaired listeners in novel segments of the same noise type. , 2015, The Journal of the Acoustical Society of America.

[17]  Jörg Wuttke Microphones and Wind , 1992 .

[18]  Paris Smaragdis,et al.  Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source Separation , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[19]  Jessica J. M. Monaghan,et al.  Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users , 2017, Hearing Research.

[20]  Brian R Glasberg,et al.  Derivation of auditory filter shapes from notched-noise data , 1990, Hearing Research.

[21]  Christine M. Tan,et al.  Robust wind noise detection , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Birger Kollmeier,et al.  SNR estimation based on amplitude modulation analysis with applications to noise suppression , 2003, IEEE Trans. Speech Audio Process..

[23]  Andrew W. Senior,et al.  Long short-term memory recurrent neural network architectures for large scale acoustic modeling , 2014, INTERSPEECH.

[24]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[25]  M A Stone,et al.  Tolerable hearing aid delays. I. Estimation of limits imposed by the auditory path alone using simulated hearing losses. , 1999, Ear and hearing.

[26]  B. Moore,et al.  Tolerable Hearing-Aid Delays: IV. Effects on Subjective Disturbance During Speech Production by Hearing-Impaired Subjects , 2005, Ear and hearing.

[27]  J. Larsen,et al.  Wind Noise Reduction using Non-Negative Sparse Coding , 2007, 2007 IEEE Workshop on Machine Learning for Signal Processing.

[28]  R. M. Sachs,et al.  Anthropometric manikin for acoustic research. , 1975, The Journal of the Acoustical Society of America.

[29]  DeLiang Wang,et al.  Large-scale training to increase speech intelligibility for hearing-impaired listeners in novel noises. , 2016, The Journal of the Acoustical Society of America.

[30]  Xin Yang,et al.  Auditory inspired machine learning techniques can improve speech intelligibility and quality for hearing-impaired listeners. , 2017, The Journal of the Acoustical Society of America.

[31]  Brian C. J. Moore,et al.  Hearing Aid Signal Processing , 2016 .

[32]  Zachary Chase Lipton A Critical Review of Recurrent Neural Networks for Sequence Learning , 2015, ArXiv.

[33]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[34]  Brian C J Moore,et al.  Comparison of the CAM2 and NAL-NL2 Hearing Aid Fitting Methods , 2013, Ear and hearing.

[35]  Gitte Keidser,et al.  The National Acoustic Laboratories (NAL) CDs of Speech and Noise for Hearing Aid Evaluation: Normative Data and Potential Applications , 2002 .

[36]  B C Moore,et al.  Use of a loudness model for hearing-aid fitting. I. Linear hearing aids. , 1998, British journal of audiology.

[37]  DeLiang Wang,et al.  An algorithm to improve speech recognition in noise for hearing-impaired listeners. , 2013, The Journal of the Acoustical Society of America.

[38]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[39]  King Chung,et al.  Wind noise in hearing aids with directional and omnidirectional microphones: polar characteristics of behind-the-ear hearing aids. , 2009, The Journal of the Acoustical Society of America.

[40]  James M. Kates,et al.  Digital hearing aids. , 2008, Harvard health letter.

[41]  D. M. Green,et al.  Signal detection theory and psychophysics , 1966 .

[42]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  DeLiang Wang,et al.  An algorithm to increase intelligibility for hearing-impaired listeners in the presence of a competing talker. , 2017, The Journal of the Acoustical Society of America.