Deep Neural Network-Based Generalized Sidelobe Canceller for Robust Multi-Channel Speech Recognition
暂无分享,去创建一个
[1] Richard M. Stern,et al. Likelihood-maximizing beamforming for robust hands-free speech recognition , 2004, IEEE Transactions on Speech and Audio Processing.
[2] John McDonough,et al. Distant Speech Recognition , 2009 .
[3] Liang Lu,et al. Deep beamforming networks for multi-channel speech recognition , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] Reinhold Häb-Umbach,et al. Blind Acoustic Beamforming Based on Generalized Eigenvalue Decomposition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[5] Ehud Weinstein,et al. Signal enhancement using beamforming and nonstationarity with applications to speech , 2001, IEEE Trans. Signal Process..
[6] L. Griffiths,et al. An adaptive generalized sidelobe canceller with derivative constraints , 1986 .
[7] Jont B. Allen,et al. Image method for efficiently simulating small‐room acoustics , 1976 .
[8] H. Cox. Resolving power and sensitivity to mismatch of optimum array processors , 1973 .
[9] Reinhold Häb-Umbach,et al. Beamnet: End-to-end training of a beamformer-supported multi-channel ASR system , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[10] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[11] Nikko Strom,et al. Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[12] Aaas News,et al. Book Reviews , 1893, Buffalo Medical and Surgical Journal.
[13] G. Carter,et al. The generalized correlation method for estimation of time delay , 1976 .
[14] Nikko Strom,et al. Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Björn Schuller,et al. Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments , 2017 .
[16] Nobutaka Ito,et al. The Diverse Environments Multi-channel Acoustic Noise Database (DEMAND): A database of multichannel environmental noise recordings , 2013 .
[17] Shih-Chii Liu,et al. Multi-channel Attention for End-to-End Speech Recognition , 2018, INTERSPEECH.
[18] Steve Renals,et al. WSJCAMO: a British English speech corpus for large vocabulary continuous speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[19] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[20] Tara N. Sainath,et al. Factored spatial and spectral multichannel raw waveform CLDNNs , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Jacob Benesty,et al. On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction , 2010, IEEE Transactions on Audio, Speech, and Language Processing.
[22] Marc Moonen,et al. Superdirective Beamforming Robust Against Microphone Mismatch , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[23] Tara N. Sainath,et al. Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition , 2016, INTERSPEECH.
[24] Tara N. Sainath,et al. Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition , 2017, IEEE/ACM Transactions on Audio, Speech, and Language Processing.
[25] Ian Lane,et al. End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition , 2017, INTERSPEECH.
[26] M.L. Seltzer. Bridging the Gap: Towards a Unified Framework for Hands-Free Speech Recognition Using Microphone Arrays , 2008, 2008 Hands-Free Speech Communication and Microphone Arrays.
[27] Ian Lane,et al. Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition , 2016, INTERSPEECH.
[28] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .
[29] John R. Hershey,et al. Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming , 2017, IEEE Journal of Selected Topics in Signal Processing.
[30] J. Flanagan,et al. Computer‐steered microphone arrays for sound transduction in large rooms , 1985 .
[31] Changliang Li,et al. Direction-Aware Speaker Beam for Multi-Channel Speaker Extraction , 2019, INTERSPEECH.
[32] Kevin Barraclough,et al. I and i , 2001, BMJ : British Medical Journal.
[33] Tianqi Chen,et al. Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.
[34] Sharon Gannot,et al. Adaptive Beamforming and Postfiltering , 2008 .
[35] R. Maas,et al. A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research , 2016, EURASIP Journal on Advances in Signal Processing.
[36] P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .
[37] Ulpu Remes,et al. Techniques for Noise Robustness in Automatic Speech Recognition , 2012 .