Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition
Nikko Strom | Björn Hoffmeister | Shiva Sundaram | Ken'ichi Kumatani | Minhua Wu
[1] Maurizio Omologo, et al. Cepstral distance based channel selection for distant speech recognition, 2018, Comput. Speech Lang.
[2] Bhiksha Raj, et al. Non-negative matrix factorization based compensation of music for automatic speech recognition, 2010, INTERSPEECH.
[3] Tara N. Sainath, et al. Performance of Mask Based Statistical Beamforming in a Smart Home Scenario, 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[4] K. Kumatani, et al. On Hidden Markov Model Maximum Negentropy Beamforming, 2008.
[5] Marc Moonen, et al. Superdirective Beamforming Robust Against Microphone Mismatch, 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[6] Ivan Himawan, et al. Microphone Array Shape Calibration in Diffuse Noise Fields, 2008, IEEE Transactions on Audio, Speech, and Language Processing.
[7] Tomohiro Nakatani, et al. Frame-by-Frame Closed-Form Update for Mask-Based Adaptive MVDR Beamforming, 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[8] Nikko Strom, et al. Direct modeling of raw audio with DNNs for wake word detection, 2017, 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
[9] Sridha Sridharan, et al. Clustered Blind Beamforming From Ad-Hoc Microphone Arrays, 2011, IEEE Transactions on Audio, Speech, and Language Processing.
[10] Ulpu Remes, et al. Techniques for Noise Robustness in Automatic Speech Recognition, 2012.
[11] E. A. P. Habets. Single- and multi-microphone speech dereverberation using spectral enhancement, 2007.
[12] Sree Hari Krishnan Parthasarathi, et al. Robust Speech Recognition via Anchor Word Representations, 2017, INTERSPEECH.
[13] Martin Wolf, et al. Channel selection measures for multi-microphone speech recognition, 2014, Speech Commun.
[14] Ivan Tashev, et al. Sound Capture and Processing: Practical Approaches, 2009.
[15] R. Maas, et al. A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research, 2016, EURASIP Journal on Advances in Signal Processing.
[16] Sree Hari Krishnan Parthasarathi, et al. Lessons from Building Acoustic Models with a Million Hours of Speech, 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Richard M. Stern, et al. Likelihood-maximizing beamforming for robust hands-free speech recognition, 2004, IEEE Transactions on Speech and Audio Processing.
[18] Bhiksha Raj, et al. Microphone array processing for distant speech recognition: Towards real-world deployment, 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.
[19] Maurizio Omologo, et al. Speech Recognition with Microphone Arrays, 2001, Microphone Arrays.
[20] Nikko Strom, et al. Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition, 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[21] Nikko Strom, et al. Scalable distributed DNN training using commodity GPU cloud computing, 2015, INTERSPEECH.
[22] S. Haykin, et al. Adaptive Filter Theory, 1986.
[23] Shih-Chii Liu, et al. Multi-channel Attention for End-to-End Speech Recognition, 2018, INTERSPEECH.
[24] Jill Fain Lehman, et al. Channel selection based on multichannel cross-correlation coefficients for distant speech recognition, 2011, 2011 Joint Workshop on Hands-free Speech Communication and Microphone Arrays.
[25] Shrikanth S. Narayanan, et al. An Overview on Perceptually Motivated Audio Indexing and Classification, 2013, Proceedings of the IEEE.
[26] John R. Hershey, et al. Multichannel End-to-end Speech Recognition, 2017, ICML.
[27] Steve Renals, et al. Convolutional Neural Networks for Distant Speech Recognition, 2014, IEEE Signal Processing Letters.
[28] John McDonough, et al. Distant Speech Recognition, 2009.
[29] Liang Lu, et al. Deep beamforming networks for multi-channel speech recognition, 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[30] Sridha Sridharan, et al. Dealing with uncertainty in microphone placement in a microphone array speech recognition system, 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[31] M. L. Seltzer. Bridging the Gap: Towards a Unified Framework for Hands-Free Speech Recognition Using Microphone Arrays, 2008, 2008 Hands-Free Speech Communication and Microphone Arrays.
[32] Tara N. Sainath, et al. Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms, 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[33] Sree Hari Krishnan Parthasarathi, et al. Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning, 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[34] M. Wolfel, et al. Distant Speech Recognition: Bridging the Gaps, 2008, 2008 Hands-Free Speech Communication and Microphone Arrays.
[35] Ian Lane, et al. Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition, 2016, INTERSPEECH.
[36] James L. Flanagan, et al. Robust distant-talking speech recognition, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.