The DIRHA-ENGLISH corpus and related tasks for distant-speech recognition in domestic environments
暂无分享,去创建一个
Maurizio Omologo | Mirco Ravanelli | Roberto Gretter | Luca Cristoforetti | Marco Pellin | Alessandro Sosi | M. Ravanelli | M. Omologo | L. Cristoforetti | R. Gretter | Marco Pellin | Alessandro Sosi
[1] Jon Barker,et al. The third ‘CHiME’ speech separation and recognition challenge: Dataset, task and baselines , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
[2] Ning Ma,et al. The PASCAL CHiME speech separation and recognition challenge , 2013, Comput. Speech Lang..
[3] Walter Kellermann,et al. Beamforming for Speech and Audio Signals , 2008 .
[4] Maurizio Omologo,et al. On the selection of the impulse responses for distant-speech recognition based on contaminated speech training , 2014, INTERSPEECH.
[5] Alessio Brutti,et al. A speech event detection and localization task for multiroom environments , 2014, 2014 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA).
[6] Tomohiro Nakatani,et al. The reverb challenge: A common evaluation framework for dereverberation and recognition of reverberant speech , 2013, 2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.
[7] Tatsuya Kawahara,et al. Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature , 2015, EURASIP J. Adv. Signal Process..
[8] Maurizio Omologo,et al. A multi-channel corpus for distant-speech interaction in presence of known interferences , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[9] Christophe Ris,et al. A corpus-based approach for robust ASR in reverberant environments , 2000, INTERSPEECH.
[10] Angelo Farina,et al. Simultaneous Measurement of Impulse Response and Distortion with a Swept-Sine Technique , 2000 .
[11] Lukás Burget,et al. Sequence-discriminative training of deep neural networks , 2013, INTERSPEECH.
[12] John McDonough,et al. Distant Speech Recognition , 2009 .
[13] Ramón Fernández Astudillo,et al. The DIRHA-GRID corpus: baseline and tools for multi-room distant speech recognition using distributed microphones , 2014, INTERSPEECH.
[14] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[15] Janet M. Baker,et al. The Design for the Wall Street Journal-based CSR Corpus , 1992, HLT.
[16] Martin Wolf,et al. Channel selection measures for multi-microphone speech recognition , 2014, Speech Commun..
[17] Roland Maas,et al. Spatial diffuseness features for DNN-based speech recognition in noisy and reverberant environments , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[18] Elmar Nöth,et al. Using Artificially Reverberated Training Data in Distant-Talking ASR , 2005, TSD.
[19] Thomas Hain,et al. Using neural network front-ends on far field multiple microphones based speech recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[20] Maurizio Omologo,et al. Automatic segmentation and labeling of speech based on Hidden Markov Models , 1993, Speech Commun..
[21] Yuuki Tachioka,et al. The MERL/MELCO/TUM system for the REVERB Challenge using Deep Recurrent Neural Network Feature Enhancement , 2014, ICASSP 2014.
[22] Jon Barker,et al. An audio-visual corpus for speech perception and automatic speech recognition. , 2006, The Journal of the Acoustical Society of America.
[23] Steve Renals,et al. Hybrid acoustic models for distant and multichannel large vocabulary speech recognition , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.
[24] Jonathan G. Fiscus,et al. Darpa Timit Acoustic-Phonetic Continuous Speech Corpus CD-ROM {TIMIT} | NIST , 1993 .
[25] Hsiao-Wuen Hon,et al. Speaker-independent phone recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..
[26] Maurizio Omologo,et al. Hidden Markov model training with contaminated speech material for distant-talking speech recognition , 2002, Comput. Speech Lang..
[27] Petros Maragos,et al. The DIRHA simulated corpus , 2014, LREC.
[28] Maurizio Omologo,et al. Impulse response estimation for robust speech recognition in a reverberant environment , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).
[29] Gerhard Schmidt,et al. Speech and Audio Processing in Adverse Environments , 2008 .
[30] Maurizio Omologo,et al. Contaminated speech training methods for robust DNN-HMM distant speech recognition , 2017, INTERSPEECH.
[31] Dong Yu,et al. Automatic Speech Recognition: A Deep Learning Approach , 2014 .