Automatic Speech Recognition System Development in the "Wild"
暂无分享,去创建一个
[1] Sanjeev Khudanpur,et al. A pitch extraction algorithm tuned for automatic speech recognition , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[2] Mark J. F. Gales,et al. Progress in the CU-HTK broadcast news transcription system , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[3] Richard M. Schwartz,et al. Enhancing low resource keyword spotting with automatically retrieved web documents , 2015, INTERSPEECH.
[4] H. Ney,et al. INTERDEPENDENCE OF LANGUAGE MODELS AND DISCRIMINATIVE TRAINING , 2007 .
[5] Geoffrey Zweig,et al. The microsoft 2016 conversational speech recognition system , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[6] Mark J. F. Gales,et al. Improving speech recognition and keyword search for low resource languages using web data , 2015, INTERSPEECH.
[7] Marcello Federico,et al. Language Model Adaptation , 1999 .
[8] Hui Jiang,et al. Confidence measures for speech recognition: A survey , 2005, Speech Commun..
[9] Mark J. F. Gales,et al. Speech recognition and keyword spotting for low-resource languages: Babel project research at CUED , 2014, SLTU.
[10] Philip C. Woodland,et al. Detecting deletions in ASR output , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Daniel Povey,et al. Large scale discriminative training of hidden Markov models for speech recognition , 2002, Comput. Speech Lang..
[12] Yiming Wang,et al. Low Latency Acoustic Modeling Using Temporal Convolution and LSTMs , 2018, IEEE Signal Processing Letters.
[13] Brian Roark,et al. Unsupervised language model adaptation , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..
[14] Jean-Luc Gauvain,et al. Lightly supervised and unsupervised acoustic model training , 2002, Comput. Speech Lang..
[15] Sherif Abdou,et al. Recent progress in Arabic broadcast news transcription at BBN , 2005, INTERSPEECH.
[16] Dong Yu,et al. Large vocabulary continuous speech recognition with context-dependent DBN-HMMS , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[17] Mitch Weintraub,et al. Neural-network based measures of confidence for word recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[18] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[19] Mark J. F. Gales,et al. Low-Resource Speech Recognition and Keyword-Spotting , 2017, SPECOM.
[20] George Saon,et al. The IBM 2016 English Conversational Telephone Speech Recognition System , 2016, INTERSPEECH.
[21] Gunnar Evermann,et al. Posterior probability decoding, confidence estimation and system combination , 2000 .
[22] Alex Acero,et al. Estimating speech recognition error rate without acoustic test data , 2003, INTERSPEECH.
[23] Hermann Ney,et al. Joint-sequence models for grapheme-to-phoneme conversion , 2008, Speech Commun..
[24] Alexander Gruenstein,et al. Unsupervised Testing Strategies for ASR , 2011, INTERSPEECH.
[25] Tara N. Sainath,et al. Convolutional, Long Short-Term Memory, fully connected Deep Neural Networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] Sanjeev Khudanpur,et al. Audio augmentation for speech recognition , 2015, INTERSPEECH.
[27] Yu Wang,et al. PHONETIC AND GRAPHEMIC SYSTEMS FOR MULTI-GENRE BROADCAST TRANSCRIPTION , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).