ASR Performance Prediction on Unseen Broadcast Programs Using Convolutional Neural Networks
暂无分享,去创建一个
Benjamin Lecouteux | Olivier Galibert | Juliette Kahn | Laurent Besacier | Zied Elloumi | Juliette Kahn | Olivier Galibert | L. Besacier | Z. Elloumi | B. Lecouteux
[1] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.
[2] Georges Linarès,et al. Combined low level and high level features for out-of-vocabulary word detection , 2009, INTERSPEECH.
[3] Guillaume Gravier,et al. The ESTER phase II evaluation campaign for the rich transcription of French broadcast news , 2005, INTERSPEECH.
[4] Sheryl R. Young,et al. Recognition Confidence Measures: Detection of Misrecognitions and Out- Of-Vocabulary Words , 1994 .
[5] Olivier Galibert,et al. A presentation of the REPERE challenge , 2012, 2012 10th International Workshop on Content-Based Multimedia Indexing (CBMI).
[6] José Guilherme Camargo de Souza,et al. Quality Estimation for Automatic Speech Recognition , 2014, COLING.
[7] Colin Raffel,et al. librosa: Audio and Music Signal Analysis in Python , 2015, SciPy.
[8] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.
[9] José Guilherme Camargo de Souza,et al. TranscRater: a Tool for Automatic Speech Recognition Quality Estimation , 2016, ACL.
[10] Wei Dai,et al. Very deep convolutional neural networks for raw waveforms , 2016, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[11] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.
[12] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[13] Richard M. Schwartz,et al. Automatic Detection Of New Words In A Large Vocabulary Continuous Speech Recognition System , 1989, HLT.
[14] Dimitri Palaz,et al. Convolutional Neural Networks-based continuous speech recognition using raw speech signal , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[15] Tara N. Sainath,et al. Learning the speech front-end with raw waveform CLDNNs , 2015, INTERSPEECH.
[16] Karol J. Piczak. Environmental sound classification with convolutional neural networks , 2015, 2015 IEEE 25th International Workshop on Machine Learning for Signal Processing (MLSP).
[17] Li-Rong Dai,et al. LID-senone Extraction via Deep Neural Networks for End-to-End Language Identification , 2016, Odyssey.
[18] Björn Schuller,et al. Opensmile: the munich versatile and fast open-source audio feature extractor , 2010, ACM Multimedia.
[19] Olivier Galibert,et al. Methodologies for the evaluation of speaker diarization and automatic speech recognition in the presence of overlapping speech , 2013, INTERSPEECH.
[20] Guy Perennou,et al. BDLEX: a lexicon for spoken and written french , 1998, LREC.
[21] Matthew D. Zeiler. ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.
[22] Olivier Galibert,et al. The ETAPE corpus for the evaluation of speech-based TV content processing in the French language , 2012, LREC.
[23] Yonatan Belinkov,et al. Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems , 2017, NIPS.
[24] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..