Librispeech: An ASR corpus based on public domain audio books
Sanjeev Khudanpur | Daniel Povey | Guoguo Chen | Vassil Panayotov
[1] Vladimir I. Levenshtein,et al. Binary codes capable of correcting deletions, insertions, and reversals , 1965 .
[2] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .
[3] M S Waterman,et al. Identification of common molecular subsequences. , 1981, Journal of molecular biology.
[4] Ian H. Witten,et al. The zero-frequency problem: Estimating the probabilities of novel events in adaptive text compression , 1991, IEEE Trans. Inf. Theory.
[5] Janet M. Baker,et al. The Design for the Wall Street Journal-based CSR Corpus , 1992, HLT.
[6] Hermann Ney,et al. Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[7] Mark J. F. Gales,et al. Mean and variance adaptation within the MLLR framework , 1996, Comput. Speech Lang..
[8] Mark J. F. Gales,et al. Variance compensation within the MLLR framework , 1996 .
[9] Stanley F. Chen,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.
[10] Aravind K. Joshi,et al. Proceedings of the 34th annual meeting on Association for Computational Linguistics , 1996 .
[11] Richard M. Schwartz,et al. A compact model for speaker-adaptive training , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[12] Mark J. F. Gales,et al. Semi-tied covariance matrices for hidden Markov models , 1999, IEEE Trans. Speech Audio Process..
[13] Stanley F. Chen,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.
[14] Shankar Kumar,et al. Normalization of non-standard words , 2001, Comput. Speech Lang..
[15] Andreas Stolcke,et al. SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.
[16] Timothy J. Hazen. Automatic alignment and error correction of human generated transcripts for long speech recordings , 2006, INTERSPEECH.
[17] Hermann Ney,et al. Joint-sequence models for grapheme-to-phoneme conversion , 2008, Speech Commun..
[18] Brian Kingsbury,et al. Boosted MMI for model and feature-space discriminative training , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[19] Mark J. F. Gales,et al. Lightly supervised recognition for automatic alignment of large coherent speech recordings , 2010, INTERSPEECH.
[20] Kishore Prahallad,et al. Automatic Building of Synthetic Voices from Audio Books , 2010 .
[21] Sylvain Meignier,et al. LIUM SpkDiarization: An Open Source Toolkit for Diarization , 2010 .
[22] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .
[23] S. King,et al. The Blizzard Challenge 2012 , 2012 .
[24] Jordi Luque,et al. Audio-to-text alignment for speech recognition with very limited resources , 2014, INTERSPEECH.
[25] Xiaohui Zhang,et al. Improving deep neural network acoustic models using generalized maxout networks , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
[26] M. Picheny,et al. Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentences , 2017 .