Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
暂无分享,去创建一个
Tara N. Sainath | D. Nahamoo | T. N. Sainath | B. Ramabhadran | M. Picheny | D. Kanevsky | M. Picheny | D. Nahamoo | T. Sainath | M. Picheny | D. Kanevsky | B. Ramabhadran
[1] Ronald A. Cole,et al. Experiments on spectrogram reading , 1979, ICASSP.
[2] Patti Price,et al. The DARPA 1000-word resource management database for continuous speech recognition , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.
[3] Hsiao-Wuen Hon,et al. Speaker-independent phone recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..
[4] Stephen Cox,et al. Some statistical issues in the comparison of speech recognition algorithms , 1989, International Conference on Acoustics, Speech, and Signal Processing,.
[5] Steve Young,et al. The general use of tying in phoneme-based HMM speech recognisers , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[6] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[7] G. Strang. Introduction to Linear Algebra , 1993 .
[8] Jean-Luc Gauvain,et al. High performance speaker-independent phone recognition using CDHMM , 1993, EUROSPEECH.
[9] Steve J. Young,et al. MMI training for continuous phoneme recognition on the TIMIT database , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[10] Anthony J. Robinson,et al. An application of recurrent nets to phone probability estimation , 1994, IEEE Trans. Neural Networks.
[11] James R. Glass,et al. A probabilistic framework for feature-based speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[12] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[13] James R. Glass,et al. Heterogeneous measurements and multiple classifiers for speech recognition , 1998, ICSLP.
[14] Mark J. F. Gales,et al. Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..
[15] Francis Jack Smith,et al. Improved phone recognition using Bayesian triphone models , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[16] Michael Picheny,et al. Recent advances in speech recognition system for IBM DARPA communicator , 2001, INTERSPEECH.
[17] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..
[18] Herbert Gish,et al. The 2001 BYBLOS English large vocabulary conversational speech recognition system , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[19] Geoffrey Zweig,et al. An architecture for rapid decoding of large vocabulary conversational speech , 2003, INTERSPEECH.
[20] Daniel Povey. Phone duration modeling for LVCSR , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[21] Joseph Picone,et al. Applications of support vector machines to speech recognition , 2004, IEEE Transactions on Signal Processing.
[22] H. Zou,et al. Regularization and variable selection via the elastic net , 2005 .
[23] Geoffrey Zweig,et al. The IBM 2004 conversational telephony system for rich transcription , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..
[24] Georg Heigold,et al. Speech recognition with state-based nearest neighbour classifiers , 2007, INTERSPEECH.
[25] Li Deng,et al. Phone-discriminating minimum classification error (p-MCE) training for phonetic recognition , 2007, INTERSPEECH.
[26] Patrick Wambacq,et al. Template-Based Continuous Speech Recognition , 2007, IEEE Transactions on Audio, Speech, and Language Processing.
[27] Tara N. Sainath,et al. Audio classification using extended baum-welch transformations , 2007, INTERSPEECH.
[28] Dong Yu,et al. Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[29] Geoffrey Zweig,et al. The IBM 2006 Gale Arabic ASR System , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[30] Lawrence K. Saul,et al. Comparison of Large Margin Training to Other Discriminative Methods for Phonetic Recognition by Hidden Markov Models , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.
[31] Brian Kingsbury,et al. Boosted MMI for model and feature-space discriminative training , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.
[32] Brian Kingsbury,et al. Lattice-based optimization of sequence classification criteria for neural-network acoustic modeling , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.
[33] D. Kanevsky,et al. ABCS : Approximate Bayesian Compressed Sensing , 2009 .
[34] Wei Wu,et al. Development of the 2008 SRI Mandarin speech-to-text system for broadcast news and conversation , 2009, INTERSPEECH.
[35] Tara N. Sainath,et al. An exploration of large vocabulary tools for small vocabulary phonetic recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[36] Geoffrey Zweig,et al. A segmental CRF approach to large vocabulary continuous speech recognition , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.
[37] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[38] Tara N. Sainath,et al. Sparse representations for text categorization , 2010, INTERSPEECH.
[39] Tuomas Virtanen,et al. Noise robust exemplar-based connected digit recognition , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[40] Hynek Hermansky,et al. Sparse auto-associative neural networks: theory and application to speech recognition , 2010, INTERSPEECH.
[41] Mark J. F. Gales,et al. Recent improvements to the Cambridge Arabic Speech-to-Text systems , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[42] Tara N. Sainath,et al. An analysis of sparseness and regularization in exemplar-based methods for speech classification , 2010, INTERSPEECH.
[43] Tara N. Sainath,et al. Bayesian compressive sensing for phonetic classification , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.
[44] Geoffrey E. Hinton,et al. Phone Recognition with the Mean-Covariance Restricted Boltzmann Machine , 2010, NIPS.
[45] Ulpu Remes,et al. Observation uncertainty measures for sparse imputation , 2010, INTERSPEECH.
[46] Brian Kingsbury,et al. The IBM Attila speech recognition toolkit , 2010, 2010 IEEE Spoken Language Technology Workshop.
[47] Tara N. Sainath,et al. Reducing Computational Complexities of Exemplar-Based Sparse Representations with Applications to Large Vocabulary Speech Recognition , 2011, INTERSPEECH.