Improved modeling and efficiency for automatic transcription of Broadcast News
暂无分享,去创建一个
Andreas Stolcke | Ananth Sankar | Fuliang Weng | Venkata Ramana Rao Gadde | A. Stolcke | V. R. Gadde | Ananth Sankar | F. Weng
[1] Hermann Ney,et al. A word graph algorithm for large vocabulary continuous speech recognition , 1994, Comput. Speech Lang..
[2] 守屋 悦朗,et al. J.E.Hopcroft, J.D. Ullman 著, "Introduction to Automata Theory, Languages, and Computation", Addison-Wesley, A5変形版, X+418, \6,670, 1979 , 1980 .
[3] R. Schwartz,et al. A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[4] Frank K. Soong,et al. Quantizing mixture-weights in a tied-mixture HMM , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[5] Vassilios Digalakis,et al. High-Accuracy Large-Vocabulary Speech Recognition Using Mixture Tying and Consistency Modeling , 1994, HLT.
[6] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[7] Jj Odell,et al. The Use of Context in Large Vocabulary Speech Recognition , 1995 .
[8] Chin-Hui Lee,et al. A maximum-likelihood approach to stochastic matching for robust speech recognition , 1996, IEEE Trans. Speech Audio Process..
[9] Thomas Hain,et al. The 1997 HTK broadcast news transcription system , 1998 .
[10] Reinhard Kneser,et al. Statistical language modeling using a variable context length , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[11] A. Sankar,et al. Stochastic matching for robust speech recognition , 1994, IEEE Signal Processing Letters.
[12] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .
[13] Robert M. Gray,et al. An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..
[14] Ananth Sankar,et al. Parameter tying and gaussian clustering for faster, better, and smaller speech recognition , 1999, EUROSPEECH.
[15] Ananth Sankar. Robust HMM estimation with Gaussian merging-splitting and tied-transform HMMs , 1998, ICSLP.
[16] Nikko Ström. Automatic Continuous Speech Recognition with Rapid Speaker Adaptation for Human/machine Interaction , 1997 .
[17] Vassilios Digalakis,et al. Genones: generalized mixture tying in continuous hidden Markov model-based speech recognizers , 1996, IEEE Trans. Speech Audio Process..
[18] Mehryar Mohri,et al. Weighted determinization and minimization for large vocabulary speech recognition , 1997, EUROSPEECH.
[19] Steve J. Young,et al. The use of state tying in continuous speech recognition , 1993, EUROSPEECH.
[20] Ronald Rosenfeld,et al. Scalable backoff language models , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[21] Andreas Stolcke,et al. Acoustic Modeling for the SRI Hub4 Partitioned Evaluation Continuous Speech Recognition System , 1997 .
[22] Mari Ostendorf,et al. Modeling long distance dependence in language: topic mixtures vs. dynamic cache models , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[23] Mitch Weintraub,et al. Large-vocabulary dictation using SRI's DECIPHER speech recognition system: progressive search techniques , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[24] Ananth Sankar. A new look at HMM parameter tying for large vocabulary speech recognition , 1998, ICSLP.
[25] Ronald Rosenfeld,et al. Using story topics for language model adaptation , 1997, EUROSPEECH.
[26] Vassilios Digalakis,et al. A comparative study of speaker adaptation techniques , 1995, EUROSPEECH.
[27] Allen Gersho,et al. Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.
[28] Andreas Stolcke,et al. Efficient lattice representation and generation , 1998, ICSLP.
[29] Ramana Rao,et al. SRI’s 1998 Broadcast News System – Toward Faster, Better, Smaller Speech Recognition , 1999 .
[30] Vassilios Digalakis,et al. Speaker adaptation using constrained estimation of Gaussian mixtures , 1995, IEEE Trans. Speech Audio Process..
[31] Anthony J. Robinson,et al. Language model adaptation using mixtures and an exponentially decaying cache , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[32] Vassilios Digalakis,et al. Speaker adaptation using combined transformation and Bayesian methods , 1996, IEEE Trans. Speech Audio Process..
[33] Jeffrey D. Ullman,et al. Introduction to Automata Theory, Languages and Computation , 1979 .
[34] Chin-Hui Lee,et al. Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..
[35] Richard M. Schwartz,et al. The 1996 BBN BYBLOS HUB-4 Transcription System , 1996 .
[36] Steve J. Young,et al. Large vocabulary continuous speech recognition using HTK , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[37] B.-H. Juang,et al. Maximum-likelihood estimation for mixture multivariate stochastic observations of Markov chains , 1985, AT&T Technical Journal.
[38] P. Woodland,et al. Flexible speaker adaptation using maximum likelihood linear regression , 1995 .
[39] Larry P. Heck,et al. Acoustic clustering and adaptation for robust speech recognition , 1997, EUROSPEECH.
[40] Mari Ostendorf,et al. Modeling long distance dependence in language: topic mixtures versus dynamic cache models , 1996, IEEE Trans. Speech Audio Process..
[41] Steve Austin,et al. The forward-backward search algorithm , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[42] Mei-Yuh Hwang,et al. Predicting unseen triphones with senones , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[43] George R. Doddington. CSR Corpus Development , 1992, HLT.
[44] I. Good. THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .
[45] Jerome R. Bellegarda,et al. A latent semantic analysis framework for large-Span language modeling , 1997, EUROSPEECH.