Hierarchical search for large-vocabulary conversational speech recognition: working toward a solution to the decoding problem
暂无分享,去创建一个
[1] B. Atal. Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. , 1974, The Journal of the Acoustical Society of America.
[2] John Makhoul,et al. BYBLOS: The BBN continuous speech recognition system , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Lalit R. Bahl,et al. A tree-based statistical language model for natural language speech recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..
[4] Yu-Hung Kao,et al. Efficient search strategies in hierarchical pattern recognition systems , 1995, Proceedings of the Twenty-Seventh Southeastern Symposium on System Theory.
[5] Andrej Ljolje,et al. Full expansion of context-dependent networks in large vocabulary speech recognition , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[6] Vaibhava Goel,et al. Syllable-a promising recognition unit for LVCSR , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.
[7] S. Furui,et al. Cepstral analysis technique for automatic speaker verification , 1981 .
[8] Douglas B. Paul,et al. The Lincoln large-vocabulary stack-decoder HMM CSR , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[9] Mari Ostendorf,et al. A Bayesian approach to speaker adaptation for the stochastic segment model , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[10] Raj Reddy,et al. Large-vocabulary speaker-independent continuous speech recognition: the sphinx system , 1988 .
[11] Anthony J. Robinson,et al. An application of recurrent nets to phone probability estimation , 1994, IEEE Trans. Neural Networks.
[12] Renato De Mori,et al. A Cache-Based Natural Language Model for Speech Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..
[13] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[14] Vassilios Digalakis,et al. Techniques to Achieve an Accurate Real-Time Large-Vocabulary Speech Recognition System , 1994, HLT.
[15] Michael Picheny,et al. Context Dependent Modeling of Phones in Continuous Speech Using Decision Trees , 1991, HLT.
[16] R. Schwartz,et al. A comparison of several approximate algorithms for finding multiple (N-best) sentence hypotheses , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[17] Hermann Ney,et al. Dynamic programming parsing for context-free grammars in continuous speech recognition , 1991, IEEE Trans. Signal Process..
[18] Lalit R. Bahl,et al. A tree search strategy for large-vocabulary continuous speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[19] John J. Godfrey,et al. SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[20] Mari Ostendorf,et al. Fast algorithms for phone classification and recognition using segment-based models , 1992, IEEE Trans. Signal Process..
[21] Stan Davis,et al. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Se , 1980 .
[22] Hermann Ney,et al. Dynamic programming search for continuous speech recognition , 1999, IEEE Signal Process. Mag..
[23] Steve J. Young,et al. A One Pass Decoder Design For Large Vocabulary Recognition , 1994, HLT.
[24] Andreas Noll,et al. A data-driven organization of the dynamic programming beam search for continuous speech recognition , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[25] Michael Riley,et al. The Watson speech recognition engine , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[26] Vassilios Digalakis,et al. Genones: generalized mixture tying in continuous hidden Markov model-based speech recognizers , 1996, IEEE Trans. Speech Audio Process..
[27] Ronald Rosenfeld,et al. Trigger-based language models: a maximum entropy approach , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[28] Richard M. Schwartz,et al. Search Algorithms for Software-Only Real-Time Recognition with Very Large Vocabularies , 1993, HLT.
[29] John Lafferty,et al. Grammatical Trigrams: A Probabilistic Model of Link Grammar , 1992 .
[30] Hermann Ney,et al. Data driven search organization for continuous speech recognition , 1992, IEEE Trans. Signal Process..
[31] Steve J. Young,et al. State clustering in hidden Markov model-based continuous speech recognition , 1994, Comput. Speech Lang..
[32] Frank K. Soong,et al. An N-best candidates-based discriminative training for speech recognition applications , 1994, IEEE Trans. Speech Audio Process..
[33] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.
[34] Kai-Fu Lee,et al. On large-vocabulary speaker-independent continuous speech recognition , 1988, Speech Commun..
[35] Richard M. Schwartz,et al. The N-Best Algorithm: Efficient Procedure for Finding Top N Sentence Hypotheses , 1989, HLT.
[36] Michèle Jardino. Multilingual stochastic n-gram class language models , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[37] Mei-Yuh Hwang,et al. A comparative study of discrete, semicontinuous, and continuous hidden Markov models , 1993, Comput. Speech Lang..
[38] George Zavaliagkos,et al. Is N-Best Dead? , 1994, HLT.
[39] Frank K. Soong,et al. A Tree.Trellis Based Fast Search for Finding the N Best Sentence Hypotheses in Continuous Speech Recognition , 1990, HLT.
[40] Joseph Picone,et al. Signal modeling techniques in speech recognition , 1993, Proc. IEEE.
[41] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .
[42] Hitoshi Iida,et al. A Japanese-to-English speech translation system: ATR-MATRIX , 1998, ICSLP.
[43] Hermann Ney,et al. On structuring probabilistic dependences in stochastic language modelling , 1994, Comput. Speech Lang..
[44] Michael Picheny,et al. Large vocabulary natural language continuous speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.
[45] Douglas B. Paul. The Lincoln Large-Vocabulary Stack-Decoder Based HMM CSR , 1994, HLT.
[46] J. Picone,et al. Continuous speech recognition using hidden Markov models , 1990, IEEE ASSP Magazine.
[47] Julian Kupiec,et al. Probabilistic Models of Short and Long Distance Word Dependencies in Running Text , 1989, HLT.
[48] Frank Fallside,et al. A recurrent error propagation network speech recognition system , 1991 .
[49] Frederick Jelinek,et al. Up from trigrams! - the struggle for improved language models , 1991, EUROSPEECH.
[50] Gérard Chollet,et al. On the evaluation of speech recognizers and data bases using a reference system , 1982, ICASSP.
[51] Mitch Weintraub,et al. Progressive-Search Algorithms For Large-Vocabulary Speech Recognition , 1993, HLT.