Hybrid HMM/ANN Systems for Speech Recognition: Overview and New Research Directions
暂无分享,去创建一个
[1] George Zavaliagkos,et al. A hybrid segmental neural net/hidden Markov model system for continuous speech recognition , 1994, IEEE Trans. Speech Audio Process..
[2] Biing-Hwang Juang,et al. New discriminative training algorithms based on the generalized probabilistic descent method , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.
[3] Hermann Ney,et al. The use of a one-stage dynamic programming algorithm for connected word recognition , 1984 .
[4] Roger K. Moore,et al. Hidden Markov model decomposition of speech and noise , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[5] Steve Renals,et al. Efficient search using posterior phone probability estimates , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[6] Hervé Bourlard,et al. Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems , 1997, EUROSPEECH.
[7] Yochai Konig,et al. A neural network based, speaker independent, large vocabulary, continuous speech recognition system: the WERNICKE project , 1993, EUROSPEECH.
[8] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .
[9] Jeff A. Bilmes,et al. Stochastic perceptual speech models with durational dependence , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[10] Yoshua Bengio,et al. Global optimization of a neural network-hidden Markov model hybrid , 1992, IEEE Trans. Neural Networks.
[11] Alex Waibel,et al. Large vocabulary recognition using linked predictive neural networks , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[12] Yochai Konig,et al. REMAP: recursive estimation and maximization of a posteriori probabilities in connectionist speech recognition , 1994, EUROSPEECH.
[13] Hervé Bourlard,et al. Neural networks for statistical recognition of continuous speech , 1995, Proc. IEEE.
[14] Dieter Geller,et al. Improvements in connected digit recognition using linear discriminant analysis and mixture densities , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[15] Alex Waibel,et al. Connectionist speaker normalization and its applications to speech recognition , 1991, Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop.
[16] Steve Renals,et al. Recent improvements to the ABBOT large vocabulary CSR system , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.
[17] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[18] Yochai Konig,et al. Remap: recursive estimation and maximization of a posteriori probabilities in transition-based speech recognition , 1996 .
[19] Jayant M. Naik,et al. Connected digit recognition using connectionist probability estimators and mixture-Gaussian densities , 1994, ICSLP.
[20] Hervé Bourlard,et al. A mew ASR approach based on independent processing and recombination of partial frequency bands , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.
[21] George Zavaliagkos,et al. Improving State-of-the-Art Continuous Speech Recognition Systems Using the N-Best Paradigm with Neural Networks , 1992, HLT.
[22] Richard Lippmann,et al. Review of Neural Networks for Speech Recognition , 1989, Neural Computation.
[23] Yochai Konig,et al. REMAP: Recursive Estimation and Maximization of A Posteriori Probabilities - Application to Transition-Based Connectionist Speech Recognition , 1995, NIPS.
[24] Richard Lippmann,et al. Neural Network Classifiers Estimate Bayesian a posteriori Probabilities , 1991, Neural Computation.
[25] Hervé Bourlard,et al. Generalization and Parameter Estimation in Feedforward Netws: Some Experiments , 1989, NIPS.
[26] Esther Levin,et al. Word recognition using hidden control neural architecture , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[27] H.B.D. Sorensen,et al. A cepstral noise reduction multi-layer neural network , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.
[28] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.
[29] Hervé Bourlard,et al. Connectionist Speech Recognition: A Hybrid Approach , 1993 .
[30] Herman J. M. Steeneken,et al. Multi-lingual assessment of speaker independent large vocabulary speech-recognition systems: THE SQALE-PROJECT , 1995, EUROSPEECH.
[31] Jont B. Allen,et al. How do humans process and recognize speech? , 1993, IEEE Trans. Speech Audio Process..
[32] Roger K. Moore,et al. Modelling asynchrony in speech using elementary single-signal decomposition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[33] A. Poritz,et al. On hidden Markov models in isolated word recognition , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[34] R. Kompe,et al. Global optimization of a neural network-hidden Markov model hybrid , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.
[35] Teuvo Kohonen,et al. The 'neural' phonetic typewriter , 1988, Computer.
[36] Hervé Bourlard,et al. Connectionist probability estimators in HMM speech recognition , 1994, IEEE Trans. Speech Audio Process..
[37] H. Gish,et al. A probabilistic approach to the understanding and training of neural network classifiers , 1990, International Conference on Acoustics, Speech, and Signal Processing.
[38] A. B. Poritz,et al. Linear predictive hidden Markov models and the speech signal , 1982, ICASSP.
[39] John Scott Bridle,et al. Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.
[40] Nelson Morgan. Big dumb neural nets: a working brute force approach to speech recognition , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).
[41] Sadaoki Furui,et al. Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..
[42] Hervé Bourlard,et al. Using multiple time scales in a multi-stream speech recognition system , 1997, EUROSPEECH.