04-58 TOWARDS USING HIERARCHICAL POSTERIORS FOR FLEXIBLE AUTOMATIC SPEECH RECOGNITION SYSTEMS
暂无分享,去创建一个
Samy Bengio | Bertrand Mesot | Hervé Bourlard | Nelson Morgan | Qifeng Zhu | M. M. Doss | Mathew Magimai Doss | Samy Bengio | H. Bourlard | N. Morgan | Q. Zhu | Bertrand Mesot
[1] L. Baum,et al. An inequality and associated maximization technique in statistical estimation of probabilistic functions of a Markov process , 1972 .
[2] Andreas Stolcke,et al. THE SRI MARCH 2000 HUB-5 CONVERSATIONAL SPEECH TRANSCRIPTION SYSTEM , 2000 .
[3] Daniel P. W. Ellis,et al. Error visualization for tandem acoustic modeling on the Aurora task , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[4] Steve Renals,et al. Confidence measures from local posterior probability estimates , 1999, Comput. Speech Lang..
[5] N. Morgan,et al. A training algorithm for statistical sequence recognition with applications to transition-based speech recognition , 1996, IEEE Signal Processing Letters.
[6] Keinosuke Fukunaga,et al. Statistical Pattern Recognition , 1993, Handbook of Pattern Recognition and Computer Vision.
[7] Andreas Stolcke,et al. Finding consensus in speech recognition: word error minimization and other applications of confusion networks , 2000, Comput. Speech Lang..
[8] Hervé Bourlard,et al. Connectionist Speech Recognition: A Hybrid Approach , 1993 .
[9] Hynek Hermansky,et al. Hierarchical tandem feature extraction , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[10] Pierre A. Devijver,et al. Baum's forward-backward algorithm revisited , 1985, Pattern Recognit. Lett..
[11] R. Cole,et al. TELEPHONE SPEECH CORPUS DEVELOPMENT AT CSLU , 1998 .
[12] Hynek Hermansky,et al. TRAPS - classifiers of temporal patterns , 1998, ICSLP.
[13] M. Hunt. A statistical approach to metrics for word and syllable recognition , 1979 .
[14] Word-Level Confidence Estimation for Automatic Speech Recognition , 2002 .
[15] Samy Bengio,et al. Modeling Individual and Group Actions in Meetings: A Two-Layer HMM Framework , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.
[16] Eric Horvitz,et al. Layered representations for learning and inferring office activity from multiple sensory channels , 2004, Comput. Vis. Image Underst..
[17] Nikki Mirghafori,et al. Combining connectionist multi-band and full-band probability streams for speech recognition of natural numbers , 1998, ICSLP.
[18] Hervé Bourlard,et al. Estimation of global posteriors and forward-backward training of hybrid HMM/ANN systems , 1997, EUROSPEECH.
[19] H Hermansky,et al. Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.
[20] Michael S. Lewicki,et al. Efficient coding of natural sounds , 2002, Nature Neuroscience.
[21] Hervé Bourlard,et al. Confidence Measures in Hybrid HMM/ANN Speech Recognition , 1998 .
[22] Hynek Hermansky,et al. Qualcomm-ICSI-OGI features for ASR , 2002, INTERSPEECH.
[23] Andreas Stolcke,et al. On using MLP features in LVCSR , 2004, INTERSPEECH.
[24] D. Ellis,et al. CONNECTIONIST FEATURE EXTRACTION FOR CONVENTIONAL HMM SYSTEMS , 1999 .
[25] Hynek Hermansky,et al. Entropy based combination of tandem representations for noise robust ASR , 2004, INTERSPEECH.
[26] Nelson Morgan,et al. Learning long-term temporal features in LVCSR using neural networks , 2004, INTERSPEECH.