EXTRACTION USING MULTIMODAL CONVOLUTIONAL NEURAL NETWORKS FOR VISUAL SPEECH RECOGNITION
[1] Fei-Fei Li, et al. Large-Scale Video Classification with Convolutional Neural Networks, 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.
[2] Jeff A. Bilmes, et al. DBN based multi-stream models for audio-visual speech recognition, 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[3] Gérard Chollet, et al. Acquisition of Ultrasound, Video and Acoustic Speech Data for a Silent-Speech Interface Application, 2008.
[4] Navdeep Jaitly, et al. Towards End-To-End Speech Recognition with Recurrent Neural Networks, 2014, ICML.
[5] Thomas Hueber, et al. Statistical conversion of silent articulation into audible speech using full-covariance HMM, 2016, Comput. Speech Lang.
[6] Xiaolin Hu, et al. Recurrent convolutional neural network for object recognition, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[7] Juergen Luettin, et al. Visual speech recognition using active shape models and hidden Markov models, 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[8] Barry-John Theobald, et al. Recent developments in automated lip-reading, 2013, Optics/Photonics in Security and Defence.
[9] Yochai Konig, et al. "Eigenlips" for robust speech recognition, 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.
[10] M. Turk, et al. Eigenfaces for Recognition, 1991, Journal of Cognitive Neuroscience.
[11] Chalapathy Neti, et al. Asynchrony modeling for audio-visual speech recognition, 2002.
[12] Andrew Zisserman, et al. Two-Stream Convolutional Networks for Action Recognition in Videos, 2014, NIPS.
[13] Martin Heckmann, et al. DCT-based video features for audio-visual speech recognition, 2002, INTERSPEECH.
[14] Lawrence D. Jackel, et al. Handwritten Digit Recognition with a Back-Propagation Network, 1989, NIPS.
[15] G. Chollet, et al. Evaluating speech recognizers and data bases, 1988.
[16] Andrea Vedaldi, et al. MatConvNet: Convolutional Neural Networks for MATLAB, 2014, ACM Multimedia.
[17] Tetsuya Ogata, et al. Lipreading using convolutional neural network, 2014, INTERSPEECH.
[18] Christian Wolf, et al. Spatio-Temporal Convolutional Sparse Auto-Encoder for Sequence Classification, 2012, BMVC.
[19] Gérard Chollet, et al. Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips, 2010, Speech Commun.
[20] Guigang Zhang, et al. Deep Learning, 2016, Int. J. Semantic Comput.
[21] J. M. Gilbert, et al. Silent speech interfaces, 2010, Speech Commun.
[22] Mahesh Chandra, et al. Multiple cameras audio visual speech recognition using active appearance model visual features in car environment, 2016, Int. J. Speech Technol.