Multi Channel Sequence Processing
[1] Biing-Hwang Juang, et al. Fundamentals of speech recognition, 1993, Prentice Hall signal processing series.
[2] Steve Renals, et al. Indexing and retrieval of broadcast news, 2000, Speech Commun.
[3] Stefan Eickeler, et al. Content-based video indexing of TV broadcast news using hidden Markov models, 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP99).
[4] Juergen Luettin, et al. Audio-Visual Speech Modeling for Continuous Speech Recognition, 2000, IEEE Trans. Multim.
[5] Juergen Luettin, et al. Audio-Visual Automatic Speech Recognition: An Overview, 2004.
[6] Heekuck Oh, et al. Neural Networks for Pattern Recognition, 1993, Adv. Comput.
[7] Samy Bengio, et al. Towards using hierarchical posteriors for flexible automatic speech recognition systems, 2004.
[8] Matthew Brand, et al. Coupled hidden Markov models for modeling interacting processes, 1997.
[9] A. Viterbi. Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm, 1967, IEEE Trans. Inf. Theory.
[10] Luc Vandendorpe, et al. The M2VTS Multimodal Face Database (Release 1.00), 1997, AVBPA.
[11] F. Jelinek. Fast sequential decoding algorithm using a stack, 1969.
[12] Samy Bengio, et al. Automatic analysis of multimodal group actions in meetings, 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[13] Samy Bengio, et al. An Asynchronous Hidden Markov Model for Audio-Visual Speech Recognition, 2002, NIPS.
[14] Samy Bengio. Multimodal speech processing using asynchronous Hidden Markov Models, 2004, Inf. Fusion.
[15] Eric Horvitz, et al. Hierarchical Representations for Learning and Inferring Office Activity from Multiple Sensory Channels, 2002.
[16] Shih-Fu Chang, et al. Structure analysis of soccer video with hidden Markov models, 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[17] D. Rubin, et al. Maximum likelihood from incomplete data via the EM algorithm (with discussion), 1977.
[18] Eric Horvitz, et al. Layered representations for learning and inferring office activity from multiple sensory channels, 2004, Comput. Vis. Image Underst.
[19] Steve Mann. Smart clothing: The wearable computer and wearcam, 2005, Personal Technologies.
[20] Yoshua Bengio, et al. Learning long-term dependencies with gradient descent is difficult, 1994, IEEE Trans. Neural Networks.
[21] Samy Bengio, et al. Modeling Individual and Group Actions in Meetings: A Two-Layer HMM Framework, 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.
[22] Hervé Bourlard, et al. Subband-based speech recognition, 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[23] Djoerd Hiemstra, et al. A Probabilistic Multimedia Retrieval Model and Its Evaluation, 2003, EURASIP J. Adv. Signal Process.
[24] Hervé Glotin, et al. Multi-stream adaptive evidence combination for noise robust ASR, 2001, Speech Commun.
[25] John S. Boreczky, et al. A hidden Markov model framework for video segmentation using audio and image features, 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98.
[26] Jean-Marc Odobez, et al. A Mixed-State I-Particle Filter for Multi-Camera Speaker Tracking, 2003, ICCV 2003.