Synergy of Lip-Motion and Acoustic Features in Biometric Speech and Speaker Recognition
暂无分享,去创建一个
[1] Tsuhan Chen,et al. Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition , 2005, IEEE Transactions on Multimedia.
[2] Johan Wiklund,et al. Multidimensional Orientation Estimation with Applications to Texture Analysis and Optical Flow , 1991, IEEE Trans. Pattern Anal. Mach. Intell..
[3] Stefan Fischer,et al. Face authentication with sparse grid Gabor information , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[4] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.
[5] E. Petajan,et al. An improved automatic lipreading system to enhance speech recognition , 1988, CHI '88.
[6] Douglas A. Reynolds,et al. Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..
[7] Zhifeng Li,et al. Video based face recognition using multiple classifiers , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..
[8] Ara V. Nefian,et al. Speaker independent audio-visual continuous speech recognition , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.
[9] Lance Williams,et al. Performance-driven facial animation , 1990, SIGGRAPH.
[10] G. Granlund. In search of a general picture processing operator , 1978 .
[11] Satoshi Nakamura,et al. Lip movement synthesis from speech based on hidden Markov models , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.
[12] Satoshi Nakamura,et al. Fusion of Audio-Visual Information for Integrated Speech Processing , 2001, AVBPA.
[13] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.
[14] Tsuhan Chen,et al. Audiovisual speech processing , 2001, IEEE Signal Process. Mag..
[15] Jiri Matas,et al. XM2VTSDB: The Extended M2VTS Database , 1999 .
[16] Richard J. Mammone,et al. Speaker recognition using neural networks and conventional classifiers , 1994, IEEE Trans. Speech Audio Process..
[17] Juergen Luettin,et al. Evaluation Protocol for the extended M2VTS Database (XM2VTSDB) , 1998 .
[18] Pramod K. Varshney,et al. Multisensor Data Fusion , 1997, IEA/AIE.
[19] Alex Pentland,et al. Automatic lipreading by optical-flow analysis , 1989 .
[20] Herbert Gish,et al. Speaker identification via support vector classifiers , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[21] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.
[22] Roberto Brunelli,et al. Person identification using multiple cues , 1995, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[23] Farzin Deravi,et al. A review of speech-based bimodal recognition , 2002, IEEE Trans. Multim..
[24] Bernhard Fröba,et al. SESAM: A Biometric Person Identification System Using Sensor Fusion , 1997, AVBPA.
[25] Douglas A. Reynolds,et al. Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..
[26] Juergen Luettin,et al. Acoustic-labial Speaker Verification , 1997, AVBPA.
[27] Pedro J. Moreno,et al. On the use of support vector machines for phonetic classification , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).
[28] Josef Bigün,et al. Person Verification by Lip-Motion , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).
[29] Timothy J. Hazen. Visual model structures and synchrony constraints for audio-visual speech recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.
[30] Dimitris N. Metaxas,et al. Optical Flow Constraints on Deformable Models with Applications to Face Tracking , 2000, International Journal of Computer Vision.
[31] William M. Campbell,et al. Support vector machines for speaker verification and identification , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).
[32] Juergen Luettin,et al. Speechreading using Probabilistic Models , 1997, Comput. Vis. Image Underst..
[33] Stefan Fischer,et al. Expert Conciliation for Multi Modal Person Authentication Systems by Bayesian Statistics , 1997, AVBPA.
[34] Josef Bigün,et al. Evaluating liveness by face images and the structure tensor , 2005, Fourth IEEE Workshop on Automatic Identification Advanced Technologies (AutoID'05).
[35] Sridha Sridharan,et al. The use of speech and lip modalities for robust speaker verification under adverse conditions , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.
[36] I. Gavat,et al. Robust speech recognizer using multiclass SVM , 2004, 7th Seminar on Neural Network Applications in Electrical Engineering, 2004. NEUREL 2004. 2004.