Paper information - Aspects of facial biometrics for verification of personal identity

This thesis studies various aspects of facial biometrics for the verification of personal identity in a multimodal framework. The research focuses on the mouth area and, more specifically, on the design of a lip tracking system for extracting visual features. The tracker is based on statistical chromaticity models and uses a B-spline representation of the lip contour. Shape variability is restricted to affine deformations of a linear combination of modes of shape variation. These modes are estimated automatically and robustly from the tracking results of a first, largely unconstrained lip tracker. Tracking experiments were performed on a large multimedia database, and the results were fed as input features to a Dynamic Time Warping algorithm for speaker verification, following a published evaluation protocol. Finally, a weighted linear classifier is trained to perform fusion experiments on the same database, combining verification modalities such as face and voice.
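To make the verification step more concrete, the following is a minimal sketch of a textbook Dynamic Time Warping distance used for accept/reject decisions. It is not taken from the thesis: the function names `dtw_distance` and `verify`, the Euclidean local cost, the unconstrained warping path, and the `threshold` parameter are all assumptions made for illustration, and the feature vectors simply stand in for per-frame lip features.

```python
import math


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of equal-dimension feature vectors.

    Assumption: Euclidean local cost and the basic step pattern
    (insertion, deletion, match) with no path constraints.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the best accumulated cost aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
                cost[i - 1][j - 1],  # advance in both
            )
    return cost[n][m]


def verify(template, test_sequence, threshold):
    """Accept the identity claim if the DTW cost does not exceed a (hypothetical) threshold."""
    return dtw_distance(template, test_sequence) <= threshold


# Illustrative one-dimensional features; real features would be multi-dimensional.
template = [(0.0,), (1.0,), (2.0,)]
slower_repeat = [(0.0,), (1.0,), (1.0,), (2.0,)]
print(dtw_distance(template, slower_repeat))
```

Because the warping path may repeat frames, a slower utterance of the same pattern aligns at zero cost here, which is the property that makes DTW attractive for comparing sequences of different lengths.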

Ramos Sanchez, M. Ulises
