Aspects of facial biometrics for verification of personal identity

This thesis studies various aspects of facial biometrics for the verification of personal identity in a multimodal framework. The research focuses on the mouth area and, more specifically, on the design of a lip tracking system for the extraction of visual features. The tracker is based on statistical chromaticity models and uses a B-spline representation of the contour of the lips. Shape variability is restricted to affine deformations of a linear combination of modes of shape variation, which are automatically estimated in a robust way using the tracking results provided by a first, rather unconstrained lip tracker. Tracking experiments were performed in a large multimedia database and the results were fed as input features to a Dynamic Time Warping algorithm for speaker verification purposes according to a published evaluation protocol. A weighted linear classifier is eventually trained for performing fusion experiments on the same database combining various verification modalities such as face and voice.

[1]  Timothy F. Cootes,et al.  An Automatic Face Identification System Using Flexible Appearance Models , 1994, BMVC.

[2]  Simon Haykin,et al.  Neural networks , 1994 .

[3]  Robert J. Baron,et al.  Mechanisms of Human Facial Recognition , 1981, Int. J. Man Mach. Stud..

[4]  Stefan Fischer,et al.  Face authentication with Gabor information on deformable graphs , 1999, IEEE Trans. Image Process..

[5]  Michael Isard,et al.  Learning to track curves in motion , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[6]  A. P. Breen,et al.  An investigation into the generation of mouth shapes for a talking head , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[7]  Hichem Frigui,et al.  A robust algorithm for automatic extraction of an unknown number of clusters from noisy data , 1996, Pattern Recognit. Lett..

[8]  Roberto Brunelli,et al.  Person identification using multiple cues , 1995, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Alex Pentland,et al.  Face recognition using view-based and modular eigenspaces , 1994, Optics & Photonics.

[10]  Alex Pentland,et al.  Bayesian face recognition using deformable intensity surfaces , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Timothy F. Cootes,et al.  Locating Objects of Varying Shape Using Statistical Feature Detectors , 1996, ECCV.

[12]  A. Waibel,et al.  MULTIMODAL INTERPRETER SPEECH GESTURE WRITING DIALOG PROCESSOR MULTIMODAL LEARNING INTERFACES , 1995 .

[13]  Jiri Matas,et al.  Image interpretation: exploiting multiple cues , 1995 .

[14]  Alexandros Eleftheriadis,et al.  Automatic face location detection and tracking for model-assisted coding of video teleconferencing sequences at low bit-rates , 1995, Signal Process. Image Commun..

[15]  Kiyoharu Aizawa,et al.  Detection and tracking of facial features , 1995, Other Conferences.

[16]  Rodney G. Winter,et al.  Verification of personal identity using facial images , 1994, Optics & Photonics.

[17]  Jiri Matas,et al.  Fast face localisation and verification , 1999, Image Vis. Comput..

[18]  Josef Kittler,et al.  Feature selection for a DTW-based speaker verification system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[19]  Glenn Healey,et al.  Segmenting images using normalized color , 1992, IEEE Trans. Syst. Man Cybern..

[20]  Rajesh N. Davé,et al.  Robust clustering methods: a unified view , 1997, IEEE Trans. Fuzzy Syst..

[21]  John N. Carter,et al.  Design and development of a 4D scanner , 1993, Optics & Photonics.

[22]  Joachim M. Buhmann,et al.  Illumination-Invariant Face Recognition with a Contrast Sensitive Silicon Retina , 1993, NIPS.

[23]  Alex Pentland,et al.  3D lip shapes from video: A combined physical-statistical model , 1998, Speech Commun..

[24]  Gerald Farin,et al.  Curves and surfaces for computer aided geometric design , 1990 .

[25]  Ela Claridge,et al.  Developing a predictive model of human skin coloring , 1996, Medical Imaging.

[26]  Tsunehiro Aibara,et al.  Comparison of face recognition: profiles versus frontal faces , 1994, Optics & Photonics.

[27]  Jiri Matas,et al.  XM2VTSDB: The Extended M2VTS Database , 1999 .

[28]  Jonathan Phillips,et al.  Matching pursuit filters applied to face identification , 1994, Optics & Photonics.

[29]  M. Tovée,et al.  An Introduction to the Visual System , 1997 .

[30]  Jiri Matas,et al.  Spatial and Feature Space Clustering: Applications in Image Analysis , 1995, CAIP.

[31]  Steven A. Shafer,et al.  Obtaining accurate color images for machine-vision research , 1990, Electronic Imaging.

[32]  Alex Pentland,et al.  3D modeling and tracking of human lip motions , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[33]  David C. Hogg,et al.  Learning Spatiotemporal Models From Training Examples , 1995 .

[34]  Juergen Luettin,et al.  Visual speech recognition using active shape models and hidden Markov models , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[35]  Shimon Ullman,et al.  Face Recognition: The Problem of Compensating for Changes in Illumination Direction , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Andy Harter,et al.  Parameterisation of a stochastic model for human face identification , 1994, Proceedings of 1994 IEEE Workshop on Applications of Computer Vision.

[37]  M. M. Cohen,et al.  What can visual speech synthesis tell visual speech recognition? , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[38]  Roberto Cipolla,et al.  Towards an Automatic Human Face Localizations System , 1995, BMVC.

[39]  Andrew Blake,et al.  Learning Dynamics of Complex Motions from Image Sequences , 1996, ECCV.

[40]  David Beymer,et al.  Face recognition from one example view , 1995, Proceedings of IEEE International Conference on Computer Vision.

[41]  Andrew Blake,et al.  Statistical Background Modelling for Tracking with a Virtual Camera , 1995, BMVC.

[42]  Timothy F. Cootes,et al.  Active Shape Models: Evaluation of a Multi-Resolution Method for Improving Image Search , 1994, BMVC.

[43]  T. Poggio,et al.  I think I know that face... , 1996, Nature.

[44]  Juergen Luettin,et al.  Visual Speech and Speaker Recognition , 1997 .

[45]  Bill Welsh,et al.  Model-based coding of images , 1991 .

[46]  Alex Pentland,et al.  View-based and modular eigenspaces for face recognition , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[47]  Y. Bar-Shalom Tracking and data association , 1988 .

[48]  R. M. Mersereau,et al.  Lip modeling for visual speech recognition , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[49]  Luc Vandendorpe,et al.  Multiple experts for robust face authentication , 1998, Electronic Imaging.

[50]  G Pfurtscheller,et al.  Separability of EEG signals recorded during right and left motor imagery using adaptive autoregressive parameters. , 1998, IEEE transactions on rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[51]  Yochai Konig,et al.  A hybrid approach to bimodal speech recognition , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[52]  Rama Chellappa,et al.  Human and machine recognition of faces: a survey , 1995, Proc. IEEE.

[53]  Timothy F. Cootes,et al.  Interpreting face images using active appearance models , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[54]  Deniz Yuret From Genetic Algorithms to Efficient Optimization , 1994 .

[55]  Michael Isard,et al.  3D position, attitude and shape input using video tracking of hands and lips , 1994, SIGGRAPH.

[56]  Man-Wai Mak,et al.  Lip-motion analysis for speech segmentation in noise , 1994, Speech Commun..

[57]  Horst Bischof,et al.  Dealing with occlusions in the eigenspace approach , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[58]  Ching Y. Suen,et al.  Face identification using algebraic features of images , 1994, Optics & Photonics.

[59]  Roberto Brunelli,et al.  Face Recognition: Features Versus Templates , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[60]  Demetri Terzopoulos,et al.  Topologically adaptable snakes , 1995, Proceedings of IEEE International Conference on Computer Vision.

[61]  Stephen J. Cox,et al.  A Comparison of Active Shape Model and Scale Decomposition Based Features for Visual Speech Recognition , 1998, ECCV.

[62]  Timothy F. Cootes,et al.  Face Recognition Using Active Appearance Models , 1998, ECCV.

[63]  Ravi Kothari,et al.  Detection of eye locations in unconstrained visual images , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[64]  Tomaso A. Poggio,et al.  Image Synthesis from a Single Example Image , 1996, ECCV.

[65]  Alex Pentland,et al.  Probabilistic visual learning for object detection , 1995, Proceedings of IEEE International Conference on Computer Vision.

[66]  Chung-Lin Huang,et al.  Human facial feature extraction for face interpretation and recognition , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[67]  Pierre Chardaire,et al.  Multiscale Nonlinear Decomposition: The Sieve Decomposition Theorem , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[68]  Juergen Luettin,et al.  Active Shape Models for Visual Speech Feature Extraction , 1996 .

[69]  Isaac Weiss,et al.  Straight line fitting in a noisy image , 1988, Proceedings CVPR '88: The Computer Society Conference on Computer Vision and Pattern Recognition.

[70]  M. Bichsel Strategies of robust object recognition for the automatic identification of human faces , 1991 .

[71]  Lorenzo Torresani,et al.  2D Deformable Models for Visual Speech Analysis , 1996 .

[72]  Bülent Sankur,et al.  Facial feature localization and adaptation of a generic face model for model-based coding , 1995, Signal Process. Image Commun..

[73]  Nicholas Costen,et al.  Automatic Face Recognition: What Representation? , 1996, ECCV.

[74]  A. Zakhor,et al.  Depth based recovery of human facial features from video sequences , 1995, Proceedings., International Conference on Image Processing.

[75]  Tomaso Poggio,et al.  Automatic person recognition by acoustic and geometric features , 1995 .

[76]  David G. Stork,et al.  Using deformable templates to infer visual speech dynamics , 1994, Proceedings of 1994 28th Asilomar Conference on Signals, Systems and Computers.

[77]  Jiebo Luo,et al.  Face location in wavelet-based video compression for high perceptual quality videoconferencing , 1995, Proceedings., International Conference on Image Processing.

[78]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[79]  Joseph Wilder,et al.  Projection-based face recognition , 1994, Optics & Photonics.

[80]  Justine Sergent,et al.  Face recognition , 1992, Current Opinion in Neurobiology.

[81]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[82]  Alex Pentland,et al.  Beyond eigenfaces: probabilistic matching for face recognition , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[83]  Yehezkel Yeshurun,et al.  Robust detection of facial features by generalized symmetry , 1992, [1992] Proceedings. 11th IAPR International Conference on Pattern Recognition.

[84]  C. Braun,et al.  Adaptive AR modeling of nonstationary time series by means of Kalman filtering , 1998, IEEE Transactions on Biomedical Engineering.

[85]  John N. Carter,et al.  Improved Stripe Matching for Colour Encoded Structured Light , 1993, CAIP.

[86]  David Beymer,et al.  Face recognition under varying pose , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[87]  Michael Isard,et al.  Active Contours , 2000, Springer London.

[88]  David J. Kriegman,et al.  Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection , 1996, ECCV.

[89]  John G. Proakis,et al.  Probability, random variables and stochastic processes , 1985, IEEE Trans. Acoust. Speech Signal Process..

[90]  Anthony J. Yezzi,et al.  Gradient flows and geometric active contour models , 1995, Proceedings of IEEE International Conference on Computer Vision.

[91]  Javier R. Movellan,et al.  Visual Speech Recognition with Stochastic Networks , 1994, NIPS.

[92]  Hiroshi Sako,et al.  Real-time facial-feature tracking based on matching techniques and its applications , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[93]  Andrew Blake,et al.  Parallel Implementation of Lagrangian Dynamics for real-time snakes , 1991 .

[94]  Stephen M. Omohundro,et al.  Nonlinear manifold learning for visual speech recognition , 1995, Proceedings of IEEE International Conference on Computer Vision.

[95]  Katsuhiko Ogata,et al.  Modern Control Engineering , 1970 .

[96]  Ian Craw,et al.  Face Recognition by Computer , 1992, BMVC.

[97]  John Daugman,et al.  Face and Gesture Recognition: Overview , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[98]  L Sirovich,et al.  Low-dimensional procedure for the characterization of human faces. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[99]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[100]  Arthur Gelb,et al.  Applied Optimal Estimation , 1974 .

[101]  V. Leitáo,et al.  Computer Graphics: Principles and Practice , 1995 .

[102]  Juergen Luettin,et al.  Acoustic-labial speaker verification , 1997, Pattern Recognit. Lett..

[103]  Jiri Matas,et al.  Lip-shape Dependent Face Verification , 1997, AVBPA.

[104]  Michael Isard,et al.  Learning to Track the Visual Motion of Contours , 1995, Artif. Intell..

[105]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[106]  David C. Hogg,et al.  Learning Flexible Models from Image Sequences , 1994, ECCV.

[107]  Hyeonjoon Moon,et al.  The FERET Evaluation Methodology for Face-Recognition Algorithms , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[108]  Roberto Brunelli,et al.  Estimation of pose and illuminant direction for face processing , 1994, Image Vis. Comput..

[109]  David C. Hogg,et al.  An efficient method for contour tracking using active shape models , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[110]  James L. Wayman,et al.  Error rate equations for the general biometric system , 1999, IEEE Robotics Autom. Mag..

[111]  Katrien van Driessen,et al.  A Fast Algorithm for the Minimum Covariance Determinant Estimator , 1999, Technometrics.

[112]  Roland Auckenthaler,et al.  Lip signatures for automatic person recognition , 1999, 1999 IEEE Third Workshop on Multimedia Signal Processing (Cat. No.99TH8451).

[113]  Jiri Matas,et al.  Audio-visual person verification , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[114]  H. McGurk,et al.  Hearing lips and seeing voices , 1976, Nature.

[115]  Anastasios Tefas,et al.  Multi modal verification for teleservices and security applications (M2VTS) , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[116]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[117]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[118]  Alexander H. Waibel,et al.  Toward movement-invariant automatic lip-reading and speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[119]  Ewert Bengtsson,et al.  Finding Facial Features Using an HLS Colour Space , 1995, ICIAP.

[120]  Penio S. Penev,et al.  Local feature analysis: A general statistical theory for object representation , 1996 .

[121]  Jiri Matas,et al.  Colour-based object recognition , 1995 .

[122]  Mark S. Nixon,et al.  Feature extraction for video rate three dimensional imaging via coloured spots , 1995, Proceedings 1995 Canadian Conference on Electrical and Computer Engineering.

[123]  Luc Vandendorpe,et al.  Multi-modal person verification tools using speech and images , 1996 .

[124]  Andrew Blake,et al.  Real-Time Lip Tracking for Audio-Visual Speech Recognition Applications , 1996, ECCV.

[125]  Jiri Matas,et al.  Selection of speaker independent feature for a speaker verification system , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[126]  M. Tovée,et al.  Representational capacity of face coding in monkeys. , 1996, Cerebral cortex.

[127]  Yochai Konig,et al.  "Eigenlips" for robust speech recognition , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[128]  Roberto Brunelli,et al.  Face Recognition through Geometrical Features , 1992, ECCV.

[129]  Jiri Matas,et al.  Effective Implementation of Linear Discriminant Analysis for Face Recognition and Verification , 1999, CAIP.

[130]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[131]  Andrew Blake,et al.  Accurate, real-time, unadorned lip tracking , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[132]  Andrew Blake,et al.  Dynamic contours: real-time active splines , 1993 .

[133]  Jean-Michel Jolion,et al.  Robust Clustering with Applications in Computer Vision , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[134]  A. Yuille Deformable Templates for Face Recognition , 1991, Journal of Cognitive Neuroscience.

[135]  M Ferguson-Pell,et al.  An empirical technique to compensate for melanin when monitoring skin microcirculation using reflectance spectrophotometry. , 1995, Medical engineering & physics.

[136]  Ying Dai,et al.  Face-texture model based on SGLD and its application in face detection in a color scene , 1996, Pattern Recognit..

[137]  Jiri Matas,et al.  Statistical Chromaticity Models for Lip Tracking with B-splines , 1997, AVBPA.

[138]  K. Åström Introduction to Stochastic Control Theory , 1970 .

[139]  Alex Pentland,et al.  Generalized Image Matching: Statistical Learning of Physically-Based Deformations , 1996, ECCV.

[140]  Takeo Kanade,et al.  Picture Processing System by Computer Complex and Recognition of Human Faces , 1974 .

[141]  Andrew Blake,et al.  Determining facial expressions in real time , 1995, Proceedings of IEEE International Conference on Computer Vision.

[142]  Alex Pentland,et al.  LAFTER: lips and face real time tracker , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[143]  A. Welch,et al.  A review of the optical properties of biological tissues , 1990 .

[144]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[145]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[146]  Andrew Stein,et al.  Robust statistics in shape fitting , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[147]  Timothy F. Cootes,et al.  Active Shape Model Search using Local Grey-Level Models: A Quantitative Evaluation , 1993, BMVC.