Partially-supervised learning from facial trajectories for face recognition in video surveillance

Face recognition (FR) is employed in several video surveillance applications to determine if facial regions captured over a network of cameras correspond to a target individuals. To enroll target individuals, it is often costly or unfeasible to capture enough high quality reference facial samples a priori to design representative facial models. Furthermore, changes in capture conditions and physiology contribute to a growing divergence between these models and faces captured during operations. Adaptive biometrics seek to maintain a high level of performance by updating facial models over time using operational data. Adaptive multiple classifier systems (MCSs) have been successfully applied to video-to-video FR, where the face of each target individual is modeled using an ensemble of 2-class classifiers (trained using target vs. non-target samples). In this paper, a new adaptive MCS is proposed for partially-supervised learning of facial models over time based on facial trajectories. During operations, information from a face tracker and individual-specific ensembles is integrated for robust spatio-temporal recognition and for self-update of facial models. The tracker defines a facial trajectory for each individual that appears in a video, which leads to the recognition of a target individual if the positive predictions accumulated along a trajectory surpass a detection threshold for an ensemble. When the number of positive ensemble predictions surpasses a higher update threshold, then all target face samples from the trajectory are combined with non-target samples (selected from the cohort and universal models) to update the corresponding facial model. A learn-and-combine strategy is employed to avoid knowledge corruption during self-update of ensembles. In addition, a memory management strategy based on Kullback-Leibler divergence is proposed to rank and select the most relevant target and non-target reference samples to be stored in memory as the ensembles evolves. For proof-of-concept, a particular realization of the proposed system was validated with videos from Face in Action dataset. Initially, trajectories captured from enrollment videos are used for supervised learning of ensembles, and then videos from various operational sessions are presented to the system for FR and self-update with high-confidence trajectories. At a transaction level, the proposed approach outperforms baseline systems that do not adapt to new trajectories, and provides comparable performance to ideal systems that adapt to all relevant target trajectories, through supervised learning. Subject-level analysis reveals the existence of individuals for which self-updating ensembles with unlabeled facial trajectories provides a considerable benefit. Trajectory-level analysis indicates that the proposed system allows for robust spatio-temporal video-to-video FR, and may therefore enhance security and situation analysis in video surveillance.

[1]  Douglas A. Reynolds,et al.  SHEEP, GOATS, LAMBS and WOLVES A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation , 1998 .

[2]  Robert Sabourin,et al.  Iterative Boolean combination of classifiers in the ROC space: An application to anomaly detection with HMMs , 2010, Pattern Recognit..

[3]  Gian Luca Marcialis,et al.  Capturing large intra-class variations of biometric data by template co-updating , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4]  J. William Ahwood,et al.  CLASSIFICATION , 1931, Foundations of Familiar Language.

[5]  Chee Peng Lim,et al.  Probabilistic Fuzzy ARTMAP: an autonomous neural network architecture for Bayesian probability estimation , 1995 .

[6]  I. Tomek,et al.  Two Modifications of CNN , 1976 .

[7]  Johannes Stallkamp,et al.  A video-based door monitoring system using local appearance-based face models , 2010, Comput. Vis. Image Underst..

[8]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[10]  S. Gentric,et al.  Border control: From Technical to Operational evaluation , 2012 .

[11]  Serge J. Belongie,et al.  Active Learning in Face Recognition: Using Tracking to Build a Face Model , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[12]  Behrooz Kamgar-Parsi,et al.  Toward Development of a Face Recognition System for Watchlist Surveillance , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  David D. Lewis,et al.  Heterogeneous Uncertainty Sampling for Supervised Learning , 1994, ICML.

[14]  Gian Luca Marcialis,et al.  Adaptive Biometric Systems That Can Improve with Use , 2008 .

[15]  Gian Luca Marcialis,et al.  Template Update Methods in Adaptive Biometric Systems: A Critical Review , 2009, ICB.

[16]  Tsuhan Chen,et al.  Video-based face recognition using adaptive hidden Markov models , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[17]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[18]  Rama Chellappa,et al.  Video-based face recognition via joint sparse representation , 2013, 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG).

[19]  Jean-Luc Dugelay,et al.  Person recognition using facial video information: A state of the art , 2009, J. Vis. Lang. Comput..

[20]  Robert Sabourin,et al.  Evolution of heterogeneous ensembles through dynamic particle swarm optimization for video-based face recognition , 2012, Pattern Recognit..

[21]  Gary Bradski,et al.  Computer Vision Face Tracking For Use in a Perceptual User Interface , 1998 .

[22]  Jane You,et al.  Semi-supervised classification based on random subspace dimensionality reduction , 2012, Pattern Recognit..

[23]  Robert P. W. Duin,et al.  Growing a multi-class classifier with a reject option , 2008, Pattern Recognit. Lett..

[24]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[25]  Gian Luca Marcialis,et al.  Semi-supervised PCA-Based Face Recognition Using Self-training , 2006, SSPR/SPR.

[26]  Claude E. Shannon,et al.  The mathematical theory of communication , 1950 .

[27]  S. Buxbaum Sheep , 2004 .

[28]  Shaogang Gong,et al.  Analysis and Modelling of Faces and Gestures , 2008 .

[29]  Shlomo Argamon,et al.  Committee-Based Sampling For Training Probabilistic Classi(cid:12)ers , 1995 .

[30]  Gian Luca Marcialis,et al.  Template Co-update in Multimodal Biometric Systems , 2007, ICB.

[31]  Fabio Roli,et al.  Using Co-training and Self-training in Semi-supervised Multiple Classifier Systems , 2006, SSPR/SPR.

[32]  Peter A. Flach,et al.  On classification, ranking, and probability estimation , 2007, Probabilistic, Logical and Relational Learning - A Further Synthesis.

[33]  Josef Kittler,et al.  Extracting discriminative information from cohort models , 2010, 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[34]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[35]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[36]  Dario Maio,et al.  Incremental template updating for face recognition in home environments , 2010, Pattern Recognit..

[37]  Harry Wechsler,et al.  Open set face recognition using transduction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Yongbin Zhang,et al.  From Stills to Video: Face Recognition Using a Probabilistic Approach , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[39]  Tsuhan Chen,et al.  The CMU Face In Action (FIA) Database , 2005, AMFG.

[40]  Dmitry O. Gorodnichy,et al.  Incremental update of biometric models in face-based video surveillance , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[41]  Ching Y. Suen,et al.  A class-modular feedforward neural network for handwriting recognition , 2002, Pattern Recognit..

[42]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[43]  Ke Lu,et al.  A novel semi-supervised face recognition for video , 2010, 2010 International Conference on Intelligent Control and Information Processing.

[44]  Gongping Yang,et al.  On the Class Imbalance Problem , 2008, 2008 Fourth International Conference on Natural Computation.

[45]  Arun Ross,et al.  Learning user-specific parameters in a multibiometric system , 2002, Proceedings. International Conference on Image Processing.

[46]  Rama Chellappa,et al.  Probabilistic recognition of human faces from video , 2002, Proceedings. International Conference on Image Processing.

[47]  Neamat El Gayar,et al.  Face recognition with semi-supervised learning and multiple classifiers , 2006 .

[48]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[49]  Xin Yao,et al.  An analysis of diversity measures , 2006, Machine Learning.

[50]  Arun Ross,et al.  Biometric classifier update using online learning: A case study in near infrared face verification , 2010, Image Vis. Comput..

[51]  Stefan Wrobel,et al.  Active Hidden Markov Models for Information Extraction , 2001, IDA.

[52]  Dmitry O. Gorodnichy,et al.  Detector ensembles for face recognition in video surveillance , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[53]  Gian Luca Marcialis,et al.  Template Selection by Editing Algorithms: A Case Study in Face Recognition , 2008, SSPR/SPR.

[54]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[55]  Nicu Sebe,et al.  Semisupervised learning of classifiers: theory, algorithms, and their application to human-computer interaction , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Tobias Bjerregaard,et al.  A survey of research and practices of Network-on-chip , 2006, CSUR.

[57]  K. Okada,et al.  An adaptive person recognition system , 2001, Proceedings 10th IEEE International Workshop on Robot and Human Interactive Communication. ROMAN 2001 (Cat. No.01TH8591).

[58]  Weifeng Liu,et al.  Kernel Adaptive Filtering: A Comprehensive Introduction , 2010 .

[59]  Vasant Honavar,et al.  Learn++: an incremental learning algorithm for supervised neural networks , 2001, IEEE Trans. Syst. Man Cybern. Part C.