An Extended Sparse Classification Framework for Domain Adaptation in Video Surveillance

Still-to-video face recognition (FR) systems used in video surveillance applications capture facial trajectories across a network of distributed video cameras and compare them against stored distributed facial models. Currently, the performance of state-of-the-art systems is severely affected by changes in facial appearance caused by variations in, e.g., pose, illumination and scale in different camera viewpoints. Moreover, since an individual is typically enrolled using one or few reference stills captured during enrolment, face models are not robust to intra-class variation. In this paper, the Extended Sparse Representation Classification through Domain Adaptation (ESRC-DA) algorithm is proposed to improve performance of still-to-video FR. The system’s facial models are thereby enhanced by integrating variational information from its operational domain. In particular, robustness to intra-class variations is improved by exploiting: (1) an under-sampled dictionary from target reference facial stills captured under controlled conditions; and (2) an auxiliary dictionary from an abundance of unlabelled facial trajectories captured under different conditions, from each camera viewpoint in the surveillance network. Accuracy and efficiency of the proposed technique is compared to state-of-the-art still-to-video FR techniques using videos from the Chokepoint and COX-S2V databases. Results indicate that ESRC-DA with dictionary learning of unlabelled trajectories provides the highest level of accuracy, while maintaining a low complexity.

[1]  Rama Chellappa,et al.  Domain Adaptive Dictionary Learning , 2012, ECCV.

[2]  Xin Yao,et al.  The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift , 2010, IEEE Transactions on Knowledge and Data Engineering.

[3]  Guillermo Sapiro,et al.  See all by looking at a few: Sparse modeling for finding representative objects , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[5]  Marcello Pelillo,et al.  A matrix factorization approach to graph compression with partial information , 2015, Int. J. Mach. Learn. Cybern..

[6]  Allen Y. Yang,et al.  Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[8]  Rama Chellappa,et al.  Subspace Interpolation via Dictionary Learning for Unsupervised Domain Adaptation , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Shiguang Shan,et al.  Joint sparse representation for video-based face recognition , 2014, Neurocomputing.

[10]  Dmitry O. Gorodnichy,et al.  Detector ensembles for face recognition in video surveillance , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[11]  Yaakov Tsaig,et al.  Fast Solution of $\ell _{1}$ -Norm Minimization Problems When the Solution May Be Sparse , 2008, IEEE Transactions on Information Theory.

[12]  Ivor W. Tsang,et al.  Domain adaptation from multiple sources via auxiliary classifiers , 2009, ICML '09.

[13]  Robert Sabourin,et al.  Adaptive appearance model tracking for still-to-video face recognition , 2016, Pattern Recognit..

[14]  Shruthi Nagendra,et al.  Video-Based Face Recognition and Face-Tracking using Sparse Representation Based Categorization☆ , 2015 .

[15]  Rama Chellappa,et al.  Visual Domain Adaptation: A survey of recent advances , 2015, IEEE Signal Processing Magazine.

[16]  Yu-Chiang Frank Wang,et al.  Undersampled Face Recognition via Robust Auxiliary Dictionary Learning , 2015, IEEE Transactions on Image Processing.

[17]  Kjersti Engan,et al.  Method of optimal directions for frame design , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[18]  Robert Sabourin,et al.  Ensembles of exemplar-SVMs for video face recognition from a single sample per person , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[19]  Jesús García,et al.  Context-based Information Fusion: A survey and discussion , 2015, Inf. Fusion.

[20]  Lynette I. Millett,et al.  Biometric Recognition: Challenges and Opportunities , 2010 .

[21]  Brian C. Lovell,et al.  Face Recognition from Still Images to Video Sequences: A Local-Feature-Based Framework , 2011, EURASIP J. Image Video Process..

[22]  Lei Zhang,et al.  Sparse Variation Dictionary Learning for Face Recognition with a Single Training Sample per Person , 2013, 2013 IEEE International Conference on Computer Vision.

[23]  Junzhou Huang,et al.  The role of dictionary learning on sparse representation-based classification , 2013, PETRA '13.

[24]  Jian Yang,et al.  Robust sparse coding for face recognition , 2011, CVPR 2011.

[25]  Mohammed Bennamoun,et al.  Sparse Representation for Video-Based Face Recognition , 2009, ICB.

[26]  Larry S. Davis,et al.  Discriminative Dictionary Learning with Pairwise Constraints , 2012, ACCV.

[27]  Shiguang Shan,et al.  Adaptive discriminant learning for face recognition , 2013, Pattern Recognit..

[28]  Jun Guo,et al.  Extended SRC: Undersampled Face Recognition via Intraclass Variant Dictionary , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Eric Granger,et al.  Learning of Graph Compressed Dictionaries for Sparse Representation Classification , 2016, ICPRAM.

[30]  Rama Chellappa,et al.  Dictionary-Based Domain Adaptation Methods for the Re-identification of Faces , 2014, Person Re-Identification.

[31]  Anna Margolis,et al.  Automatic Annotation of Spoken Language Using Out-of-Domain Resources and Domain Adaptation , 2011 .

[32]  Shiguang Shan,et al.  Log-Euclidean Metric Learning on Symmetric Positive Definite Manifold with Application to Image Set Classification , 2015, ICML.

[33]  Zhi-Hua Zhou,et al.  Face recognition from a single image per person: A survey , 2006, Pattern Recognit..

[34]  Yongkang Wong,et al.  Patch-based probabilistic image quality assessment for face selection and improved video-based face recognition , 2011, CVPR 2011 WORKSHOPS.

[35]  Shiguang Shan,et al.  A Benchmark and Comparative Study of Video-Based Face Recognition on COX Face Database , 2015, IEEE Transactions on Image Processing.

[36]  Behrooz Kamgar-Parsi,et al.  Toward Development of a Face Recognition System for Watchlist Surveillance , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Ivor W. Tsang,et al.  Domain Transfer Multiple Kernel Learning , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Haifeng Hu,et al.  Patch-based Sparse Dictionary Representation for Face Recognition with Single Sample per Person , 2015, CCBR.

[39]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[40]  Rama Chellappa,et al.  Generalized Domain-Adaptive Dictionaries , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.