Advances in Multimedia Information Processing – PCM 2012

Video-based face recognition is a fundamental topic in image and video analysis, and presents various challenges and opportunities. In this paper, we introduce an incremental learning approach to video-based face recognition, which efficiently exploits the spatiotemporal information in videos. Face image sequences are incrementally clustered based on their descriptors. With the quantization of the facial words, representation of the face image is generated by concatenating the histograms from regions. In the online recognition, a temporal matrix and a voting algorithm are employed to judge a face video’s identity. The proposed method achieves a 100% recognition rate performed on the Honda/UCSD database, and gives near realtime feedback. Experimental results demonstrate the effectiveness and flexibility of our proposed method.

[1]  Jianfei Cai,et al.  Joint source channel rate-distortion analysis for adaptive mode selection and rate control in wireless video coding , 2002, IEEE Trans. Circuits Syst. Video Technol..

[2]  Antti Hallapuro,et al.  High Performance, Low Complexity Video Coding and the Emerging HEVC Standard , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Rui Zhang,et al.  Video coding with optimal inter/intra-mode switching for packet loss resilience , 2000, IEEE Journal on Selected Areas in Communications.

[4]  Jonathan Loo,et al.  Error-Resilient Scheme for Wavelet Video Codec Using Automatic ROI Detection and Wyner-Ziv Coding Over Packet Erasure Channel , 2010, IEEE Transactions on Broadcasting.

[5]  Bing Zeng,et al.  A new three-step search algorithm for block motion estimation , 1994, IEEE Trans. Circuits Syst. Video Technol..

[6]  Feng Wu,et al.  Channel Distortion Modeling for Multi-View Video Transmission Over Packet-Switched Networks , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Yongdong Zhang,et al.  High Efficiency Video Coding: High Efficiency Video Coding , 2014 .

[8]  Hujun Bao,et al.  Shadow Removal in Sole Outdoor Image , 2006, PCM.

[9]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[10]  Homer H. Chen,et al.  SSIM-Based Perceptual Rate Control for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Xinhua Zhuang,et al.  SNR-Based Bit Allocation in Video Quality Smoothing , 2006, PCM.

[12]  Lei Sun,et al.  Content Based Hierarchical Fast Coding Unit Decision Algorithm for HEVC , 2011, 2011 International Conference on Multimedia and Signal Processing.

[13]  Ee-Chien Chang,et al.  Error resilient content-based image authentication over wireless channel , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[14]  Hatice Gunes,et al.  Continuous Prediction of Spontaneous Affect from Multiple Cues and Modalities in Valence-Arousal Space , 2011, IEEE Transactions on Affective Computing.

[15]  Deanna Needell,et al.  CoSaMP: Iterative signal recovery from incomplete and inaccurate samples , 2008, ArXiv.

[16]  Kai-Kuang Ma,et al.  A new diamond search algorithm for fast block-matching motion estimation , 2000, IEEE Trans. Image Process..

[17]  Neng-Sheng Pai,et al.  An embedded system for real-time facial expression recognition based on the extension theory , 2011, Comput. Math. Appl..

[18]  Weiming Lu,et al.  Region-Based Semantic Similarity Propagation for Image Retrieval , 2006, PCM.

[19]  Thomas Wiegand,et al.  Lagrange multiplier selection in hybrid video coder control , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[20]  Stewart T. Worrall,et al.  Error resilience for multi-view video using redundant macroblock coding , 2011, 2011 6th International Conference on Industrial and Information Systems.

[21]  Wen Gao,et al.  SSIM-Motivated Rate-Distortion Optimization for Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Yao Wang,et al.  Modeling of transmission-loss-induced distortion in decoded video , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Tianxu Zhang,et al.  Distributed Data Visualization Tools for Multidisciplinary Design Optimization of Aero-crafts , 2006, PCM.

[24]  Ming-Wei Huang,et al.  Facial Expression Recognition Based on Fusion of Sparse Representation , 2010, ICIC.