VT-GAN: View Transformation GAN for Gait Recognition Across Views

Recognizing gaits without human cooperation is of importance in surveillance and forensics because of the benefits that gait is unique and collected remotely. However, change of camera view angle severely degrades the performance of gait recognition. To address the problem, previous methods usually learn mappings for each pair of views which incurs abundant independently built models. In this paper, we proposed a View Transformation Generative Adversarial Networks (VT-GAN) to achieve view transformation of gaits across two arbitrary views using only one uniform model. In specific, we generated gaits in target view conditioned on input images from any views and the corresponding target view indicator. In addition to the classical discriminator in GAN which makes the generated images look realistic, a view classifier is imposed. This controls the consistency of generated images and conditioned target view indicator and ensures to generate gaits in the specified target view. On the other hand, retaining identity information while performing view transformation is another challenge. To solve the issue, an identity distilling module with triplet loss is integrated, which constrains the generated images inheriting identity information from inputs and yields discriminative feature embeddings. The proposed VT-GAN generates visually promising gaits and achieves promising performances for cross-view gait recognition, which exhibits great effectiveness of the proposed VT-GAN.

[1]  Xianglei Xing,et al.  Complete canonical correlation analysis with application to multi-view gait recognition , 2016, Pattern Recognit..

[2]  Qiang Wu,et al.  Support vector regression for multi-view gait recognition based on local motion feature selection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[3]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[4]  Kanad K. Biswas,et al.  Biometric Gait Recognition with Carrying and Clothing Variants , 2009, PReMI.

[5]  Yasushi Makihara,et al.  Cross-view gait recognition using view-dependent discriminative analysis , 2014, IEEE International Joint Conference on Biometrics.

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Hua Li,et al.  3D gait recognition using multiple cameras , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[8]  Yasushi Makihara,et al.  View Transformation Model Incorporating Quality Measures for Cross-View Gait Recognition , 2016, IEEE Transactions on Cybernetics.

[9]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Qiang Wu,et al.  Coupled Bilinear Discriminant Projection for Cross-View Gait Recognition , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Qiang Wu,et al.  Multi-Pseudo Regularized Label for Generated Data in Person Re-Identification , 2018, IEEE Transactions on Image Processing.

[12]  Bernt Schiele,et al.  Generative Adversarial Text to Image Synthesis , 2016, ICML.

[13]  Xiaogang Wang,et al.  A Comprehensive Study on Cross-View Gait Based Human Identification with Deep CNNs , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  LinLin Shen,et al.  Invariant feature extraction for gait recognition using only one uniform model , 2017, Neurocomputing.

[16]  Yasushi Makihara,et al.  Gait Recognition Using a View Transformation Model in the Frequency Domain , 2006, ECCV.

[17]  Hongming Shan,et al.  Multi-Task GANs for View-Specific Feature Learning in Gait Recognition , 2019, IEEE Transactions on Information Forensics and Security.

[18]  Peng Zhang,et al.  A general tensor representation framework for cross-view gait recognition , 2019, Pattern Recognit..

[19]  Tieniu Tan,et al.  A Framework for Evaluating the Effect of View Angle, Clothing and Carrying Condition on Gait Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[20]  Qiang Wu,et al.  Robust Gait Recognition under Unconstrained Environments Using Hybrid Descriptions , 2017, 2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[21]  Tieniu Tan,et al.  Robust view transformation model for gait recognition , 2011, 2011 18th IEEE International Conference on Image Processing.

[22]  Xiang Li,et al.  Joint Intensity and Spatial Metric Learning for Robust Gait Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  James J. Little,et al.  View-Invariant Discriminative Projection for Multi-View Gait-Based Human Identification , 2013, IEEE Transactions on Information Forensics and Security.

[24]  Bir Bhanu,et al.  Individual recognition using gait energy image , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Yanwei Zheng,et al.  DeepDiff: Learning deep difference features on human body parts for person re-identification , 2017, Neurocomputing.

[26]  Aaron F. Bobick,et al.  Gait recognition using static, activity-specific parameters , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[27]  Yasushi Makihara,et al.  GEINet: View-invariant gait recognition using a convolutional neural network , 2016, 2016 International Conference on Biometrics (ICB).

[28]  Qiang Wu,et al.  Gait Recognition Under Various Viewing Angles Based on Correlated Motion Regression , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Luc Van Gool,et al.  Pose Guided Person Image Generation , 2017, NIPS.

[30]  Shiqi Yu,et al.  GaitGAN: Invariant Gait Feature Extraction Using Generative Adversarial Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[31]  Ersin Yumer,et al.  Neural Face Editing with Intrinsic Image Disentangling , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Shiqi Yu,et al.  GaitGANv2: Invariant gait feature extraction using generative adversarial networks , 2019, Pattern Recognit..

[33]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[34]  Aaron F. Bobick,et al.  Modelling the effects of walking speed on appearance-based gait recognition , 2004, CVPR 2004.

[35]  Qiang Wu,et al.  Multiple views gait recognition using View Transformation Model based on optimized Gait Energy Image , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[36]  Rama Chellappa,et al.  Discriminant analysis of principal components for face recognition , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[37]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[38]  Jung-Woo Ha,et al.  StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.