GaitGCI: Generative Counterfactual Intervention for Gait Recognition

Gait is one of the most promising biometrics that aims to identify pedestrians from their walking patterns. However, prevailing methods are susceptible to confounders, resulting in the networks hardly focusing on the regions that re-flect effective walking patterns. To address this fundamen-tal problem in gait recognition, we propose a Generative Counterfactual Intervention framework, dubbed GaitGCI, consisting of Counterfactual Intervention Learning (CIL) and Diversity-Constrained Dynamic Convolution (DCDC). CIL eliminates the impacts of confounders by maximizing the likelihood difference between factual/counterfactual attention while DCDC adaptively generates sample-wise factual/counterfactual attention to efficiently perceive the sample-wise properties. With matrix decomposition and diversity constraint, DCDC guarantees the model to be efficient and effective. Extensive experiments indicate that proposed GaitGCI.· 1) could effectively focus on the discrimi-native and interpretable regions that reflect gait pattern; 2) is model-agnostic and could be plugged into existing models to improve performance with nearly no extra cost; 3) efficiently achieves state-of-the-art performance on arbitrary scenarios (in-the-lab and in-the-wild).

[1]  Wei Su,et al.  MetaGait: Learning to Learn an Omni Sample Adaptive Representation for Gait Recognition , 2023, ECCV.

[2]  Peihan Miao,et al.  Language Adaptive Weight Generation for Multi-Task Visual Grounding , 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Pavel Izmailov,et al.  On Feature Learning in the Presence of Spurious Correlations , 2022, NeurIPS.

[4]  Yuhan Zhao,et al.  A Large-Scale Synthetic Gait Dataset Towards in-the-Wild Simulation and Comparison Study , 2022, ACM Trans. Multim. Comput. Commun. Appl..

[5]  Annan Li,et al.  Lagrange Motion Analysis and View Embeddings for Improved Gait Recognition , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Xilin Chen,et al.  Salient-to-Broad Transition for Video Person Re-identification , 2022, Computer Vision and Pattern Recognition.

[7]  Yuhan Zhao,et al.  GaitMPL: Gait Recognition With Memory-Augmented Progressive Learning , 2022, IEEE Transactions on Image Processing.

[8]  Xinchen Liu,et al.  Gait Recognition in the Wild with Dense 3D Representations and A Benchmark , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Li Fuxin,et al.  Cycle-Consistent Counterfactuals by Latent Transformations , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yongzhen Huang,et al.  GaitEdge: Beyond Plain End-to-end Gait Recognition for Better Practicality , 2022, ECCV.

[11]  Xinmei Tian,et al.  3D Local Convolutional Neural Networks for Gait Recognition , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[12]  Jie Zhou,et al.  Gait Recognition in the Wild: A Benchmark , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[13]  Xinggang Wang,et al.  Context-Sensitive Temporal Feature Learning for Gait Recognition , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[14]  Jiwen Lu,et al.  Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Anna Goldenberg,et al.  Towards Robust Classification Model by Counterfactual and Invariant Data Generation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Lu Yuan,et al.  Revisiting Dynamic Convolution via Matrix Decomposition , 2021, ICLR.

[17]  Xian-Sheng Hua,et al.  Counterfactual Zero-Shot and Open-Set Visual Recognition , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Gao Huang,et al.  Dynamic Neural Networks: A Survey , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  G. Rigoll,et al.  Gaitgraph: Graph Convolutional Network for Skeleton-Based Gait Recognition , 2021, 2021 IEEE International Conference on Image Processing (ICIP).

[20]  Bin Li,et al.  Condition-Aware Comparison Scheme for Gait Recognition , 2020, IEEE Transactions on Image Processing.

[21]  Xin Yu,et al.  Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  John P. Dickerson,et al.  Counterfactual Explanations for Machine Learning: A Review , 2020, ArXiv.

[23]  Shunli Zhang,et al.  Gait Recognition with Multiple-Temporal-Scale 3D Convolutional Neural Network , 2020, ACM Multimedia.

[24]  Xian-Sheng Hua,et al.  Interventional Few-Shot Learning , 2020, NeurIPS.

[25]  Jinhui Tang,et al.  Causal Intervention for Weakly-Supervised Semantic Segmentation , 2020, NeurIPS.

[26]  Daniel C. Castro,et al.  Deep Structural Causal Models for Tractable Counterfactual Inference , 2020, NeurIPS.

[27]  Hanwang Zhang,et al.  Visual Commonsense Representation Learning via Causal Inference , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[28]  Qing Li,et al.  GaitPart: Temporal Part-Based Model for Gait Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Anton van den Hengel,et al.  Counterfactual Vision and Language Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  M. Bethge,et al.  Shortcut learning in deep neural networks , 2020, Nature Machine Intelligence.

[31]  Xiangyu Zhang,et al.  Dynamic Region-Aware Convolution , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Shiqi Yu,et al.  A model-based gait recognition method with body pose and human prior knowledge , 2020, Pattern Recognit..

[33]  Lu Yuan,et al.  Dynamic Convolution: Attention Over Convolution Kernels , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Tinne Tuytelaars,et al.  Dynamic Convolutions: Exploiting Spatial Sparsity for Faster Inference , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Pedro Miguel Moreira,et al.  Benchmark RGB-D Gait Datasets: A Systematic Review , 2019, VipIMAGE 2019.

[36]  Feng Liu,et al.  On Learning Disentangled Representations for Gait Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Yang Yang,et al.  ABD-Net: Attentive but Diverse Person Re-Identification , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[38]  Kaiqi Huang,et al.  Towards Rich Feature Discovery With Class Activation Maps Augmentation for Person Re-Identification , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Ziyan Wu,et al.  Counterfactual Visual Explanations , 2019, ICML.

[40]  Hang Su,et al.  Pixel-Adaptive Convolutional Neural Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Quoc V. Le,et al.  CondConv: Conditionally Parameterized Convolutions for Efficient Inference , 2019, NeurIPS.

[42]  Yasushi Makihara,et al.  Multi-view large population gait dataset and its performance evaluation for cross-view gait recognition , 2018, IPSJ Transactions on Computer Vision and Applications.

[43]  Jianfeng Feng,et al.  GaitSet: Regarding Gait as a Set for Cross-View Gait Recognition , 2018, AAAI.

[44]  Manuel J. Marín-Jiménez,et al.  Multimodal feature fusion for CNN-based gait recognition: an empirical comparison , 2018, Neural Computing and Applications.

[45]  Kate Saenko,et al.  Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[46]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[47]  Manuel J. Marín-Jiménez,et al.  Evaluation of Cnn Architectures for Gait Recognition Based on Optical Flow Maps , 2017, 2017 International Conference of the Biometrics Special Interest Group (BIOSIG).

[48]  Iasonas Kokkinos,et al.  Segmentation-Aware Convolutional Networks Using Local Attention Masks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[49]  Song-Chun Zhu,et al.  Inferring Hidden Statuses and Actions in Video by Causal Reasoning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[50]  Lucas Beyer,et al.  In Defense of the Triplet Loss for Person Re-Identification , 2017, ArXiv.

[51]  Matt J. Kusner,et al.  Counterfactual Fairness , 2017, NIPS.

[52]  Xiaogang Wang,et al.  A Comprehensive Study on Cross-View Gait Based Human Identification with Deep CNNs , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53]  Feiping Nie,et al.  On the schatten norm for matrix based subspace learning and classification , 2016, Neurocomputing.

[54]  Hongbin Zha,et al.  A Unified Convex Surrogate for the Schatten-p Norm , 2016, AAAI.

[55]  Dimitris Kastaniotis,et al.  Pose-based gait recognition with local gradient descriptors and hierarchically aggregated residuals , 2016, J. Electronic Imaging.

[56]  Abhishek Das,et al.  Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[57]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[58]  Yasushi Makihara,et al.  GEINet: View-invariant gait recognition using a convolutional neural network , 2016, 2016 International Conference on Biometrics (ICB).

[59]  Trevor Darrell,et al.  Generating Visual Explanations , 2016, ECCV.

[60]  Shuicheng Yan,et al.  Nonconvex Nonsmooth Low Rank Minimization via Iteratively Reweighted Nuclear Norm , 2015, IEEE Transactions on Image Processing.

[61]  Taiji Suzuki,et al.  Convex Tensor Decomposition via Structured Schatten Norm Regularization , 2013, NIPS.

[62]  Gunawan Ariyanto,et al.  Model-based 3D gait biometrics , 2011, 2011 International Joint Conference on Biometrics (IJCB).

[63]  Chen Wang,et al.  Chrono-Gait Image: A Novel Temporal Template for Gait Recognition , 2010, ECCV.

[64]  Osama Masoud,et al.  View-independent human motion classification using image-based reconstruction , 2009, Image Vis. Comput..

[65]  Nikolaos V. Boulgouris,et al.  Gait Recognition Based on Human Body Components , 2007, 2007 IEEE International Conference on Image Processing.

[66]  Tieniu Tan,et al.  A Framework for Evaluating the Effect of View Angle, Clothing and Carrying Condition on Gait Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[67]  Hua Li,et al.  3D gait recognition using multiple cameras , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[68]  Tieniu Tan,et al.  Fusion of static and dynamic body biometrics for gait recognition , 2003, IEEE Transactions on Circuits and Systems for Video Technology.

[69]  Tieniu Tan,et al.  Gait recognition based on Procrustes shape analysis , 2002, Proceedings. International Conference on Image Processing.

[70]  Judea Pearl,et al.  Direct and Indirect Effects , 2001, UAI.

[71]  Pengyi Zhang,et al.  Adaptive Cross-domain Learning for Generalizable Person Re-identification , 2022, ECCV.

[72]  Yasushi Makihara,et al.  End-to-End Model-Based Gait Recognition , 2020, ACCV.

[73]  Yongzhen Huang,et al.  Gait Lateral Network: Learning Discriminative and Compact Representations for Gait Recognition , 2020, ECCV.

[74]  Björn W. Schuller,et al.  The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits , 2014, J. Vis. Commun. Image Represent..

[75]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[76]  Marco Serra Dynamic Convolution , 2022 .