TransRPPG: Remote Photoplethysmography Transformer for 3D Mask Face Presentation Attack Detection

3D mask face presentation attack detection (PAD) plays a vital role in securing face recognition systems from emergent 3D mask attacks. Recently, remote photoplethysmography (rPPG) has been developed as an intrinsic liveness clue for 3D mask PAD without relying on the mask appearance. However, the rPPG features for 3D mask PAD are still needed expert knowledge to design manually, which limits its further progress in the deep learning and big data era. In this letter, we propose a pure rPPG transformer (TransRPPG) framework for learning intrinsic liveness representation efficiently. At first, rPPG-based multi-scale spatial-temporal maps (MSTmap) are constructed from facial skin and background regions. Then the transformer fully mines the global relationship within MSTmaps for liveness representation, and gives a binary prediction for 3D mask detection. Comprehensive experiments are conducted on two benchmark datasets to demonstrate the efficacy of the TransRPPG on both intra- and cross-dataset testings. Our TransRPPG is lightweight and efficient (with only 547 K parameters and 763 M FLOPs), which is promising for mobile-level applications.

[1]  Xiaoming Liu,et al.  Face De-Spoofing: Anti-Spoofing via Noise Modeling , 2018, ECCV.

[2]  Matti Pietikäinen,et al.  Generalized face anti-spoofing by detecting pulse from face videos , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[3]  Dong Cao,et al.  Learning Meta Face Recognition in Unseen Domains , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Gerard de Haan,et al.  Robust Pulse Rate From Chrominance-Based rPPG , 2013, IEEE Transactions on Biomedical Engineering.

[5]  Junjie Yan,et al.  A face antispoofing database with diverse attacks , 2012, 2012 5th IAPR International Conference on Biometrics (ICB).

[6]  Xiaoming Liu,et al.  Learning Deep Models for Face Anti-Spoofing: Binary or Auxiliary Supervision , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Anil K. Jain,et al.  Secure Face Unlock: Spoof Detection on Smartphones , 2016, IEEE Transactions on Information Forensics and Security.

[8]  Guodong Guo,et al.  A survey on 3D mask presentation attack detection and countermeasures , 2020, Pattern Recognit..

[9]  Nicolas Usunier,et al.  End-to-End Object Detection with Transformers , 2020, ECCV.

[10]  Sergio Escalera,et al.  Cross-ethnicity Face Anti-spoofing Recognition Challenge: A Review , 2020, IET Biom..

[11]  Hu Han,et al.  Video-based Remote Physiological Measurement via Cross-verified Feature Disentangling , 2020, ECCV.

[12]  Jingang Shi,et al.  AutoHR: A Strong End-to-End Baseline for Remote Heart Rate Measurement With Neural Searching , 2020, IEEE Signal Processing Letters.

[13]  Jukka Komulainen,et al.  Face anti-spoofing based on color texture analysis , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[14]  Louis-Philippe Morency,et al.  OpenFace 2.0: Facial Behavior Analysis Toolkit , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[15]  Sander Stuijk,et al.  Exploiting Spatial Redundancy of Image Sensor for Motion Robust rPPG , 2015, IEEE Transactions on Biomedical Engineering.

[16]  Guoying Zhao,et al.  Face Liveness Detection by rPPG Features and Contextual Patch-Based CNN , 2019, ICBEA.

[17]  Sébastien Marcel,et al.  Spoofing Face Recognition With 3D Masks , 2014, IEEE Transactions on Information Forensics and Security.

[18]  Anil K. Jain,et al.  Face Spoof Detection With Image Distortion Analysis , 2015, IEEE Transactions on Information Forensics and Security.

[19]  Matti Pietikäinen,et al.  Remote Heart Rate Measurement from Face Videos under Realistic Situations , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Daniel McDuff,et al.  DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks , 2018, ECCV.

[21]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[22]  Georg Heigold,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2021, ICLR.

[23]  Jingang Shi,et al.  Face Anti-Spoofing with Human Material Perception , 2020, ECCV.

[24]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[25]  Chenxu Zhao,et al.  Searching Central Difference Convolutional Networks for Face Anti-Spoofing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Guoying Zhao,et al.  Remote Photoplethysmograph Signal Measurement from Facial Videos Using Spatio-Temporal Networks , 2019, BMVC.

[27]  Pong C. Yuen,et al.  Remote Photoplethysmography Correspondence Feature for 3D Mask Face Presentation Attack Detection , 2018, ECCV.

[28]  Jukka Komulainen,et al.  Face Antispoofing Using Speeded-Up Robust Features and Fisher Vector Encoding , 2017, IEEE Signal Processing Letters.

[29]  Guoying Zhao,et al.  Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[30]  Xiangyu Zhu,et al.  Deep Spatial Gradient and Temporal Depth Learning for Face Anti-Spoofing , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Sander Stuijk,et al.  Algorithmic Principles of Remote PPG , 2017, IEEE Transactions on Biomedical Engineering.

[32]  Pichao Wang,et al.  TransReID: Transformer-based Object Re-Identification , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[33]  Xilin Chen,et al.  RhythmNet: End-to-End Heart Rate Estimation From Face via Spatial-Temporal Representation , 2019, IEEE Transactions on Image Processing.

[34]  Jukka Komulainen,et al.  Face Spoofing Detection Using Colour Texture Analysis , 2016, IEEE Transactions on Information Forensics and Security.

[35]  Jun Wan,et al.  NAS-FAS: Static-Dynamic Central Difference Network Search for Face Anti-Spoofing , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Jiri Matas,et al.  Visual Heart Rate Estimation with Convolutional Neural Network , 2018, BMVC.

[37]  L. O. Svaasand,et al.  Remote plethysmographic imaging using ambient light. , 2008, Optics express.

[38]  Guoying Zhao,et al.  3D Mask Face Anti-spoofing with Remote Photoplethysmography , 2016, ECCV.

[39]  Daniel McDuff,et al.  Advancements in Noncontact, Multiparameter Physiological Measurements Using a Webcam , 2011, IEEE Transactions on Biomedical Engineering.

[40]  Zhaoqiang Xia,et al.  Revisiting Pixel-Wise Supervision for Face Anti-Spoofing , 2020, IEEE Transactions on Biometrics, Behavior, and Identity Science.

[41]  Xiaoyue Jiang,et al.  Replayed Video Attack Detection Based on Motion Blur Analysis , 2019, IEEE Transactions on Information Forensics and Security.