A General Remote Photoplethysmography Estimator with Spatiotemporal Convolutional Network

Remote PPG (rPPG) has been attracting increasing attention due to its potential in a wide range of application scenarios such as clinical monitoring, physical training and face presentation attack detection. On top of manually designed solutions, the deep-learning approach appears in rPPG estimation and achieves top-level performance. However, most of them try to integrate the steps of both preprocessing and ROI selection into an end-to-end network, which limits the generalization in other applications that use different input skin regions. The ROI selection model learned on face videos may not adapt to skin regions from other body parts with different appearance and size. In this paper, we leave the preprocessing apart and propose a lightweight rPPG estimation network—DeeprPPG for general use. DeeprPPG is based on spatiotemporal convolutions and can be used as a well-defined module in wider application scenarios with different types of input skin. To further boost the robustness, a spatiotemporal rPPG aggregation strategy is designed to adaptively aggregate rPPG signals from multiple skin regions into the final one. Extensive experiments are conducted and the results illustrate its robustness when facing unseen skin regions and unseen scenarios.

[1]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[2]  L. O. Svaasand,et al.  Remote plethysmographic imaging using ambient light. , 2008, Optics express.

[3]  Rosalind W. Picard,et al.  Non-contact, automated cardiac pulse measurements using video imaging and blind source separation , 2022 .

[4]  Tomasz Kocejko,et al.  Proceedings of the Federated Conference on Computer Science and Information Systems pp. 405–410 ISBN 978-83-60810-22-4 Measuring Pulse Rate with a Webcam – a Non-contact Method for Evaluating Cardiac Activity , 2022 .

[5]  Mohammad Soleymani,et al.  A Multimodal Database for Affect Recognition and Implicit Tagging , 2012, IEEE Transactions on Affective Computing.

[6]  Hagen Malberg,et al.  ROI Selection for Remote Photoplethysmography , 2013, Bildverarbeitung für die Medizin.

[7]  Peter Robinson,et al.  Constrained Local Neural Fields for Robust Facial Landmark Detection in the Wild , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[8]  Gerard de Haan,et al.  Robust Pulse Rate From Chrominance-Based rPPG , 2013, IEEE Transactions on Biomedical Engineering.

[9]  Andrew Zisserman,et al.  Two-Stream Convolutional Networks for Action Recognition in Videos , 2014, NIPS.

[10]  Horst-Michael Groß,et al.  Non-contact video-based pulse rate measurement on a mobile service robot , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[11]  Matti Pietikäinen,et al.  Remote Heart Rate Measurement from Face Videos under Realistic Situations , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[13]  Pavlo Molchanov,et al.  Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Matti Pietikäinen,et al.  Generalized face anti-spoofing by detecting pulse from face videos , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[15]  Sander Stuijk,et al.  A Novel Algorithm for Remote Photoplethysmography: Spatial Subspace Rotation , 2016, IEEE Transactions on Biomedical Engineering.

[16]  Nicu Sebe,et al.  Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  Sébastien Marcel,et al.  A Reproducible Study on Remote Heart Rate Measurement , 2017, ArXiv.

[18]  Sander Stuijk,et al.  Algorithmic Principles of Remote PPG , 2017, IEEE Transactions on Biomedical Engineering.

[19]  Daniel McDuff,et al.  The Impact of Video Compression on Remote Cardiac Pulse Measurement Using Imaging Photoplethysmography , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[20]  Ashok Veeraraghavan,et al.  PPGSecure: Biometric Presentation Attack Detection Using Photopletysmograms , 2017, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[21]  Jiri Matas,et al.  Visual Heart Rate Estimation with Convolutional Neural Network , 2018, BMVC.

[22]  Daniel McDuff,et al.  DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks , 2018, ECCV.

[23]  Yann LeCun,et al.  A Closer Look at Spatiotemporal Convolutions for Action Recognition , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[24]  Pong C. Yuen,et al.  Remote Photoplethysmography Correspondence Feature for 3D Mask Face Presentation Attack Detection , 2018, ECCV.

[25]  Shiguang Shan,et al.  SynRhythm: Learning a Deep Heart Rate Estimator from General to Specific , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[26]  Ayoub Al-Hamadi,et al.  Effects of Video Encoding on Camera-Based Heart Rate Estimation , 2019, IEEE Transactions on Biomedical Engineering.

[27]  Guoying Zhao,et al.  Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).