Robust Remote Heart Rate Estimation from Face Utilizing Spatial-temporal Attention

In this work, we propose an end-to-end approach for robust remote heart rate (HR) measurement gleaned from facial videos. Specifically the approach is based on remote photoplethysmography (rPPG), which constitutes a pulse triggered perceivable chromatic variation, sensed in RGB-face videos. Consequently, rPPGs can be affected in less-constrained settings. To unpin the shortcoming, the proposed algorithm utilizes a spatio-temporal attention mechanism, which places focus on the salient features included in rPPG-signals. In addition, we propose an effective rPPG augmentation approach, generating multiple rPPG signals with varying HRs from a single face video. Experimental results on the public datasets VIPL-HR and MMSE-HR show that the proposed method outperforms state-of-the-art algorithms in remote HR estimation.

[1]  Shiguang Shan,et al.  Continuous heart rate measurement from face: A robust rPPG approach with distribution learning , 2017, 2017 IEEE International Joint Conference on Biometrics (IJCB).

[2]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[3]  Rosalind W. Picard,et al.  Non-contact, automated cardiac pulse measurements using video imaging and blind source separation , 2022 .

[4]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[5]  Yu Sun,et al.  Photoplethysmography Revisited: From Contact to Noncontact, From Point to Imaging , 2016, IEEE Transactions on Biomedical Engineering.

[6]  Sander Stuijk,et al.  Exploiting Spatial Redundancy of Image Sensor for Motion Robust rPPG , 2015, IEEE Transactions on Biomedical Engineering.

[7]  Sander Stuijk,et al.  Algorithmic Principles of Remote PPG , 2017, IEEE Transactions on Biomedical Engineering.

[8]  Ronald A. Rensink The Dynamic Representation of Scenes , 2000 .

[9]  Sander Stuijk,et al.  Robust heart rate from fitness videos , 2017, Physiological measurement.

[10]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11]  Douglas G. Altman,et al.  Measurement in Medicine: The Analysis of Method Comparison Studies , 1983 .

[12]  Nicu Sebe,et al.  Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yoshinori Kuno,et al.  Robust Heart Rate Measurement from Video Using Select Random Patches , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[14]  Matti Pietikäinen,et al.  Remote Heart Rate Measurement from Face Videos under Realistic Situations , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Westgate Road,et al.  Photoplethysmography and its application in clinical physiological measurement , 2007 .

[16]  Arindam Sikdar,et al.  Computer-Vision-Guided Human Pulse Rate Estimation: A Review , 2016, IEEE Reviews in Biomedical Engineering.

[17]  Xiaogang Wang,et al.  Residual Attention Network for Image Classification , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Shiguang Shan,et al.  VIPL-HR: A Multi-modal Database for Pulse Estimation from Less-constrained Face Video , 2018, ACCV.

[19]  Daniel McDuff,et al.  Advancements in Noncontact, Multiparameter Physiological Measurements Using a Webcam , 2011, IEEE Transactions on Biomedical Engineering.

[20]  Shiguang Shan,et al.  SynRhythm: Learning a Deep Heart Rate Estimator from General to Specific , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[21]  Geoffrey E. Hinton,et al.  Learning to combine foveal glimpses with a third-order Boltzmann machine , 2010, NIPS.

[22]  Gang Sun,et al.  Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[23]  L. O. Svaasand,et al.  Remote plethysmographic imaging using ambient light. , 2008, Optics express.

[24]  Xing Ji,et al.  CosFace: Large Margin Cosine Loss for Deep Face Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25]  Gerard de Haan,et al.  Robust Pulse Rate From Chrominance-Based rPPG , 2013, IEEE Transactions on Biomedical Engineering.

[26]  Yi Yang,et al.  Attention to Scale: Scale-Aware Semantic Image Segmentation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Gee-Sern Hsu,et al.  Deep learning with time-frequency representation for pulse estimation from facial videos , 2017, 2017 IEEE International Joint Conference on Biometrics (IJCB).

[28]  Mohammad Soleymani,et al.  A Multimodal Database for Affect Recognition and Implicit Tagging , 2012, IEEE Transactions on Affective Computing.

[29]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[30]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[31]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).