A novel one-stage framework for visual pulse rate estimation using deep neural networks

Abstract Estimation of the visual pulse rate (also called heart rate) refers to extraction of the pulse rate from a facial video. With the studies on extracting photoplethysmography (PPG) signals from a facial video, the non-contacted measurement method has aroused great interest among researchers over the past few years. In this study, a novel one-stage spatio-temporal framework, namely PRnet, is proposed to estimate the pulse rate from a stationary facial video. First, visual pulse rate estimation is defined as a regression task based on deep neural networks, in which a video is mapped to a pulse rate value. Then, 3D convolutional neural networks (Conv3D) and Long short-term memory (LSTM) modules are used to extract spatial and latent temporal information that is hidden in a video. Subsequently, one fully connected layer is applied in the last layer of PRnet to estimate the pulse rate directly. Based on the exquisite framework design, our proposed method realizes competitive performance, especially in terms of processing latency, since it does not rely on power spectral density (PSD) and traditional Fast Fourier Transform (FFT) algorithms. Using our method, only 60 frames of video (2 s) are required for the robust prediction of the pulse rate, whereas 6–30 s of video are typically required for other methods. Finally, a novel visual pulse rate estimation database, which includes pulse rate range at various times of day, is collected to evaluate the proposed framework. The results of extensive experiments demonstrate that PRnet performs competitively while compared with state-of-the-art methods.

[1]  Jingang Shi,et al.  AutoHR: A Strong End-to-End Baseline for Remote Heart Rate Measurement With Neural Searching , 2020, IEEE Signal Processing Letters.

[2]  Yunbo Wang,et al.  Eidetic 3D LSTM: A Model for Video Prediction and Beyond , 2019, ICLR.

[3]  Gerard de Haan,et al.  Robust Pulse Rate From Chrominance-Based rPPG , 2013, IEEE Transactions on Biomedical Engineering.

[4]  H. V. van Geijn,et al.  Heart Rate Variability , 1993, Annals of Internal Medicine.

[5]  L. Køber,et al.  Resting, night-time, and 24 h heart rate as markers of cardiovascular risk in middle-aged and elderly men and women with no apparent heart disease. , 2013, European heart journal.

[6]  U. Rajendra Acharya,et al.  Heart rate variability: a review , 2006, Medical and Biological Engineering and Computing.

[7]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[8]  Kilian Q. Weinberger,et al.  Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Vladlen Koltun,et al.  An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[10]  Frédéric Bousefsaf,et al.  3D Convolutional Neural Networks for Remote Pulse Rate Measurement and Mapping from Facial Video , 2019, Applied Sciences.

[11]  Zhigang Liu,et al.  Contact Wire Irregularity Stochastics and Effect on High-Speed Railway Pantograph–Catenary Interactions , 2020, IEEE Transactions on Instrumentation and Measurement.

[12]  John Allen Photoplethysmography and its application in clinical physiological measurement , 2007, Physiological measurement.

[13]  Bing-Fei Wu,et al.  Neural Network Based Luminance Variation Resistant Remote-Photoplethysmography for Driver’s Heart Rate Monitoring , 2019, IEEE Access.

[14]  Sander Stuijk,et al.  Exploiting Spatial Redundancy of Image Sensor for Motion Robust rPPG , 2015, IEEE Transactions on Biomedical Engineering.

[15]  Muhammad Bin Altaf,et al.  An ECG Processor for the Detection of Eight Cardiac Arrhythmias with Minimum False Alarms , 2019, 2019 IEEE Biomedical Circuits and Systems Conference (BioCAS).

[16]  Jie Liu,et al.  Non-contact Heart Rate Monitoring by Combining Convolutional Neural Network Skin Detection and Remote Photoplethysmography via a Low-Cost Camera , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[17]  L. O. Svaasand,et al.  Remote plethysmographic imaging using ambient light. , 2008, Optics express.

[18]  Mohammad Soleymani,et al.  A Multimodal Database for Affect Recognition and Implicit Tagging , 2012, IEEE Transactions on Affective Computing.

[19]  Shiguang Shan,et al.  Robust Remote Heart Rate Estimation from Face Utilizing Spatial-temporal Attention , 2019, 2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019).

[20]  Daniel McDuff,et al.  DeepPhys: Video-Based Physiological Measurement Using Convolutional Attention Networks , 2018, ECCV.

[21]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[22]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[23]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[24]  Juan Arteaga-Falconi,et al.  Real-Time Contactless Heart Rate Estimation from Facial Video , 2018 .

[25]  Jiri Matas,et al.  Visual Heart Rate Estimation with Convolutional Neural Network , 2018, BMVC.

[26]  Yannick Benezeth,et al.  Unsupervised skin tissue segmentation for remote photoplethysmography , 2017, Pattern Recognit. Lett..

[27]  Vladimir Blazek,et al.  Local Group Invariance for Heart Rate Estimation from Face Videos in the Wild , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[28]  Chiou-Ting Hsu,et al.  Siamese-rPPG network: remote photoplethysmography signal estimation from face videos , 2020, SAC.

[29]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[30]  Shiguang Shan,et al.  SynRhythm: Learning a Deep Heart Rate Estimator from General to Specific , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[31]  Yoshinori Kuno,et al.  Robust Heart Rate Measurement from Video Using Select Random Patches , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32]  Daniel McDuff,et al.  iPhys: An Open Non-Contact Imaging-Based Physiological Measurement Toolbox , 2019, 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[33]  Olga Perepelkina,et al.  HeartTrack: Convolutional neural network for remote video-based heart rate monitoring , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[34]  Guoying Zhao,et al.  Remote Photoplethysmograph Signal Measurement from Facial Videos Using Spatio-Temporal Networks , 2019, BMVC.

[35]  Xiaodan Zhang,et al.  Spatio-Temporal Memory Attention for Image Captioning , 2020, IEEE Transactions on Image Processing.

[36]  Heng Tao Shen,et al.  Hierarchical LSTMs with Adaptive Attention for Visual Captioning , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Guoying Zhao,et al.  Remote Heart Rate Measurement From Highly Compressed Facial Videos: An End-to-End Deep Learning Solution With Video Enhancement , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[38]  Nicu Sebe,et al.  Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Robert Laganière,et al.  Scalable Kernel Correlation Filter with Sparse Feature Integration , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[40]  Xilin Chen,et al.  RhythmNet: End-to-End Heart Rate Estimation From Face via Spatial-Temporal Representation , 2019, IEEE Transactions on Image Processing.

[41]  Sander Stuijk,et al.  A Novel Algorithm for Remote Photoplethysmography: Spatial Subspace Rotation , 2016, IEEE Transactions on Biomedical Engineering.

[42]  Lorenzo Torresani,et al.  Learning Spatiotemporal Features with 3D Convolutional Networks , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[43]  Tim Morris,et al.  Adaptive skin segmentation via feature-based face detection , 2014, Photonics Europe.

[44]  Zhengguo Li,et al.  A Novel Framework for Remote Photoplethysmography Pulse Extraction on Compressed Videos , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[45]  Weihai Chen,et al.  Physiological Signal Preserving Video Compression for Remote Photoplethysmography , 2019, IEEE Sensors Journal.

[46]  Oscal T.-C. Chen,et al.  Real-time physiological and facial monitoring for safe driving , 2016, 2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC).

[47]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[48]  Muhammad Bin Altaf,et al.  A wearable long-term single-lead ECG processor for early detection of cardiac arrhythmia , 2018, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[49]  T. Pursche,et al.  Video-based heart rate measurement from human faces , 2012, 2012 IEEE International Conference on Consumer Electronics (ICCE).

[50]  Haneen Farah,et al.  Heart Rate Analysis for Human Factors: Development and Validation of an Open Source Toolkit for Noisy Naturalistic Heart Rate Data , 2018 .

[51]  Sander Stuijk,et al.  Algorithmic Principles of Remote PPG , 2017, IEEE Transactions on Biomedical Engineering.

[52]  Gee-Sern Hsu,et al.  Deep learning with time-frequency representation for pulse estimation from facial videos , 2017, 2017 IEEE International Joint Conference on Biometrics (IJCB).