Supervised learning approach to remote heart rate estimation from facial videos

A supervised machine learning approach to remote video-based heart rate (HR) estimation is proposed. We demonstrate the possibility of training a discriminative statistical model to estimate the Blood Volume Pulse signal (BVP) from the human face using ambient light and any off-the-shelf webcam. The proposed algorithm is 120 times faster than state of the art approach and returns a confidence metric to evaluate the HR estimates plausibility. The algorithm was evaluated against the state-of-the-art on 120 minutes of face videos, the largest video-based heart rate evaluation to date. The evaluation results showed a 53% decrease in the Root Mean Squared Error (RMSE) compared to state-of-the-art.

[1]  Seungjin Choi,et al.  Independent Component Analysis , 2009, Handbook of Natural Computing.

[2]  Daniël Lakens,et al.  Using a Smartphone to Measure Heart Rate Changes during Relived Happiness and Anger , 2013, IEEE Transactions on Affective Computing.

[3]  Stéphane Cook,et al.  High heart rate: a cardiovascular risk factor? , 2006, European heart journal.

[4]  Daniel McDuff,et al.  Advancements in Noncontact, Multiparameter Physiological Measurements Using a Webcam , 2011, IEEE Transactions on Biomedical Engineering.

[5]  T. Chau,et al.  Comparison of blood volume pulse and skin conductance responses to mental and affective stimuli at different anatomical sites , 2011, Physiological measurement.

[6]  Jean-Claude Tardif,et al.  Resting heart rate in cardiovascular disease. , 2007, Journal of the American College of Cardiology.

[7]  Hong Yan,et al.  A Machine Learning Approach to Improve Contactless Heart Rate Monitoring Using a Webcam , 2014, IEEE Journal of Biomedical and Health Informatics.

[8]  E. Oja,et al.  Independent Component Analysis , 2001 .

[9]  Rosalind W. Picard,et al.  Non-contact, automated cardiac pulse measurements using video imaging and blind source separation , 2022 .

[10]  Frédo Durand,et al.  Detecting Pulse from Head Motions in Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Annie Lang Involuntary Attention and Physiological Arousal Evoked by Structural Features and Emotional Content in TV Commercials , 1990 .

[12]  Daniel McDuff,et al.  A medical mirror for non-contact health monitoring , 2011, SIGGRAPH '11.

[13]  L. O. Svaasand,et al.  Remote plethysmographic imaging using ambient light. , 2008, Optics express.

[14]  Erkki Oja,et al.  Independent Component Analysis Aapo Hyvärinen, Juha Karhunen, , 2004 .