Estimating Heart Rate and Rhythm via 3D Motion Tracking in Depth Video

Low-cost depth sensors, such as Microsoft Kinect, have potential for noncontact health monitoring that is robust to ambient lighting conditions. However, captured depth images typically suffer from high acquisition noise, and hence, processing them to estimate biometrics is difficult. In this paper, we propose to capture depth video of a human subject using Kinect 2.0 to estimate his/her heart rate and rhythm; as blood is pumped from the heart to circulate through the head, tiny oscillatory head motion due to Newtonian mechanics can be detected for periodicity analysis. Specifically, we first restore a captured depth video via a joint bit-depth enhancement/denoising procedure, using a graph-signal smoothness prior for regularization. Second, we track an automatically detected head region throughout the depth video to deduce 3D motion vectors. The detected vectors are fed back to the depth restoration module in a loop to ensure that the motion information in two modules is consistent, improving performance of both restoration and motion tracking. Third, the computed 3D motion vectors are projected onto its principal component for 1D signal analysis, composed of trend removal, bandpass filtering, and wavelet-based motion denoising. Finally, the heart rate is estimated via Welch power spectrum analysis, and the heart rhythm is computed via peak detection. Experimental results show accurate estimation of the heart rate and rhythm using our proposed algorithm as compared to rate and rhythm estimated by a portable oximeter.

[1]  Xianming Liu,et al.  Random Walk Graph Laplacian-Based Smoothness Prior for Soft Decoding of JPEG Images , 2016, IEEE Transactions on Image Processing.

[2]  L. O. Svaasand,et al.  Remote plethysmographic imaging using ambient light. , 2008, Optics express.

[3]  Magdalena Madej,et al.  Measuring Pulse Rate with a Webcam , 2012 .

[4]  Oscar C. Au,et al.  Optimal graph laplacian regularization for natural image denoising , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Lorenzo Scalise,et al.  Non contact measurement of heart and respiration rates based on Kinect™ , 2014, 2014 IEEE International Symposium on Medical Measurements and Applications (MeMeA).

[6]  Ling Shao,et al.  Visual Tracking Using Strong Classifier and Structural Local Sparse Descriptors , 2015, IEEE Transactions on Multimedia.

[7]  Oscar C. Au,et al.  Rate-Constrained 3D Surface Estimation From Noise-Corrupted Multiview Depth Videos , 2014, IEEE Transactions on Image Processing.

[8]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[10]  Ming Yang,et al.  Vital sign estimation from passive thermal video , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Zhaohua Wu,et al.  On the trend, detrending, and variability of nonlinear and nonstationary time series , 2007, Proceedings of the National Academy of Sciences.

[12]  William H. Press,et al.  Numerical Recipes 3rd Edition: The Art of Scientific Computing , 2007 .

[13]  Aly A. Farag,et al.  A Fully Automatic Method to Extract the Heart Rate from Thermal Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[14]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[15]  David L. Donoho,et al.  De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[16]  Ling Shao,et al.  Enhanced Computer Vision With Microsoft Kinect Sensor: A Review , 2013, IEEE Transactions on Cybernetics.

[17]  Ming-Sui Lee,et al.  Multiparameter Sleep Monitoring Using a Depth Camera , 2012, BIOSTEC.

[18]  Matti Pietikäinen,et al.  Remote Heart Rate Measurement from Face Videos under Realistic Situations , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[20]  Arcangelo Merla,et al.  Thermal Infrared Imaging-Based Computational Psychophysiology for Psychometrics , 2015, Comput. Math. Methods Medicine.

[21]  Muhammad Murtaza Khan,et al.  A computationally efficient heart rate measurement system using video cameras , 2015, 2015 International Conference on Emerging Technologies (ICET).

[22]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[23]  Javier Ruiz Hidalgo,et al.  Real-Time Head and Hand Tracking Based on 2.5D Data , 2012 .

[24]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[25]  Jing Peng,et al.  SVM vs regularized least squares classification , 2004, ICPR 2004.

[26]  Andrew W. Fitzgibbon,et al.  Real-time human pose recognition in parts from single depth images , 2011, CVPR 2011.

[27]  Gene Cheung,et al.  Graph Laplacian Regularization for Image Denoising: Analysis in the Continuous Domain , 2016, IEEE Transactions on Image Processing.

[28]  Aly A. Farag,et al.  Thermal Imaging of the Superficial Temporal Artery: An Arterial Pulse Recovery Model , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[29]  Massimo Piccardi,et al.  Local Depth Patterns for Tracking in Depth Videos , 2015, ACM Multimedia.

[30]  I. Daubechies Ten Lectures on Wavelets , 1992 .

[31]  Leonhard Held,et al.  Gaussian Markov Random Fields: Theory and Applications , 2005 .

[32]  Thomas B. Moeslund,et al.  Improved pulse detection from head motions using DCT , 2014, 2014 International Conference on Computer Vision Theory and Applications (VISAPP).

[33]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[34]  Robert C. Bolles,et al.  Parametric Correspondence and Chamfer Matching: Two New Techniques for Image Matching , 1977, IJCAI.

[35]  Jason M. O'Kane,et al.  Collecting Heart Rate Using a High Precision, Non-contact, Single-Point Infrared Temperature Sensor , 2012, ICSR.

[36]  Wen-Jiin Tsai,et al.  Face-Based Heart Rate Signal Decomposition and Evaluation Using Multiple Linear Regression , 2016, IEEE Sensors Journal.

[37]  Gene Cheung,et al.  Graph-based Dequantization of Block-Compressed Piecewise Smooth Images , 2016, IEEE Signal Processing Letters.

[38]  Rama Chellappa,et al.  Fast directional chamfer matching , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[39]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Lan-Rong Dung,et al.  Measurement of heart rate variability using off-the-shelf smart phones , 2016, BioMedical Engineering OnLine.

[41]  Daniel McDuff,et al.  Advancements in Noncontact, Multiparameter Physiological Measurements Using a Webcam , 2011, IEEE Transactions on Biomedical Engineering.

[42]  T. Poggio,et al.  Regularized Least-Squares Classification 133 In practice , although , 2007 .

[43]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  H. Macher,et al.  FIRST EXPERIENCES WITH KINECT V2 SENSOR FOR CLOSE RANGE 3D MODELLING , 2015 .

[45]  Horst-Michael Groß,et al.  Non-contact video-based pulse rate measurement on a mobile service robot , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[46]  Weisi Lin,et al.  Visual Object Tracking by Structure Complexity Coefficients , 2015, IEEE Transactions on Multimedia.

[47]  Frédo Durand,et al.  Eulerian video magnification for revealing subtle changes in the world , 2012, ACM Trans. Graph..

[48]  Minghui Yu,et al.  Video-based heart rate measurement using head motion tracking and ICA , 2013, 2013 6th International Congress on Image and Signal Processing (CISP).

[49]  Gene Cheung,et al.  Estimating heart rate via depth video motion tracking , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[50]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[51]  Fatih Erden,et al.  Sensors in Assisted Living: A survey of signal and image processing methods , 2016, IEEE Signal Processing Magazine.

[52]  Byongmin Kang,et al.  Parametric model-based noise reduction for ToF depth sensors , 2012, Electronic Imaging.

[53]  Rui Caseiro,et al.  Exploiting the Circulant Structure of Tracking-by-Detection with Kernels , 2012, ECCV.

[54]  Lawrence W. Solomon Yale University School of Medicine Heart Book , 1992, The Yale Journal of Biology and Medicine.

[55]  Vladimir Stankovic,et al.  Sleep Apnea Detection via Depth Video and Audio Feature Learning , 2017, IEEE Transactions on Multimedia.

[56]  Andrew C. Singer,et al.  Non-contact heart rate detection via periodic signal detection methods , 2015, 2015 49th Asilomar Conference on Signals, Systems and Computers.

[57]  H. Emrah Tasli,et al.  Remote PPG based vital sign measurement using adaptive facial regions , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[58]  Steffen Leonhardt,et al.  Beat-to-beat heart rate estimation fusing multimodal video and sensor data. , 2015, Biomedical optics express.

[59]  Livio Pinto,et al.  Calibration of Kinect for Xbox One and Comparison between the Two Generations of Microsoft Sensors , 2015, Sensors.

[60]  Stéphane Mallat,et al.  A Wavelet Tour of Signal Processing - The Sparse Way, 3rd Edition , 2008 .

[61]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  James M. Keller,et al.  Histogram of Oriented Normal Vectors for Object Recognition with a Depth Sensor , 2012, ACCV.

[63]  Jake K. Aggarwal,et al.  Human detection using depth information by Kinect , 2011, CVPR 2011 WORKSHOPS.

[64]  Kwang Suk Park,et al.  Validation of heart rate extraction using video imaging on a built-in camera system of a smartphone , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[65]  A. Farag,et al.  Non-contact , Wavelet-based Measurement of Vital Signs using Thermal Imaging , 2005 .

[66]  Michael Felsberg,et al.  The Visual Object Tracking VOT2015 Challenge Results , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[67]  Matti Pietikäinen,et al.  Computer Vision Using Local Binary Patterns , 2011, Computational Imaging and Vision.

[68]  Gene Cheung,et al.  Sleep Apnea Detection via Depth Video & Audio Feature Learning , 2016 .

[69]  Jianxiong Xiao,et al.  Tracking Revisited Using RGBD Camera: Unified Benchmark and Baselines , 2013, 2013 IEEE International Conference on Computer Vision.

[70]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[71]  Athanasios V. Vasilakos,et al.  A Survey on Ambient Intelligence in Healthcare , 2013, Proceedings of the IEEE.

[72]  Zhengyou Zhang,et al.  Microsoft Kinect Sensor and Its Effect , 2012, IEEE Multim..

[73]  Gene Cheung,et al.  Graph-based depth video denoising and event detection for sleep monitoring , 2014, 2014 IEEE 16th International Workshop on Multimedia Signal Processing (MMSP).

[74]  Ming-Hsuan Yang,et al.  Object Tracking Benchmark , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[75]  Rosalind W. Picard,et al.  Non-contact, automated cardiac pulse measurements using video imaging and blind source separation , 2022 .

[76]  Frédo Durand,et al.  Detecting Pulse from Head Motions in Video , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[77]  Gene Cheung,et al.  Sleep monitoring via depth video compression & analysis , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[78]  Oscar C. Au,et al.  Redefining self-similarity in natural images for denoising using graph signal gradient , 2014, Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2014 Asia-Pacific.