Calorie Counter: RGB-Depth Visual Estimation of Energy Expenditure at Home

We present a new framework for vision-based estimation of calorific expenditure from RGB-D data - the first that is validated on physical gas exchange measurements and applied to daily living scenarios. Deriving a person’s energy expenditure from sensors is an important tool in tracking physical activity levels for health and lifestyle monitoring. Most existing methods use metabolic lookup tables (METs) for a manual estimate or systems with inertial sensors which ultimately require users to wear devices. In contrast, the proposed pose-invariant and individual-independent vision framework allows for a remote estimation of calorific expenditure. We introduce, and evaluate our approach on, a new dataset called SPHERE-calorie, for which visual estimates can be compared against simultaneously obtained, indirect calorimetry measures based on gas exchange. We conclude from our experiments that the proposed vision pipeline is suitable for home monitoring in a controlled environment, with calorific expenditure estimates above accuracy levels of commonly used manual estimations via METs. With the dataset released, our work establishes a baseline for future research for this little-explored area of computer vision.

[1]  J.K. Aggarwal,et al.  Human activity analysis , 2011, ACM Comput. Surv..

[2]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  B E Ainsworth,et al.  Compendium of physical activities: an update of activity codes and MET intensities. , 2000, Medicine and science in sports and exercise.

[4]  E. Ravussin,et al.  Determinants of 24-hour energy expenditure in man. Methods and results using a respiratory chamber. , 1986, The Journal of clinical investigation.

[5]  Thomas Mensink,et al.  Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[6]  Chao-Cheng Wu,et al.  Estimation of Calories Consumption for Aerobics Using Kinect Based Skeleton Tracking , 2015, 2015 IEEE International Conference on Systems, Man, and Cybernetics.

[7]  Du Tran,et al.  Human Activity Recognition with Metric Learning , 2008, ECCV.

[8]  Niall Twomey,et al.  Bridging e-Health and the Internet of Things: The SPHERE Project , 2015, IEEE Intelligent Systems.

[9]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[10]  Majid Mirmehdi,et al.  A comparative home activity monitoring study using visual and inertial sensors , 2015, 2015 17th International Conference on E-health Networking, Application & Services (HealthCom).

[11]  Larry H. Matthies,et al.  Pooled motion features for first-person videos , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Guodong Guo,et al.  A survey on still image based human action recognition , 2014, Pattern Recognit..

[13]  Majid Mirmehdi,et al.  A comparative study of pose representation and dynamics modelling for online motion quality assessment , 2016, Comput. Vis. Image Underst..

[14]  Ivan Laptev,et al.  On Space-Time Interest Points , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Majid Mirmehdi,et al.  A multi-modal sensor infrastructure for healthcare in a residential environment , 2015, 2015 IEEE International Conference on Communication Workshop (ICCW).

[16]  Frank I. Katch,et al.  Exercise Physiology: Energy, Nutrition, and Human Performance, 3rd Edition , 1991 .

[17]  Julien Penders,et al.  Estimating Energy Expenditure Using Body-Worn Accelerometers: A Comparison of Methods, Sensors Number and Positioning , 2015, IEEE Journal of Biomedical and Health Informatics.

[18]  Zicheng Liu,et al.  HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Nasser Kehtarnavaz,et al.  Improving Human Action Recognition Using Fusion of Depth Camera and Inertial Sensors , 2015, IEEE Transactions on Human-Machine Systems.

[20]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[22]  Jake K. Aggarwal,et al.  Human activity recognition from 3D data: A review , 2014, Pattern Recognit. Lett..

[23]  Frank Vahid,et al.  Estimating Daily Energy Expenditure from Video for Assistive Monitoring , 2013, 2013 IEEE International Conference on Healthcare Informatics.

[24]  Thomas G. Dietterich Machine Learning for Sequential Data: A Review , 2002, SSPR/SPR.

[25]  Marco La Cascia,et al.  3D skeleton-based human action classification: A survey , 2016, Pattern Recognit..

[26]  Matthias Egger,et al.  Domains of physical activity and all-cause mortality: systematic review and dose-response meta-analysis of cohort studies. , 2011, International journal of epidemiology.

[27]  Patricia A. Deuster,et al.  Exercise Physiology: Energy, Nutrition and Human Performance , 1991 .