A Novel Vision-based Approach for Dietary Assessment using Deep Learning View Synthesis

Dietary assessment system has proven as an effective tool to evaluate the eating behavior of patients suffering from diabetes and obesity. To assess the dietary intake, the traditional method is to carry out a 24-hour dietary recall (24HR), a structured interview aimed at capturing information on food items and portion size consumed by participants. However, unconscious biases are developed easily due to individual's subjective perception in this self-reporting technique which may lead to inaccuracy. Thus, this paper proposed a novel vision-based approach for estimating the volume of food items based on deep learning view synthesis and depth sensing techniques. In this paper, a point completion network is applied to perform 3D reconstruction of food items using a single depth image captured from any convenient viewing angle. Compared to previous approaches, the proposed method has addressed several key challenges in vision-based dietary assessment, such as view occlusion and scale ambiguity. Experiments have been carried out to examine this approach and showed the feasibility of the algorithm in accurate estimation of food volume.

[1]  Zhen Li,et al.  An exploratory study on a chest-worn computer for evaluation of diet, physical activity and lifestyle. , 2015, Journal of healthcare engineering.

[2]  Edward J. Delp,et al.  A comparison of food portion size estimation using geometric models and depth images , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[3]  Benny P. L. Lo,et al.  Food volume estimation for quantifying dietary intake with a wearable camera , 2018, 2018 IEEE 15th International Conference on Wearable and Implantable Body Sensor Networks (BSN).

[4]  David B. Haytowitz,et al.  USDA Database for the Flavonoid Content of Selected Foods Release 3.2 , 2015 .

[5]  David S. Ebert,et al.  The Use of Mobile Devices in Aiding Dietary Assessment and Evaluation , 2010, IEEE Journal of Selected Topics in Signal Processing.

[6]  Yingnan Sun,et al.  Food Volume Estimation Based on Deep Learning View Synthesis from a Single Depth Map , 2018, Nutrients.

[7]  Martial Hebert,et al.  PCN: Point Completion Network , 2018, 2018 International Conference on 3D Vision (3DV).

[8]  Asako Kanezaki,et al.  Unsupervised Image Segmentation by Backpropagation , 2018, 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  P. Abbeel,et al.  Yale-CMU-Berkeley dataset for robotic manipulation research , 2017, Int. J. Robotics Res..