DARWIN: Deformable Patient Avatar Representation With Deep Image Network

In this paper, we present a technical approach to robustly estimate a detailed patient body surface mesh under clothing cover from a single snapshot of a range sensor. Existing methods either lack detail in the estimated patient body model, fail to estimate the body model robustly under clothing cover, or lack sufficient evaluation on real patient datasets. In this work, we overcome these limitations by training deep convolutional networks on a real clinical dataset with large variation and augmentation. Our approach is validated with experiments conducted on 1063 human subjects from 3 different hospitals, with surface errors measured against ground truth from CT data.
