Dynamic Time-of-Flight

Time-of-flight (TOF) depth cameras provide robust depth inference at low power requirements in a wide variety of consumer and industrial applications. These cameras reconstruct a single depth frame from a given set of infrared (IR) frames captured over a very short exposure period. Operating in this mode the camera essentially forgets all information previously captured - and performs depth inference from scratch for every frame. We challenge this practice and propose using previously captured information when inferring depth. An inherent problem we have to address is camera motion over this longer period of collecting observations. We derive a probabilistic framework combining a simple but robust model of camera and object motion, together with an observation model. This combination allows us to integrate information over multiple frames while remaining robust to rapid changes. Operating the camera in this manner has implications in terms of both computational efficiency and how information should be captured. We address these two issues and demonstrate a realtime TOF system with robust temporal integration that improves depth accuracy over strong baseline methods including adaptive spatio-temporal filters.

[1]  Mongi A. Abidi,et al.  Spatial and Temporal Enhancement of Depth Images Captured by a Time-of-Flight Depth Sensor , 2010, 2010 20th International Conference on Pattern Recognition.

[2]  Karen O. Egiazarian,et al.  Practical Poissonian-Gaussian Noise Modeling and Fitting for Single-Image Raw-Data , 2008, IEEE Transactions on Image Processing.

[3]  Andrew J. Davison,et al.  DTAM: Dense tracking and mapping in real-time , 2011, 2011 International Conference on Computer Vision.

[4]  Siem Jan Koopman,et al.  Time Series Analysis by State Space Methods , 2001 .

[5]  Dieter Fox,et al.  DynamicFusion: Reconstruction and tracking of non-rigid scenes in real-time , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Sebastian Nowozin,et al.  Bayesian Time-of-Flight for Realtime Shape, Illumination and Albedo , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Po-Hsun Cheng,et al.  Temporal and Spatial Denoising of Depth Maps , 2015, Sensors.

[8]  Jian Sun,et al.  Guided Image Filtering , 2010, ECCV.

[9]  Andrew W. Fitzgibbon,et al.  Model-Based Tracking at 300Hz Using Raw Time-of-Flight Observations , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[10]  Toby Sharp,et al.  Implementing Decision Trees and Forests on a GPU , 2008, ECCV.

[11]  Rahul Nair,et al.  Time of flight motion compensation revisited , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[12]  Ju Shen,et al.  Layer Depth Denoising and Completion for Structured-Light RGB-D Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Marc Levoy,et al.  Real-time 3D model acquisition , 2002, ACM Trans. Graph..

[14]  W. K. Hastings,et al.  Monte Carlo Sampling Methods Using Markov Chains and Their Applications , 1970 .

[15]  Jun S. Liu,et al.  Monte Carlo strategies in scientific computing , 2001 .

[16]  Yuval Peres,et al.  Decayed MCMC Filtering , 2012, UAI.

[17]  Alan Yuille,et al.  Active Vision , 2014, Computer Vision, A Reference Guide.

[18]  P. Laplace Memoir on the Probability of the Causes of Events , 1986 .

[19]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[20]  Luis Salgado,et al.  Efficient spatio-temporal hole filling strategy for Kinect depth maps , 2012, Electronic Imaging.

[21]  M. Degroot Optimal Statistical Decisions , 1970 .

[22]  A. Doucet,et al.  A Tutorial on Particle Filtering and Smoothing: Fifteen years later , 2008 .

[23]  Jun S. Liu,et al.  Metropolized independent sampling with comparisons to rejection sampling and importance sampling , 1996, Stat. Comput..

[24]  Dmitriy Vatolin,et al.  Temporal filtering for depth maps generated by Kinect depth camera , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[25]  Ling Shao,et al.  Recent advances and trends in visual tracking: A review , 2011, Neurocomputing.

[26]  Andrew D. Payne,et al.  A 0.13 μm CMOS System-on-Chip for a 512 × 424 Time-of-Flight Image Sensor With Multi-Frequency Photo-Demodulation up to 130 MHz and 2 GS/s ADC , 2015, IEEE Journal of Solid-State Circuits.

[27]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[28]  John W. Fisher,et al.  Information-Driven Adaptive Structured-Light Scanners , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Luc Van Gool,et al.  Fast 3D Scanning with Automatic Motion Compensation , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[31]  Simon J. Godsill,et al.  An Overview of Existing Methods and Recent Advances in Sequential Monte Carlo , 2007, Proceedings of the IEEE.

[32]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[33]  Yiannis Aloimonos,et al.  Active vision , 2004, International Journal of Computer Vision.

[34]  Mirko Schmidt,et al.  SRA: Fast Removal of General Multipath for ToF Sensors , 2014, ECCV.

[35]  Radu Horaud,et al.  Time-of-Flight Cameras: Principles, Methods and Applications , 2012 .

[36]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[37]  David Cohen,et al.  A fast global shutter image sensor based on the VOD mechanism , 2014, IEEE SENSORS 2014 Proceedings.

[38]  Andrew Gelman,et al.  Handbook of Markov Chain Monte Carlo , 2011 .

[39]  Yuval Peres,et al.  Decayed MCMC iltering , 2002, UAI 2002.

[40]  MOHIT GUPTA,et al.  Phasor Imaging , 2015, ACM Trans. Graph..