Real-Time Enhancement of Dynamic Depth Videos with Non-Rigid Deformations

We propose a novel approach for enhancing depth videos containing non-rigidly deforming objects. Depth sensors are capable of capturing depth maps in real-time but suffer from high noise levels and low spatial resolutions. While solutions for reconstructing 3D details in static scenes, or scenes with rigid global motions have been recently proposed, handling unconstrained non-rigid deformations in relative complex scenes remains a challenge. Our solution consists in a recursive dynamic multi-frame super-resolution algorithm where the relative local 3D motions between consecutive frames are directly accounted for. We rely on the assumption that these 3D motions can be decoupled into lateral motions and radial displacements. This allows to perform a simple local per-pixel tracking where both depth measurements and deformations are dynamically optimized. The geometric smoothness is subsequently added using a multi-level $L_1$ minimization with a bilateral total variation regularization. The performance of this method is thoroughly evaluated on both real and synthetic data. As compared to alternative approaches, the results show a clear improvement in reconstruction accuracy and in robustness to noise, to relative large non-rigid deformations, and to topological changes. Moreover, the proposed approach, implemented on a CPU, is shown to be computationally efficient and working in real-time.

[1]  Björn E. Ottersten,et al.  Dynamic super resolution of depth sequences with non-rigid motions , 2013, 2013 IEEE International Conference on Image Processing.

[2]  Alfred M. Bruckstein,et al.  RGBD-fusion: Real-time high precision depth recovery , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Daniel Cremers,et al.  Stereoscopic Scene Flow Computation for 3D Motion Understanding , 2011, International Journal of Computer Vision.

[4]  Michael Elad,et al.  Superresolution restoration of an image sequence: adaptive filtering approach , 1999, IEEE Trans. Image Process..

[5]  Björn E. Ottersten,et al.  Kinect Deform: Enhanced 3D Reconstruction of Non-rigidly Deforming Objects , 2014, 2014 2nd International Conference on 3D Vision.

[6]  Björn E. Ottersten,et al.  Pixel weighted average strategy for depth sensor data fusion , 2010, 2010 IEEE International Conference on Image Processing.

[7]  Jihun Yu,et al.  Realtime facial animation with on-the-fly correctives , 2013, ACM Trans. Graph..

[8]  Ruigang Yang,et al.  Reliability Fusion of Time-of-Flight Depth and Stereo Geometry for High Quality Depth Maps , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Horst Bischof,et al.  Image Guided Depth Upsampling Using Anisotropic Total Generalized Variation , 2013, 2013 IEEE International Conference on Computer Vision.

[10]  Alberto Del Bimbo,et al.  Face Recognition by Super-Resolved 3D Models From Consumer Depth Cameras , 2014, IEEE Transactions on Information Forensics and Security.

[11]  Björn E. Ottersten,et al.  A new multi-lateral filter for real-time depth enhancement , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[12]  Hans-Peter Seidel,et al.  Lightweight binocular facial performance capture under uncontrolled lighting , 2012, ACM Trans. Graph..

[13]  Andrew W. Fitzgibbon,et al.  3D scanning deformable objects with a single RGBD sensor , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Dieter Fox,et al.  DynamicFusion: Reconstruction and tracking of non-rigid scenes in real-time , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Björn E. Ottersten,et al.  Real-time non-rigid multi-frame depth video super-resolution , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[16]  Marc Rioux,et al.  Direct Estimation of Range Flow on Deformable Shape From a Video Rate Range Camera , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  Joachim Hornegger,et al.  Standardization of intensity-values acquired by Time-of-Flight-cameras , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[18]  Didier Stricker,et al.  Algorithms for 3D Shape Scanning with a Depth Camera , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Björn E. Ottersten,et al.  Unified multi-lateral filter for real-time depth map enhancement , 2015, Image Vis. Comput..

[20]  Michal Irani,et al.  Super-resolution from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[21]  Ligang Liu,et al.  Scanning 3D Full Human Bodies Using Kinects , 2012, IEEE Transactions on Visualization and Computer Graphics.

[22]  Andrew W. Fitzgibbon,et al.  Real-time non-rigid reconstruction using an RGB-D camera , 2014, ACM Trans. Graph..

[23]  Michael Elad,et al.  Video-to-Video Dynamic Super-Resolution for Grayscale and Color Sequences , 2006, EURASIP J. Adv. Signal Process..

[24]  Alberto Del Bimbo,et al.  Superfaces: A Super-Resolution Model for 3D Faces , 2012, ECCV Workshops.

[25]  Douglas A. Gray,et al.  Modified kalman filtering for image super-resolution: Experimental convergence results , 2007, SIP.

[26]  Hongbin Zha,et al.  Similarity-Aware Patchwork Assembly for Depth Image Super-resolution , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[28]  Richard Bowden,et al.  Scene Particles: Unregularized Particle Based Scene Flow Estimation. , 2013, IEEE transactions on pattern analysis and machine intelligence.

[29]  Sebastian Thrun,et al.  LidarBoost: Depth superresolution for ToF 3D shape scanning , 2009, CVPR.

[30]  Hagen Spies,et al.  Evaluating the range flow motion constraint , 2002, Object recognition supported by user interaction for service robots.

[31]  Ruigang Yang,et al.  Spatial-Depth Super Resolution for Range Images , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Jean Ponce,et al.  Robust image filtering using joint static and dynamic guidance , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Björn E. Ottersten,et al.  Bilateral filter evaluation based on exponential kernels , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[34]  Michael Elad,et al.  Fast and robust multiframe super resolution , 2004, IEEE Transactions on Image Processing.

[35]  Michael Elad,et al.  On the bilateral filter and ways to improve it , 2002 .

[36]  Peyman Milanfar,et al.  A General Framework for Regularized, Similarity-Based Image Restoration , 2014, IEEE Transactions on Image Processing.

[37]  Kai-Kuang Ma,et al.  A new state-space approach for super-resolution image sequence reconstruction , 2005, IEEE International Conference on Image Processing 2005.

[38]  Gabriel J. Brostow,et al.  Patch Based Synthesis for Single Depth Image Super-Resolution , 2012, ECCV.

[39]  Eli Shechtman,et al.  PatchMatch: a randomized correspondence algorithm for structural image editing , 2009, ACM Trans. Graph..

[40]  Lifeng Sun,et al.  Joint Example-Based Depth Map Super-Resolution , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[41]  Björn E. Ottersten,et al.  Patch-based Statistical Performance Analysis of Upsampling for Precise Super-Resolution , 2015, VISAPP.

[42]  Peyman Milanfar,et al.  A Tour of Modern Image Filtering: New Insights and Methods, Both Practical and Theoretical , 2013, IEEE Signal Processing Magazine.

[43]  Björn E. Ottersten,et al.  Enhancement of dynamic depth scenes by upsampling for precise super-resolution (UP-SR) , 2016, Comput. Vis. Image Underst..

[44]  Jonathan T. Barron,et al.  3D self-portraits , 2013, ACM Trans. Graph..

[45]  Sebastian Thrun,et al.  An Application of Markov Random Fields to Range Sensing , 2005, NIPS.

[46]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[47]  Bernd Jähne,et al.  Range Flow Estimation , 2002, Comput. Vis. Image Underst..

[48]  Michael Elad,et al.  A General Iterative Regularization Framework For Image Denoising , 2006, 2006 40th Annual Conference on Information Sciences and Systems.

[49]  Björn E. Ottersten,et al.  Surface UP-SR for an improved face recognition using low resolution depth cameras , 2014, 2014 11th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[50]  Carsten Rother,et al.  Depth Super Resolution by Rigid Body Self-Similarity in 3D , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Horst Bischof,et al.  aTGV-SF: Dense Variational Scene Flow through Projective Warping and Higher Order Regularization , 2014, 2014 2nd International Conference on 3D Vision.

[52]  Janis Fehr,et al.  Computing Range Flow from Multi-modal Kinect Data , 2011, ISVC.

[53]  Shen Xu,et al.  A parameter-adaptive iterative regularization model for image denoising , 2012, EURASIP J. Adv. Signal Process..

[54]  Wotao Yin,et al.  An Iterative Regularization Method for Total Variation-Based Image Restoration , 2005, Multiscale Model. Simul..

[55]  Richard Bowden,et al.  Kinecting the dots: Particle based scene flow from depth sensors , 2011, 2011 International Conference on Computer Vision.

[56]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, SIGGRAPH 2007.

[57]  Bernd Jähne,et al.  Regularised Range Flow , 2000, ECCV.

[58]  Dieter Fox,et al.  RGB-D flow: Dense 3-D motion estimation using color and depth , 2013, 2013 IEEE International Conference on Robotics and Automation.

[59]  Björn E. Ottersten,et al.  Depth Super-Resolution by Enhanced Shift and Add , 2013, CAIP.