Computer Vision – ECCV 2014

Heavy occlusions in cluttered scenes impose significant challenges to many computer vision applications. Recent light field imaging systems provide new see-through capabilities through synthetic aperture imaging (SAI) to overcome the occlusion problem. Existing synthetic aperture imaging methods, however, emulate focusing at a specific depth layer but is incapable of producing an all-in-focus see-through image. Alternative in-painting algorithms can generate visually plausible results but can not guarantee the correctness of the result. In this paper, we present a novel depth free all-in-focus SAI technique based on lightfield visibility analysis. Specifically, we partition the scene into multiple visibility layers to directly deal with layer-wise occlusion and apply an optimization framework to propagate the visibility information between multiple layers. On each layer, visibility and optimal focus depth estimation is formulated as a multiple label energy minimization problem. The energy integrates the visibility mask from previous layers, multi-view intensity consistency, and depth smoothness constraint. We compare our method with the state-of-the-art solutions. Extensive experimental results with qualitative and quantitative analysis demonstrate the effectiveness and superiority of our approach.

[1]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Brendan J. Frey,et al.  Learning appearance and transparency manifolds of occluded objects in layers , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[3]  Jitendra Malik,et al.  Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Derek Hoiem,et al.  Support Surface Prediction in Indoor Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[6]  Anita Sellent,et al.  Motion Field Estimation from Alternate Exposure Images , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Daniel Cremers,et al.  Variational space-time motion segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[8]  R. Horaud,et al.  Surface feature detection and description with applications to mesh matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  David J. Kriegman,et al.  Synthetic Aperture Tracking: Tracking through Occlusions , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[10]  David J. Fleet,et al.  A Layered Motion Representation with Occlusion and Compact Spatial Support , 2002, ECCV.

[11]  Seungyong Lee,et al.  Video deblurring for hand-held cameras using patch-based synthesis , 2012, ACM Trans. Graph..

[12]  Katsushi Ikeuchi,et al.  Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Jean Ponce,et al.  Non-uniform Deblurring for Shaken Images , 2012, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Ze-Nian Li,et al.  Review and Preview: Disocclusion by Inpainting for Image-Based Rendering , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[15]  Andrea Thelen,et al.  Improvements in Shape-From-Focus for Holographic Reconstructions With Regard to Focus Operators, Neighborhood-Size, and Height Value Interpolation , 2009, IEEE Transactions on Image Processing.

[16]  Daniel Cremers,et al.  A Coding-Cost Framework for Super-Resolution Motion Layer Decomposition , 2012, IEEE Transactions on Image Processing.

[17]  Frédo Durand,et al.  Unstructured Light Fields , 2012, Comput. Graph. Forum.

[18]  Takeo Kanade,et al.  Super-Resolution Optical Flow , 1999 .

[19]  Brendan J. Frey,et al.  Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20]  Ying Wu,et al.  Removing partial blur in a single image , 2009, CVPR.

[21]  Michael J. Black,et al.  The Dense Estimation of Motion and Appearance in Layers , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[22]  Q. M. Jonathan Wu,et al.  3D Shape from Focus and Depth Map Computation Using Steerable Filters , 2009, ICIAR.

[23]  Richard Szeliski,et al.  An integrated Bayesian approach to layer extraction from image sequences , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[24]  Marc Levoy,et al.  Using plane + parallax for calibrating dense camera arrays , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[25]  Jared Glover,et al.  Bingham procrustean alignment for object detection in clutter , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Guillermo Sapiro,et al.  A Variational Framework for Simultaneous Motion Estimation and Restoration of Motion-Blurred Video , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[27]  Marwan Torki,et al.  RGBD object pose recognition using local-global multi-kernel regression , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[28]  Soo-Won Kim,et al.  Reduced Energy-Ratio Measure for Robust Autofocusing in Digital Camera , 2009, IEEE Signal Processing Letters.

[29]  Wojciech Matusik,et al.  Structure and motion from scene registration , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Andrew Y. Ng,et al.  Convolutional-Recursive Deep Learning for 3D Object Classification , 2012, NIPS.

[31]  Alfred M. Bruckstein,et al.  Variational Approach for Joint Optic-Flow Computation and Video Restoration , 2005 .

[32]  Dieter Fox,et al.  Detection-based object labeling in 3D scenes , 2012, 2012 IEEE International Conference on Robotics and Automation.

[33]  Jian Zhang,et al.  Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors , 2013, 2013 IEEE International Conference on Computer Vision.

[34]  Sanja Fidler,et al.  Holistic Scene Understanding for 3D Object Detection with RGBD Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[35]  Dieter Fox,et al.  Object recognition with hierarchical kernel descriptors , 2011, CVPR 2011.

[36]  William T. Freeman,et al.  Removing camera shake from a single photograph , 2006, SIGGRAPH 2006.

[37]  Andrew Blake,et al.  Motion Deblurring and Super-resolution from an Image Sequence , 1996, ECCV.

[38]  Stefano Soatto,et al.  Dynamic Shape and Appearance Modeling Via Moving and Deforming Layers , 2005, EMMCVPR.

[39]  Markus Vincze,et al.  Ensemble of shape functions for 3D object classification , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[40]  Jianxiong Xiao,et al.  A Linear Approach to Matching Cuboids in RGBD Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  X.C. He,et al.  Motion estimation method for blurred videos and application of deblurring with spatially varying blur kernels , 2010, 5th International Conference on Computer Sciences and Convergence Information Technology.

[42]  Michel Barlaud,et al.  Two deterministic half-quadratic regularization algorithms for computed imaging , 1994, Proceedings of 1st International Conference on Image Processing.

[43]  Marc Levoy,et al.  Synthetic Aperture Focusing using a Shear-Warp Factorization of the Viewing Transform , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[44]  Xiuwei Zhang,et al.  A novel multi-object detection method in complex scene using synthetic aperture imaging , 2012, Pattern Recognit..

[45]  Michael S. Brown,et al.  Motion Regularization for Matting Motion Blurred Objects , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Silvio Savarese,et al.  3D Scene Understanding by Voxel-CRF , 2013, 2013 IEEE International Conference on Computer Vision.

[47]  Siddhartha S. Srinivasa,et al.  Object Recognition Robust to Imperfect Depth Data , 2012, ECCV Workshops.

[48]  Steven M. Seitz,et al.  Filter flow , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[49]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[50]  Hui Chen,et al.  3D free-form object recognition in range images using local surface patches , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[51]  Dieter Fox,et al.  Unsupervised Feature Learning for RGB-D Based Object Recognition , 2012, ISER.

[52]  Andrew E. Johnson,et al.  Spin-Images: A Representation for 3-D Surface Matching , 1997 .

[53]  Marc Levoy,et al.  Reconstructing Occluded Surfaces Using Synthetic Apertures: Stereo, Focus and Robust Measures , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[54]  A. N. Rajagopalan,et al.  Non-uniform Motion Deblurring for Bilayer Scenes , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[55]  Daniel P. Huttenlocher,et al.  Generating sharp panoramas from motion-blurred videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[56]  Dale Schuurmans,et al.  Maximum Margin Clustering , 2004, NIPS.

[57]  Sang Hwa Lee,et al.  Recovery of blurred video signals using iterative image restoration combined with motion estimation , 1997, Proceedings of International Conference on Image Processing.

[58]  Ahmed M. Elgammal,et al.  Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis , 2013, AAAI.

[59]  Andrew Zisserman,et al.  Learning Layered Motion Segmentations of Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[60]  Michael J. Black,et al.  Layered segmentation and optical flow estimation over time , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[61]  Martial Hebert,et al.  3-D scene analysis via sequenced predictions over points and regions , 2011, 2011 IEEE International Conference on Robotics and Automation.

[62]  Anat Levin,et al.  Blind Motion Deblurring Using Image Statistics , 2006, NIPS.

[63]  Shree K. Nayar,et al.  PiCam , 2013, ACM Trans. Graph..

[64]  Martin D. Levine,et al.  Recovering parametric geons from multiview range data , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[65]  Irfan A. Essa,et al.  Calibration-free rolling shutter removal , 2012, 2012 IEEE International Conference on Computational Photography (ICCP).

[66]  Sunghyun Cho,et al.  Fast motion deblurring , 2009, SIGGRAPH 2009.

[67]  Pushmeet Kohli,et al.  Unwrap mosaics: a new representation for video editing , 2008, SIGGRAPH 2008.

[68]  Dieter Fox,et al.  A Scalable Tree-Based Approach for Joint Object and Pose Recognition , 2011, AAAI.

[69]  Yasuyuki Matsushita,et al.  Motion detail preserving optical flow estimation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[70]  Kun Zhou,et al.  An interactive approach to semantic modeling of indoor scenes with an RGBD camera , 2012, ACM Trans. Graph..

[71]  Rui Yu,et al.  A New Hybrid Synthetic Aperture Imaging Model for Tracking and Seeing People Through Occlusion , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[72]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[73]  Li Zhang,et al.  Optical flow in the presence of spatially-varying motion blur , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[74]  Chengtao Cai,et al.  Motion deblurring from a single image , 2016, 2016 IEEE 20th International Conference on Computer Supported Cooperative Work in Design (CSCWD).

[75]  Dieter Fox,et al.  Sparse distance learning for object recognition combining RGB and depth information , 2011, 2011 IEEE International Conference on Robotics and Automation.

[76]  Dieter Fox,et al.  Depth kernel descriptors for object recognition , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[77]  Michael J. Black,et al.  A Fully-Connected Layered Model of Foreground and Background Flow , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[78]  Wei Xiong,et al.  Rotational Motion Deblurring of a Rigid Object from a Single Image , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[79]  Vladimir G. Kim,et al.  Shape-based recognition of 3D point clouds in urban environments , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[80]  Hongbin Zha,et al.  Segmentation and classification of range image from an intelligent vehicle in urban environment , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[81]  Kostas Daniilidis,et al.  Single image 3D object detection and pose estimation for grasping , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[82]  Peyman Milanfar,et al.  Removing Motion Blur With Space–Time Processing , 2011, IEEE Transactions on Image Processing.

[83]  Bernt Schiele,et al.  3D object recognition from range images using local feature histograms , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[84]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[85]  Roberto Cipolla,et al.  Visual tracking in the presence of motion blur , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[86]  Fei-Fei Li,et al.  Object discovery in 3D scenes via shape analysis , 2013, 2013 IEEE International Conference on Robotics and Automation.

[87]  William T. Freeman,et al.  Analyzing spatially-varying blur , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[88]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[89]  Yair Weiss,et al.  Smoothness in layers: Motion segmentation using nonparametric mixture estimation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[90]  Silvio Savarese,et al.  Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[91]  Worthy N. Martin,et al.  Image Motion Estimation From Motion Smear-A New Computational Model , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[92]  Vladimir Kolmogorov,et al.  An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[93]  Ying Wu,et al.  Motion from blur , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[94]  Tsuhan Chen,et al.  3D-Based Reasoning with Blocks, Support, and Stability , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[95]  D. A. Fish,et al.  Blind deconvolution by means of the Richardson-Lucy algorithm. , 1995 .

[96]  Leonard McMillan,et al.  Dynamically reparameterized light fields , 2000, SIGGRAPH.

[97]  Marc Levoy,et al.  High performance imaging using large camera arrays , 2005, SIGGRAPH 2005.

[98]  Marcel Körtgen,et al.  3D Shape Matching with 3D Shape Contexts , 2003 .

[99]  Yanning Zhang,et al.  Synthetic aperture imaging using pixel labeling via energy minimization , 2013, Pattern Recognit..

[100]  Takuma Yamaguchi,et al.  Video Deblurring and Super-Resolution Technique for Multiple Moving Objects , 2010, ACCV.