论文信息 - Computer Vision – ECCV 2014

Computer Vision – ECCV 2014

Heavy occlusions in cluttered scenes impose significant challenges to many computer vision applications. Recent light field imaging systems provide new see-through capabilities through synthetic aperture imaging (SAI) to overcome the occlusion problem. Existing synthetic aperture imaging methods, however, emulate focusing at a specific depth layer but is incapable of producing an all-in-focus see-through image. Alternative in-painting algorithms can generate visually plausible results but can not guarantee the correctness of the result. In this paper, we present a novel depth free all-in-focus SAI technique based on lightfield visibility analysis. Specifically, we partition the scene into multiple visibility layers to directly deal with layer-wise occlusion and apply an optimization framework to propagate the visibility information between multiple layers. On each layer, visibility and optimal focus depth estimation is formulated as a multiple label energy minimization problem. The energy integrates the visibility mask from previous layers, multi-view intensity consistency, and depth smoothness constraint. We compare our method with the state-of-the-art solutions. Extensive experimental results with qualitative and quantitative analysis demonstrate the effectiveness and superiority of our approach.

[1] Jean Ponce,et al. Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] Brendan J. Frey,et al. Learning appearance and transparency manifolds of occluded objects in layers , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[3] Jitendra Malik,et al. Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[4] Paul J. Besl,et al. A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Derek Hoiem,et al. Support Surface Prediction in Indoor Scenes , 2013, 2013 IEEE International Conference on Computer Vision.

[6] Anita Sellent,et al. Motion Field Estimation from Alternate Exposure Images , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Daniel Cremers,et al. Variational space-time motion segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[8] R. Horaud,et al. Surface feature detection and description with applications to mesh matching , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[9] David J. Kriegman,et al. Synthetic Aperture Tracking: Tracking through Occlusions , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[10] David J. Fleet,et al. A Layered Motion Representation with Occlusion and Compact Spatial Support , 2002, ECCV.

[11] Seungyong Lee,et al. Video deblurring for hand-held cameras using patch-based synthesis , 2012, ACM Trans. Graph..

[12] Katsushi Ikeuchi,et al. Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13] Jean Ponce,et al. Non-uniform Deblurring for Shaken Images , 2012, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14] Ze-Nian Li,et al. Review and Preview: Disocclusion by Inpainting for Image-Based Rendering , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[15] Andrea Thelen,et al. Improvements in Shape-From-Focus for Holographic Reconstructions With Regard to Focus Operators, Neighborhood-Size, and Height Value Interpolation , 2009, IEEE Transactions on Image Processing.

[16] Daniel Cremers,et al. A Coding-Cost Framework for Super-Resolution Motion Layer Decomposition , 2012, IEEE Transactions on Image Processing.

[17] Frédo Durand,et al. Unstructured Light Fields , 2012, Comput. Graph. Forum.

[18] Takeo Kanade,et al. Super-Resolution Optical Flow , 1999 .

[19] Brendan J. Frey,et al. Learning flexible sprites in video layers , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20] Ying Wu,et al. Removing partial blur in a single image , 2009, CVPR.

[21] Michael J. Black,et al. The Dense Estimation of Motion and Appearance in Layers , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[22] Q. M. Jonathan Wu,et al. 3D Shape from Focus and Depth Map Computation Using Steerable Filters , 2009, ICIAR.

[23] Richard Szeliski,et al. An integrated Bayesian approach to layer extraction from image sequences , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[24] Marc Levoy,et al. Using plane + parallax for calibrating dense camera arrays , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[25] Jared Glover,et al. Bingham procrustean alignment for object detection in clutter , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26] Guillermo Sapiro,et al. A Variational Framework for Simultaneous Motion Estimation and Restoration of Motion-Blurred Video , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[27] Marwan Torki,et al. RGBD object pose recognition using local-global multi-kernel regression , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[28] Soo-Won Kim,et al. Reduced Energy-Ratio Measure for Robust Autofocusing in Digital Camera , 2009, IEEE Signal Processing Letters.

[29] Wojciech Matusik,et al. Structure and motion from scene registration , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30] Andrew Y. Ng,et al. Convolutional-Recursive Deep Learning for 3D Object Classification , 2012, NIPS.

[31] Alfred M. Bruckstein,et al. Variational Approach for Joint Optic-Flow Computation and Video Restoration , 2005 .

[32] Dieter Fox,et al. Detection-based object labeling in 3D scenes , 2012, 2012 IEEE International Conference on Robotics and Automation.

[33] Jian Zhang,et al. Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors , 2013, 2013 IEEE International Conference on Computer Vision.

[34] Sanja Fidler,et al. Holistic Scene Understanding for 3D Object Detection with RGBD Cameras , 2013, 2013 IEEE International Conference on Computer Vision.

[35] Dieter Fox,et al. Object recognition with hierarchical kernel descriptors , 2011, CVPR 2011.

[36] William T. Freeman,et al. Removing camera shake from a single photograph , 2006, SIGGRAPH 2006.

[37] Andrew Blake,et al. Motion Deblurring and Super-resolution from an Image Sequence , 1996, ECCV.

[38] Stefano Soatto,et al. Dynamic Shape and Appearance Modeling Via Moving and Deforming Layers , 2005, EMMCVPR.

[39] Markus Vincze,et al. Ensemble of shape functions for 3D object classification , 2011, 2011 IEEE International Conference on Robotics and Biomimetics.

[40] Jianxiong Xiao,et al. A Linear Approach to Matching Cuboids in RGBD Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[41] X.C. He,et al. Motion estimation method for blurred videos and application of deblurring with spatially varying blur kernels , 2010, 5th International Conference on Computer Sciences and Convergence Information Technology.

[42] Michel Barlaud,et al. Two deterministic half-quadratic regularization algorithms for computed imaging , 1994, Proceedings of 1st International Conference on Image Processing.

[43] Marc Levoy,et al. Synthetic Aperture Focusing using a Shear-Warp Factorization of the Viewing Transform , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[44] Xiuwei Zhang,et al. A novel multi-object detection method in complex scene using synthetic aperture imaging , 2012, Pattern Recognit..

[45] Michael S. Brown,et al. Motion Regularization for Matting Motion Blurred Objects , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46] Silvio Savarese,et al. 3D Scene Understanding by Voxel-CRF , 2013, 2013 IEEE International Conference on Computer Vision.

[47] Siddhartha S. Srinivasa,et al. Object Recognition Robust to Imperfect Depth Data , 2012, ECCV Workshops.

[48] Steven M. Seitz,et al. Filter flow , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[49] Jitendra Malik,et al. Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[50] Hui Chen,et al. 3D free-form object recognition in range images using local surface patches , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[51] Dieter Fox,et al. Unsupervised Feature Learning for RGB-D Based Object Recognition , 2012, ISER.

[52] Andrew E. Johnson,et al. Spin-Images: A Representation for 3-D Surface Matching , 1997 .

[53] Marc Levoy,et al. Reconstructing Occluded Surfaces Using Synthetic Apertures: Stereo, Focus and Robust Measures , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[54] A. N. Rajagopalan,et al. Non-uniform Motion Deblurring for Bilayer Scenes , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[55] Daniel P. Huttenlocher,et al. Generating sharp panoramas from motion-blurred videos , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[56] Dale Schuurmans,et al. Maximum Margin Clustering , 2004, NIPS.

[57] Sang Hwa Lee,et al. Recovery of blurred video signals using iterative image restoration combined with motion estimation , 1997, Proceedings of International Conference on Image Processing.

[58] Ahmed M. Elgammal,et al. Joint Object and Pose Recognition Using Homeomorphic Manifold Analysis , 2013, AAAI.

[59] Andrew Zisserman,et al. Learning Layered Motion Segmentations of Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[60] Michael J. Black,et al. Layered segmentation and optical flow estimation over time , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[61] Martial Hebert,et al. 3-D scene analysis via sequenced predictions over points and regions , 2011, 2011 IEEE International Conference on Robotics and Automation.

[62] Anat Levin,et al. Blind Motion Deblurring Using Image Statistics , 2006, NIPS.

[63] Shree K. Nayar,et al. PiCam , 2013, ACM Trans. Graph..

[64] Martin D. Levine,et al. Recovering parametric geons from multiview range data , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[65] Irfan A. Essa,et al. Calibration-free rolling shutter removal , 2012, 2012 IEEE International Conference on Computational Photography (ICCP).

[66] Sunghyun Cho,et al. Fast motion deblurring , 2009, SIGGRAPH 2009.

[67] Pushmeet Kohli,et al. Unwrap mosaics: a new representation for video editing , 2008, SIGGRAPH 2008.

[68] Dieter Fox,et al. A Scalable Tree-Based Approach for Joint Object and Pose Recognition , 2011, AAAI.

[69] Yasuyuki Matsushita,et al. Motion detail preserving optical flow estimation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[70] Kun Zhou,et al. An interactive approach to semantic modeling of indoor scenes with an RGBD camera , 2012, ACM Trans. Graph..

[71] Rui Yu,et al. A New Hybrid Synthetic Aperture Imaging Model for Tracking and Seeing People Through Occlusion , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[72] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[73] Li Zhang,et al. Optical flow in the presence of spatially-varying motion blur , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[74] Chengtao Cai,et al. Motion deblurring from a single image , 2016, 2016 IEEE 20th International Conference on Computer Supported Cooperative Work in Design (CSCWD).

[75] Dieter Fox,et al. Sparse distance learning for object recognition combining RGB and depth information , 2011, 2011 IEEE International Conference on Robotics and Automation.

[76] Dieter Fox,et al. Depth kernel descriptors for object recognition , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[77] Michael J. Black,et al. A Fully-Connected Layered Model of Foreground and Background Flow , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[78] Wei Xiong,et al. Rotational Motion Deblurring of a Rigid Object from a Single Image , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[79] Vladimir G. Kim,et al. Shape-based recognition of 3D point clouds in urban environments , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[80] Hongbin Zha,et al. Segmentation and classification of range image from an intelligent vehicle in urban environment , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[81] Kostas Daniilidis,et al. Single image 3D object detection and pose estimation for grasping , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[82] Peyman Milanfar,et al. Removing Motion Blur With Space–Time Processing , 2011, IEEE Transactions on Image Processing.

[83] Bernt Schiele,et al. 3D object recognition from range images using local feature histograms , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[84] Federico Tombari,et al. Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[85] Roberto Cipolla,et al. Visual tracking in the presence of motion blur , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[86] Fei-Fei Li,et al. Object discovery in 3D scenes via shape analysis , 2013, 2013 IEEE International Conference on Robotics and Automation.

[87] William T. Freeman,et al. Analyzing spatially-varying blur , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[88] Edward H. Adelson,et al. Representing moving images with layers , 1994, IEEE Trans. Image Process..

[89] Yair Weiss,et al. Smoothness in layers: Motion segmentation using nonparametric mixture estimation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[90] Silvio Savarese,et al. Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[91] Worthy N. Martin,et al. Image Motion Estimation From Motion Smear-A New Computational Model , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[92] Vladimir Kolmogorov,et al. An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[93] Ying Wu,et al. Motion from blur , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[94] Tsuhan Chen,et al. 3D-Based Reasoning with Blocks, Support, and Stability , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[95] D. A. Fish,et al. Blind deconvolution by means of the Richardson-Lucy algorithm. , 1995 .

[96] Leonard McMillan,et al. Dynamically reparameterized light fields , 2000, SIGGRAPH.

[97] Marc Levoy,et al. High performance imaging using large camera arrays , 2005, SIGGRAPH 2005.

[98] Marcel Körtgen,et al. 3D Shape Matching with 3D Shape Contexts , 2003 .

[99] Yanning Zhang,et al. Synthetic aperture imaging using pixel labeling via energy minimization , 2013, Pattern Recognit..

[100] Takuma Yamaguchi,et al. Video Deblurring and Super-Resolution Technique for Multiple Moving Objects , 2010, ACCV.