Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics

In this paper, we present an approach for scene understanding by reasoning physical stability of objects from point cloud. We utilize a simple observation that, by human design, objects in static scenes should be stable with respect to gravity. This assumption is applicable to all scene categories and poses useful constraints for the plausible interpretations (parses) in scene understanding. Our method consists of two major steps: 1) geometric reasoning: recovering solid 3D volumetric primitives from defective point cloud, and 2) physical reasoning: grouping the unstable primitives to physically stable objects by optimizing the stability and the scene prior. We propose to use a novel disconnectivity graph (DG) to represent the energy landscape and use a Swendsen-Wang Cut (MCMC) method for optimization. In experiments, we demonstrate that the algorithm achieves substantially better performance for i) object segmentation, ii) 3D volumetric recovery of the scene, and iii) better parsing result for scene understanding in comparison to state-of-the-art methods in both public dataset and our own new dataset.

[1]  Alexei A. Efros,et al.  Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics , 2010, ECCV.

[2]  Richard Szeliski,et al.  Manhattan-world stereo , 2009, CVPR.

[3]  A. Heuer Energy Landscapes. Applications to Clusters, Biomolecules and Glasses. By David J. Wales. , 2005 .

[4]  Katsushi Ikeuchi,et al.  Adaptively merging large-scale range data with reflectance properties , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Thomas A. Funkhouser,et al.  A benchmark for 3D mesh segmentation , 2009, ACM Trans. Graph..

[6]  Adrian Barbu,et al.  Generalizing Swendsen-Wang to sampling arbitrary posterior probabilities , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Song-Chun Zhu,et al.  Image Parsing via Stochastic Scene Grammar , 2011 .

[8]  Alexei A. Efros,et al.  From 3D scene geometry to human workspace , 2011, CVPR 2011.

[9]  Song-Chun Zhu,et al.  Image Parsing with Stochastic Scene Grammar , 2011, NIPS.

[10]  Andreas Birk,et al.  Fast plane detection and polygonalization in noisy 3D range images , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  David B. Cooper,et al.  The 3L Algorithm for Fitting Implicit Polynomial Curves and Surfaces to Data , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[13]  I. Biederman,et al.  Scene perception: Detecting and judging objects undergoing relational violations , 1982, Cognitive Psychology.

[14]  Takeo Kanade,et al.  Geometric reasoning for single image structure recovery , 2009, CVPR.

[15]  Katsushi Ikeuchi,et al.  An Adaptive and Stable Method for Fitting Implicit Polynomial Curves and Surfaces , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  David J. Kriegman,et al.  Let Them Fall Where They May: Capture Regions of Curved Objects and Polyhedra , 1997, Int. J. Robotics Res..

[17]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[18]  R. Podgornik Energy Landscapes: Applications to Clusters, Biomolecules and Glasses (Cambridge Molecular Science) , 2007 .

[19]  Marco Attene,et al.  Hierarchical mesh segmentation based on fitting primitives , 2006, The Visual Computer.

[20]  Takeo Kanade,et al.  Estimating Spatial Layout of Rooms using Volumetric Reasoning about Objects and Surfaces , 2010, NIPS.

[21]  Jessica B. Hamrick Internal physics models guide probabilistic judgments about object dynamics , 2011 .

[22]  David A. Forsyth,et al.  Thinking Inside the Box: Using Appearance Models and Context Based on Room Geometry , 2010, ECCV.

[23]  Jonathan T. Barron,et al.  A category-level 3-D object dataset: Putting the Kinect to work , 2011, ICCV Workshops.

[24]  H. Bülthoff,et al.  Perceived object stability is affected by the internal representation of gravity , 2010 .