Ask the OWL: Object detection constrained by a probabilistic ontological model

The problem of object detection in computer vision is a difficult and interesting problem which is far from being solved due in no small part to the challenges of perception. Nevertheless, by introducing top-down priors such as semantics, the problem of segmenting and detecting objects becomes tractable. This paper proposes such an approach by relying on the ontological relationships that make up parts of objects in order to enhance their detection. The proposed method processes the point cloud of a scene and clusters it into pools of potential objects. Hypotheses on the object identity are generated using geometric and customized ontological definitions to generate probabilistic models, which constitute the building blocks for the decision making process, whereas an object labeling scheme derived by minimizing an energy function is presented. Finally, objects are replaced by matching them to generic CAD models. To evaluate the proposed method, we run our experiments on three well-known datasets and compare with results in the literature, showing superiority of the proposed method over prior art.

[1]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Andrew Zisserman,et al.  MLESAC: A New Robust Estimator with Application to Estimating Image Geometry , 2000, Comput. Vis. Image Underst..

[3]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[4]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[5]  Joachim Hertzberg,et al.  6D SLAM—3D mapping outdoor environments , 2007, J. Field Robotics.

[6]  Joachim Hertzberg,et al.  A Toolkit for Automatic Generation of Polygonal Maps - Las Vegas Reconstruction , 2012, ROBOTIK.

[7]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[8]  Thorsten Joachims,et al.  Contextually Guided Semantic Labeling and Search for 3D Point Clouds , 2011, ArXiv.

[9]  Daniel Cremers,et al.  LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[10]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[11]  C. Griswold,et al.  Platonic Writings/Platonic Readings , 1988 .

[12]  Bhaskara Marthi,et al.  An object-based semantic world model for long-term change detection and semantic querying , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Michela Bertolotto,et al.  Octree-based region growing for point cloud segmentation , 2015 .

[14]  Thorsten Joachims,et al.  Semantic Labeling of 3D Point Clouds for Indoor Scenes , 2011, NIPS.

[15]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[16]  Michael G. Strintzis,et al.  Knowledge-assisted semantic video object detection , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Roland Siegwart,et al.  Fusion of IMU and Vision for Absolute Scale Estimation in Monocular SLAM , 2011, J. Intell. Robotic Syst..

[18]  Florentin Wörgötter,et al.  Voxel Cloud Connectivity Segmentation - Supervoxels for Point Clouds , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Joachim Hertzberg,et al.  An evaluation of open source surface reconstruction software for robotic applications , 2013, 2013 16th International Conference on Advanced Robotics (ICAR).

[20]  Paul H. J. Kelly,et al.  SLAM++: Simultaneous Localisation and Mapping at the Level of Objects , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Hugh Durrant-Whyte,et al.  Simultaneous localization and mapping (SLAM): part II , 2006 .

[22]  Anna Helena Reali Costa,et al.  Object-based Visual SLAM: How Object Identity Informs Geometry , 2008 .

[23]  Jitendra Malik,et al.  Region-Based Convolutional Networks for Accurate Object Detection and Segmentation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.