Basic 3D Solid Recognition in RGB-D Images

The paper deals with the problem of recognition of 3D objects for the purpose of their subsequent grasping and manipulation by a two-handed robot. We describe the idea of a general framework for object recognition rooted in the compositional model of the world. This approach threats complex objects as entities constructed of simpler, elementary ones, termed solids. In particular, we focus on recognition of two types of such solids: cuboids and generalized cones. We present details of their operation, starting from the low-level processing of RGB-D images and ending with the generation of hypotheses regarding the presence and parameters of those types of solids.

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  Maciej Stefanczyk,et al.  Multimodal Segmentation of Dense Depth Maps and Associated Color Information , 2012, ICCVG.

[3]  Radu Bogdan Rusu,et al.  3D is here: Point Cloud Library (PCL) , 2011, 2011 IEEE International Conference on Robotics and Automation.

[4]  Włodzimierz Kasprzak,et al.  A linguistic approach to 3-D object recognition , 1987, Comput. Graph..

[5]  Silvio Savarese,et al.  Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Adrian Kaehler,et al.  Learning OpenCV 3: Computer Vision in C++ with the OpenCV Library , 2016 .

[7]  H. Blum Biological shape and visual science (part I) , 1973 .

[8]  Yunde Jia Description and recognition of curved objects , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol. III. Conference C: Image, Speech and Signal Analysis,.

[9]  Martin A. Fischler,et al.  The Representation and Matching of Pictorial Structures , 1973, IEEE Transactions on Computers.

[10]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[12]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[13]  Franc Solina,et al.  Part-level object recognition using superquadrics , 2004, Comput. Vis. Image Underst..

[14]  Michael I. Miller,et al.  Pattern Theory: From Representation to Inference , 2007 .

[15]  H. Blum Biological shape and visual science. I. , 1973, Journal of theoretical biology.

[16]  Maciej Stefanczyk,et al.  3D Semantic Map Computation Based on Depth Map and Video Image , 2012, ICCVG.

[17]  Yi Yang,et al.  Articulated pose estimation with flexible mixtures-of-parts , 2011, CVPR 2011.

[18]  Gary R. Bradski,et al.  Learning OpenCV - computer vision with the OpenCV library: software that sees , 2008 .

[19]  Silvio Savarese,et al.  Articulated part-based model for joint object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[20]  Ryszard Tadeusiewicz,et al.  Computer Vision and Graphics , 2014, Lecture Notes in Computer Science.

[21]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.