论文信息 - Shape-Based Instance Detection Under Arbitrary Viewpoint

Shape-Based Instance Detection Under Arbitrary Viewpoint

Shape-based instance detection under arbitrary viewpoint is a very challenging problem. Current approaches for handling viewpoint variation can be divided into two main categories: invariant and non-invariant. Invariant approaches explicitly represent the structural relationships of high-level, view-invariant shape primitives. Non-invariant approaches, on the other hand, create a template for each viewpoint of the object, and can operate directly on low-level features. We summarize the main advantages and disadvantages of invariant and non-invariant approaches, and conclude that non-invariant approaches are well-suited for capturing fine-grained details needed for specific object recognition while also being computationally efficient. Finally, we discuss approaches that are needed to address ambiguities introduced by recognizing shape under arbitrary viewpoint.

Martial Hebert | Edward Hsiao | M. Hebert | Edward Hsiao

[1] Shuicheng Yan,et al. An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2] Katsushi Ikeuchi,et al. Generating an interpretation tree from a CAD model for 3D-object recognition in bin-picking tasks , 1987, International Journal of Computer Vision.

[3] Alexei A. Efros,et al. Putting Objects in Perspective , 2006, CVPR.

[4] Vincent Lepetit,et al. Dominant orientation templates for real-time detection of texture-less objects , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[5] Anil K. Jain,et al. CAD-Based Computer Vision: From CAD Models to Relational Graphs , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[6] W. Eric L. Grimson,et al. Localizing Overlapping Parts by Searching the Interpretation Tree , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Nan Wang,et al. Who Blocks Who: Simultaneous clothing segmentation for grouping images , 2011, 2011 International Conference on Computer Vision.

[8] Jianbo Shi,et al. Many-to-one contour matching for describing and discriminating object shape , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9] Martial Hebert,et al. Occlusion reasoning for object detection under arbitrary viewpoint , 2012, CVPR.

[10] Jitendra Malik,et al. From contours to regions: An empirical evaluation , 2009, CVPR.

[11] Ben Taskar,et al. Object detection via boundary structure segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12] Mark R. Stevens,et al. Integrating Graphics and Vision for Object Recognition , 2000 .

[13] B. Schiele,et al. Combined Object Categorization and Segmentation With an Implicit Shape Model , 2004 .

[14] Marc Levoy,et al. Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[15] James J. Little,et al. Explicit Occlusion Reasoning for 3D Object Detection , 2011, BMVC.

[16] Ramakant Nevatia,et al. Part-Based 3D Descriptions of Complex Objects from a Single Image , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[17] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[18] Frédéric Jurie,et al. Groups of Adjacent Contour Segments for Object Detection , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[19] A. Witkin,et al. On the Role of Structure in Vision , 1983 .

[20] Terrance E. Boult,et al. Multi-attribute spaces: Calibration for attribute fusion and similarity search , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[21] David A. McAllester,et al. Object Detection with Grammar Models , 2011, NIPS.

[22] Luc Van Gool,et al. A Mean Field EM-algorithm for Coherent Occlusion Handling in MAP-Estimation Prob , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[23] Silvio Savarese,et al. Toward coherent object detection and scene layout understanding , 2011, Image Vis. Comput..

[24] Martial Hebert,et al. Beyond Local Appearance: Category Recognition from Pairwise Interactions of Simple Features , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25] Luc Van Gool,et al. Object Detection by Contour Segment Networks , 2006, ECCV.

[26] Jan J. Koenderink,et al. Solid shape , 1990 .

[27] Ian Reid,et al. fastHOG – a real-time GPU implementation of HOG , 2011 .

[28] Pietro Perona,et al. Pedestrian detection: A benchmark , 2009, CVPR.

[29] Zhuowen Tu,et al. Supervised Learning of Edges and Object Boundaries , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[30] Ramakant Nevatia,et al. Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[31] David G. Lowe,et al. Perceptual Organization and Visual Recognition , 2012 .

[32] Vincent Lepetit,et al. Gradient Response Maps for Real-Time Detection of Textureless Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Alexei A. Efros,et al. Ensemble of exemplar-SVMs for object detection and beyond , 2011, 2011 International Conference on Computer Vision.

[34] Navneet Dalal,et al. Finding People in Images and Videos , 2006 .

[35] T. Poggio,et al. A network that learns to recognize three-dimensional objects , 1990, Nature.

[36] Guillaume-Alexandre Bilodeau,et al. Generic modeling of 3D objects from single 2D images , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[37] Andriy Myronenko,et al. Point Set Registration: Coherent Point Drift , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38] Charles R. Dyer,et al. Visibility, occlusion, and the aspect graph , 1990, International Journal of Computer Vision.

[39] Alvaro Collet,et al. Making specific features less discriminative to improve point-based 3D object recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[40] Todd A. Cass,et al. Robust Affine Structure Matching for 3D Object Recognition , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[41] Shimon Ullman,et al. Recognizing solid objects by alignment with an image , 1990, International Journal of Computer Vision.

[42] David A. McAllester,et al. A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[43] I. Biederman. Recognizing depth-rotated objects: a review of recent research and theory. , 2000, Spatial vision.

[44] I. Biederman. Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[45] Sinisa Todorovic,et al. From contours to 3D object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.