论文信息 - Parsing IKEA Objects: Fine Pose Estimation

Parsing IKEA Objects: Fine Pose Estimation

We address the problem of localizing and estimating the fine-pose of objects in the image with exact 3D models. Our main focus is to unify contributions from the 1970s with recent advances in object detection: use local keypoint detectors to find candidate poses and score global alignment of each candidate pose to the image. Moreover, we also provide a new dataset containing fine-aligned objects with their exactly matched 3D models, and a set of models for widely used objects. We also evaluate our algorithm both on object detection and fine pose estimation, and show that our method outperforms state-of-the art algorithms.

[1] Jitendra Malik,et al. Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[2] Joseph L. Mundy,et al. Object Recognition in the Geometric Era: A Retrospective , 2006, Toward Category-Level Object Recognition.

[3] Song-Chun Zhu,et al. Image Parsing via Stochastic Scene Grammar , 2011 .

[4] Peter V. Gehler,et al. 3D2PM - 3D Deformable Part Models , 2012, ECCV.

[5] Cordelia Schmid,et al. 3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints , 2006, International Journal of Computer Vision.

[6] Dieter Fox,et al. RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[7] Matti Pietikäinen,et al. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[8] Jianxiong Xiao,et al. Localizing 3D cuboids in single-view images , 2012, NIPS.

[9] 智一吉田,et al. Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[10] Deva Ramanan,et al. Analyzing 3D Objects in Cluttered Images , 2012, NIPS.

[11] Luc Van Gool,et al. The Pascal Visual Object Classes Challenge: A Retrospective , 2014, International Journal of Computer Vision.

[12] Sven J. Dickinson,et al. 3D Object Detection and Viewpoint Estimation with a Deformable 3D Cuboid Model , 2012, NIPS.

[13] Song-Chun Zhu,et al. Image Parsing with Stochastic Scene Grammar , 2011, NIPS.

[14] Jitendra Malik,et al. Discriminative Decorrelation for Clustering and Classification , 2012, ECCV.

[15] Silvio Savarese,et al. Estimating the aspect layout of object categories , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[16] Martial Hebert,et al. Data-Driven Scene Understanding from 3D Models , 2012, BMVC.

[17] Pat Hanrahan,et al. Context-based search for 3D models , 2010, ACM Trans. Graph..

[18] Jianguo Zhang,et al. The PASCAL Visual Object Classes Challenge , 2006 .

[19] Joseph J. Lim,et al. Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[20] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[21] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[22] David G. Lowe,et al. Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..