论文信息 - 3D2PM - 3D Deformable Part Models

3D2PM - 3D Deformable Part Models

As objects are inherently 3-dimensional, they have been modeled in 3D in the early days of computer vision. Due to the ambiguities arising from mapping 2D features to 3D models, 2D feature-based models are the predominant paradigm in object recognition today. While such models have shown competitive bounding box (BB) detection performance, they are clearly limited in their capability of fine-grained reasoning in 3D or continuous viewpoint estimation as required for advanced tasks such as 3D scene understanding. This work extends the deformable part model [1] to a 3D object model. It consists of multiple parts modeled in 3D and a continuous appearance model. As a result, the model generalizes beyond BB oriented object detection and can be jointly optimized in a discriminative fashion for object detection and viewpoint estimation. Our 3D Deformable Part Model (3D2PM) leverages on CAD data of the object class, as a 3D geometry proxy.

[1] Silvio Savarese,et al. Learning a dense multi-view representation for detection, viewpoint classification and synthesis of object categories , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[2] Xiaofeng Ren,et al. Discriminative Mixture-of-Templates for Viewpoint Classification , 2010, ECCV.

[3] P. Fua,et al. Pose estimation for category specific multiview object localization , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[4] Luc Van Gool,et al. Towards Multi-View Object Class Detection , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[5] Silvio Savarese,et al. Semantic structure from motion , 2011, CVPR 2011.

[6] Thorsten Joachims,et al. Learning structural SVMs with latent variables , 2009, ICML '09.

[7] Peter V. Gehler,et al. Teaching 3D geometry to deformable part models , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Michael Goesele,et al. Back to the Future: Learning Shape Models from 3D CAD Data , 2010, BMVC.

[9] Cordelia Schmid,et al. Multi-view object class detection with a 3D geometric model , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10] Bill Triggs,et al. Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11] Ronen Basri,et al. Constructing implicit 3D shape models for pose estimation , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[12] Bernt Schiele,et al. Robust Object Detection with Interleaved Categorization and Segmentation , 2008, International Journal of Computer Vision.

[13] Luc Van Gool,et al. Branch&Rank: Non-Linear Object Detection , 2011, BMVC.

[14] Silvio Savarese,et al. Depth-Encoded Hough Voting for Joint Object Detection and Shape Recovery , 2010, ECCV.

[15] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[16] Andrew J. Davison,et al. Active Matching , 2008, ECCV.

[17] Mubarak Shah,et al. 3D Model based Object Class Detection in An Arbitrary View , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[18] Christoph H. Lampert,et al. Learning to Localize Objects with Structured Output Regression , 2008, ECCV.

[19] Dmitry B. Goldgof,et al. Function-based recognition from incomplete knowledge of shape , 1993 .

[20] Bernt Schiele,et al. Monocular 3D Scene Modeling and Inference: Understanding Multi-Object Traffic Scenes , 2010, ECCV.

[21] David A. McAllester,et al. Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] D. Marr,et al. Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[23] Bernt Schiele,et al. Revisiting 3D geometric models for accurate object shape and pose , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[24] Pietro Perona,et al. Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[25] Sinisa Todorovic,et al. From contours to 3D object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[26] David G. Lowe,et al. Three-Dimensional Object Recognition from Single Two-Dimensional Images , 1987, Artif. Intell..

[27] Ronen Basri,et al. Viewpoint-aware object detection and pose estimation , 2011, 2011 International Conference on Computer Vision.

[28] Thomas Deselaers,et al. ClassCut for Unsupervised Class Segmentation , 2010, ECCV.

[29] Rodney A. Brooks,et al. Symbolic Reasoning Among 3-D Models and 2-D Images , 1981, Artif. Intell..

[30] Alexei A. Efros,et al. Putting Objects in Perspective , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[31] Silvio Savarese,et al. 3D generic object categorization, localization and pose estimation , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[32] Alex Pentland,et al. Perceptual Organization and the Representation of Natural Form , 1986, Artif. Intell..

[33] Luc Van Gool,et al. Robust Multiperson Tracking from a Mobile Platform , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34] Silvio Savarese,et al. Deformable part models revisited: A performance evaluation for object category pose estimation , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[35] Kevin W. Bowyer,et al. Generic Recognition of Articulated Objects through Reasoning about Potential Function , 1995, Comput. Vis. Image Underst..