论文信息 - Learning part-in-whole relation of 3D shapes for part-based 3D model retrieval

Learning part-in-whole relation of 3D shapes for part-based 3D model retrieval

Abstract Given a query that specifies partial 3D shape, a Part-based 3D Model Retrieval (P3DMR) system finds 3D shapes whose part or parts matches the query. An approach to P3DMR is to partition or segment whole models into sub-parts and performs query-part-to-target-parts matching. Whatever the definition of part, e.g., a rectangular volume in Euclidean space or a part segmented on a mesh manifold, the computation will be very costly. The part-whole matching must account for, for each 3D whole shape in a database, varying position, scale and orientation of the segmented sub parts. Another approach, in an attempt to make part-whole matching efficient, tries to approximate part-whole inclusion test with a single comparison between a pair of features, one representing the part-based query and the other representing the whole shape. Aggregation of local geometrical features of parts into a feature per whole 3D shape, e.g., via Bag-of-Features approach, is an example. This approach so far suffered from inaccuracy as the aggregation is not optimized for part-whole inclusion test of 3D shapes. This paper proposes a novel P3DMR algorithm called Part-Whole Relation Embedding network (PWRE-net) that effectively and efficiently performs part-whole inclusion test via learned embedding into a common feature space. Using deep neural network, the PWRE-net learns, from a large number of part-whole shape pairs, a common embedding of partial shapes and their associated whole shapes. For the training, training datasets containing part-whole shape pairs are created automatically from unlabeled 3D models. Experimental evaluation shows that PWRE-net outperforms existing algorithms both in terms of retrieval accuracy and efficiency.

Ryutarou Ohbuchi | Takahiko Furuya

[1] Ryutarou Ohbuchi,et al. Dense sampling and fast encoding for 3D model retrieval using bag-of-visual features , 2009, CIVR '09.

[2] Ryutarou Ohbuchi,et al. Accurate Aggregation of Local Features by using K-sparse Autoencoder for 3D Model Retrieval , 2016, ICMR.

[3] Paul Suetens,et al. SHREC '11 Track: Shape Retrieval on Non-rigid 3D Watertight Meshes , 2011, 3DOR@Eurographics.

[4] Marco Attene,et al. Thesaurus-based 3D Object Retrieval with Part-in-Whole Matching , 2010, International Journal of Computer Vision.

[5] Ralph R. Martin,et al. Partial Shape Queries for 3D Object Retrieval , 2016, 3DOR@Eurographics.

[6] Andrew E. Johnson,et al. Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[7] Subhransu Maji,et al. Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[8] Mohammed Bennamoun,et al. Rotational Projection Statistics for 3D Local Surface Description and Object Recognition , 2013, International Journal of Computer Vision.

[9] Petros Daras,et al. SHREC'09 Track: Querying with Partial Models , 2009, 3DOR@Eurographics.

[10] Yasuo Kuniyoshi,et al. Partial matching of real textured 3D objects using color cubic higher-order local auto-correlation features , 2010, The Visual Computer.

[11] Kun Zhou,et al. A Survey on Partial Retrieval of 3D Shapes , 2013, Journal of Computer Science and Technology.

[12] Bernard Chazelle,et al. Shape distributions , 2002, TOGS.

[13] William H. Press,et al. The Art of Scientific Computing Second Edition , 1998 .

[14] Rahul Sukthankar,et al. MatchNet: Unifying feature and metric learning for patch-based matching , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Leonidas J. Guibas,et al. ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.

[16] Pierre Vandergheynst,et al. Geodesic Convolutional Neural Networks on Riemannian Manifolds , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[17] Ryutarou Ohbuchi,et al. Deep Aggregation of Local 3D Geometric Features for 3D Model Retrieval , 2016, BMVC.

[18] Gabriela Csurka,et al. Visual categorization with bags of keypoints , 2002, eccv 2004.

[19] Yi Liu,et al. Shape Topics: A Compact Representation and New Algorithms for 3D Partial Shape Retrieval , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[20] Eric Wahl,et al. Surflet-pair-relation histograms: a statistical 3D-shape representation for rapid classification , 2003, Fourth International Conference on 3-D Digital Imaging and Modeling, 2003. 3DIM 2003. Proceedings..

[21] Marco Attene,et al. Part-in-whole 3D shape matching and docking , 2011, The Visual Computer.

[22] Ronan Sicre,et al. Particular object retrieval with integral max-pooling of CNN activations , 2015, ICLR.

[23] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[24] Jianxiong Xiao,et al. Sliding Shapes for 3D Object Detection in Depth Images , 2014, ECCV.

[25] Fang Wang,et al. Sketch-based 3D shape retrieval using Convolutional Neural Networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Daniel Cohen-Or,et al. Part Analogies in Sets of Objects , 2008, 3DOR@Eurographics.

[27] Yann LeCun,et al. Learning a similarity metric discriminatively, with application to face verification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28] Ryutarou Ohbuchi,et al. Randomized Sub-Volume Partitioning for Part-Based 3D Model Retrieval , 2015, 3DOR@Eurographics.

[29] Noel E. O'Connor,et al. Bags of Local Convolutional Features for Scalable Instance Search , 2016, ICMR.

[30] Nico Blodow,et al. Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[31] Li Fei-Fei,et al. ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[32] Thomas S. Huang,et al. Image Classification Using Super-Vector Coding of Local Image Descriptors , 2010, ECCV.

[33] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[34] Longin Jan Latecki,et al. GIFT: A Real-Time and Scalable 3D Shape Search Engine , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Ryutarou Ohbuchi,et al. Shape-similarity search of 3D models by using enhanced shape functions , 2005, Int. J. Comput. Appl. Technol..

[36] Terrence J. Sejnowski,et al. Edges are the Independent Components of Natural Scenes , 1996, NIPS.

[37] Thomas Mensink,et al. Improving the Fisher Kernel for Large-Scale Image Classification , 2010, ECCV.

[38] Marc Alexa,et al. How do humans sketch objects? , 2012, ACM Trans. Graph..

[39] Yoram Singer,et al. Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[40] Shin'ichi Satoh,et al. Faster R-CNN Features for Instance Search , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[41] Satyandra K. Gupta,et al. Retrieving Matching CAD Models by Using Partial 3D Point Clouds , 2007 .

[42] Michalis A. Savelonas,et al. Fisher Encoding of Adaptive Fast Persistent Feature Histograms for Partial Retrieval of 3D Pottery Objects , 2014, 3DOR@Eurographics.

[43] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[44] Yi Fang,et al. Heat Diffusion Long-Short Term Memory Learning for 3D Shape Analysis , 2016, ECCV.

[45] Leonidas J. Guibas,et al. Joint embeddings of shapes and images via CNN image purification , 2015, ACM Trans. Graph..

[46] Ryutarou Ohbuchi,et al. Diffusion-on-Manifold Aggregation of Local Features for Shape-based 3D Model Retrieval , 2015, ICMR.

[47] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[48] Yann LeCun,et al. Computing the stereo matching cost with a convolutional neural network , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).