Squeezing bag-of-features for scalable and semantic 3D model retrieval

We have previously proposed a multiple-view, densely-sampled, bag-of-visual features algorithm for shape-based 3D model retrieval [2]. The method achieved good retrieval performance for moderately sized benchmark datasets (−1,000 3D models), including both rigid and articulated 3D shapes. It is also much faster than the other methods having similar retrieval performance. However, the method does not exploit semantic knowledge. We want the retrieval results to reflect multiple (e.g., −100) semantic classes. Also, if applied to a larger database (e.g., −1M models), search through the database can be expensive due to its large feature vector size (e.g., 30k dimensions). This paper proposes a method to "squeeze" its length feature vector by projecting it onto a manifold that incorporates multiple semantic classes. Experimental evaluation has shown that retrieval performances equal or better than original feature can be achieved while reducing feature vector size, e.g., from 30k down to less than 100.

[1]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[2]  Karthik Ramani,et al.  SHape REtrieval contest 2008: CAD models , 2008, 2008 IEEE International Conference on Shape Modeling and Applications.

[3]  Karthik Ramani,et al.  Developing an engineering shape benchmark for CAD models , 2006, Comput. Aided Des..

[4]  Karthik Ramani,et al.  Three-dimensional shape searching: state-of-the-art review and future trends , 2005, Comput. Aided Des..

[5]  Remco C. Veltkamp,et al.  SHREC2006: 3D Shape Retrieval Contest , 2006 .

[6]  Akihiro Yamamoto,et al.  Learning semantic categories for 3D model retrieval , 2007, MIR '07.

[7]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[8]  Wei-Ying Ma,et al.  Learning an image manifold for retrieval , 2004, MULTIMEDIA '04.

[9]  R. Veltkamp,et al.  SHREC 2007 3 D Shape Retrieval Contest , 2006 .

[10]  Ryutarou Ohbuchi,et al.  Salient local visual features for shape-based 3D model retrieval , 2008, 2008 IEEE International Conference on Shape Modeling and Applications.

[11]  Cordelia Schmid,et al.  Packing bag-of-features , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[12]  Remco C. Veltkamp,et al.  A survey of content based 3D shape retrieval methods , 2004, Proceedings Shape Modeling Applications, 2004..

[13]  Thomas A. Funkhouser,et al.  The Princeton Shape Benchmark , 2004, Proceedings Shape Modeling Applications, 2004..

[14]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[15]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[16]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[17]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[18]  Ryutarou Ohbuchi,et al.  Dense sampling and fast encoding for 3D model retrieval using bag-of-visual features , 2009, CIVR '09.