View-Based Discriminative Probabilistic Modeling for 3D Object Retrieval and Recognition

In view-based 3D object retrieval and recognition, each object is described by multiple views. A central problem is how to estimate the distance between two objects. Most conventional methods integrate the distances of view pairs across two objects as an estimation of their distance. In this paper, we propose a discriminative probabilistic object modeling approach. It builds probabilistic models for each object based on the distribution of its views, and the distance between two objects is defined as the upper bound of the Kullback-Leibler divergence of the corresponding probabilistic models. 3D object retrieval and recognition is accomplished based on the distance measures. We first learn models for each object by the adaptation from a set of global models with a maximum likelihood principle. A further adaption step is then performed to enhance the discriminative ability of the models. We conduct experiments on the ETH 3D object dataset, the National Taiwan University 3D model dataset, and the Princeton Shape Benchmark. We compare our approach with different methods, and experimental results demonstrate the superiority of our approach.

[1]  Andrea Salgian,et al.  Using Multiple Patches for 3D Object Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Yue Gao,et al.  k-Partite graph reinforcement and its application in multimedia information retrieval , 2012, Inf. Sci..

[3]  Remco C. Veltkamp,et al.  A survey of content based 3D shape retrieval methods , 2004, Proceedings Shape Modeling Applications, 2004..

[4]  Petros Daras,et al.  A 3D Shape Retrieval Framework Supporting Multimodal Queries , 2010, International Journal of Computer Vision.

[5]  Qi Tian,et al.  Less is More: Efficient 3-D Object Retrieval With Query View Selection , 2011, IEEE Transactions on Multimedia.

[6]  Jun Qin,et al.  Content based 3D model retrieval: A survey , 2008, 2008 International Workshop on Content-Based Multimedia Indexing.

[7]  Shiri Gordon,et al.  An efficient image similarity measure based on approximations of KL-divergence between two gaussian mixtures , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[8]  Alberto Del Bimbo,et al.  Curvature Correlograms for Content Based Retrieval of 3D Objects , 2005, ICIAP.

[9]  Biing-Hwang Juang,et al.  A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process..

[10]  Ryutarou Ohbuchi,et al.  Salient local visual features for shape-based 3D model retrieval , 2008, 2008 IEEE International Conference on Shape Modeling and Applications.

[11]  Thomas A. Funkhouser,et al.  The Princeton Shape Benchmark , 2004, Proceedings Shape Modeling Applications, 2004..

[12]  Mohamed Daoudi,et al.  3D models retrieval by using characteristic views , 2002, Object recognition supported by user interaction for service robots.

[13]  Nuno Vasconcelos,et al.  On the efficient evaluation of probabilistic similarity functions for image retrieval , 2004, IEEE Transactions on Information Theory.

[14]  Mohamed Daoudi,et al.  A Bayesian 3-D Search Engine Using Adaptive Views Clustering , 2007, IEEE Transactions on Multimedia.

[15]  Igor Guskov,et al.  3D object recognition from range images using pyramid matching , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[16]  Thomas S. Huang,et al.  Face age estimation using patch-based hidden Markov model supervectors , 2008, 2008 19th International Conference on Pattern Recognition.

[17]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Whoi-Yul Kim,et al.  A region-based shape descriptor using Zernike moments , 2000, Signal Process. Image Commun..

[19]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[20]  Bernard Chazelle,et al.  Shape distributions , 2002, TOGS.

[21]  Alireza Khotanzad,et al.  Invariant Image Recognition by Zernike Moments , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Bingbing Ni,et al.  Assistive tagging: A survey of multimedia tagging with human-computer joint exploration , 2012, CSUR.

[23]  E. Nadaraya On Estimating Regression , 1964 .

[24]  Meng Wang,et al.  Unified Video Annotation via Multigraph Learning , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Florent Perronnin,et al.  A similarity measure between unordered vector sets with application to image categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Alberto Del Bimbo,et al.  Content-based retrieval of 3D models , 2006, TOMCCAP.

[27]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[28]  Chang-Hsing Lee,et al.  A new 3D model retrieval approach based on the elevation descriptor , 2007, Pattern Recognit..

[29]  Manuele Bicego,et al.  A Hidden Markov Model approach for appearance-based 3D object recognition , 2005, Pattern Recognit. Lett..

[30]  Yue Gao,et al.  3D model retrieval using weighted bipartite graph matching , 2011, Signal Process. Image Commun..

[31]  BENJAMIN BUSTOS,et al.  Feature-based similarity search in 3D object databases , 2005, CSUR.

[32]  Petros Daras,et al.  SHREC '11 Track: Generic Shape Retrieval , 2009, 3DOR@Eurographics.

[33]  Bernard F. Buxton,et al.  A Bayesian Approach to 3D Object Recognition Using Linear Combination of 2D Views , 2008, VISAPP.

[34]  Qionghai Dai,et al.  View-based 3D object retrieval and recognition using tangent subspace analysis , 2008, Electronic Imaging.

[35]  Qionghai Dai,et al.  Weighted Subspace Distance and Its Applications to Object Recognition and Retrieval With Image Sets , 2009, IEEE Signal Processing Letters.

[36]  Florent Perronnin,et al.  Modeling images as mixtures of reference images , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[37]  Bernt Schiele,et al.  Analyzing appearance and contour based methods for object categorization , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[38]  Xian-Sheng Hua,et al.  Towards a Relevant and Diverse Search of Social Images , 2010, IEEE Transactions on Multimedia.

[39]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Nuno Vasconcelos,et al.  The Kullback-Leibler Kernel as a Framework for Discriminant and Localized Representations for Visual Recognition , 2004, ECCV.

[41]  Dejan V. VraniC An improvement of rotation invariant 3D-shape based on functions on concentric spheres , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[42]  Ning He,et al.  Content-based similarity for 3D model retrieval and classification , 2009 .

[43]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[44]  Ming Ouhyoung,et al.  On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[45]  Nuno Vasconcelos,et al.  A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications , 2003, NIPS.

[46]  Chen Feng,et al.  Probabilistic 3D object recognition from 2D invariant view sequence based on similarity , 2007, Neurocomputing.

[47]  Shuicheng Yan,et al.  SIFT-Bag kernel for video event analysis , 2008, ACM Multimedia.

[48]  Yaming Wang,et al.  Robust Object Matching Using a Modified Version of the Hausdorff Measure , 2002, Int. J. Image Graph..

[49]  Meng Wang,et al.  Multimodal Graph-Based Reranking for Web Image Search , 2012, IEEE Transactions on Image Processing.

[50]  Nikolaos G. Bourbakis,et al.  3-D Object Recognition Using 2-D Views , 2008, IEEE Transactions on Image Processing.

[51]  In-So Kweon,et al.  Scalable representation for 3D object recognition using feature sharing and view clustering , 2008, Pattern Recognit..

[52]  Arnold W. M. Smeulders,et al.  The Amsterdam Library of Object Images , 2004, International Journal of Computer Vision.

[53]  Yuxin Peng,et al.  Clip-based similarity measure for query-dependent clip retrieval and video summarization , 2006, IEEE Trans. Circuits Syst. Video Technol..

[54]  Yue Gao,et al.  3-D Object Retrieval and Recognition With Hypergraph Analysis , 2012, IEEE Transactions on Image Processing.

[55]  Touradj Ebrahimi,et al.  MESH: measuring errors between surfaces using the Hausdorff distance , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[56]  Zhang Yao,et al.  Content-Based 3-D Model Retrieval: A Survey , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[57]  Marc Rioux,et al.  Nefertiti: A tool for 3-D shape databases management , 1999 .

[58]  Xian-Sheng Hua,et al.  A joint appearance-spatial distance for kernel-based image categorization , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.