3D model retrieval based on deep Autoencoder neural networks

The rapid growth of 3D model resources for 3D printing has created an urgent need for 3D model retrieval systems. Benefiting from the evolution of hardware devices, visualized 3D models can be easily rendered using a tablet computer or handheld mobile device. In this paper, we present a novel 3D model retrieval method involving view-based features and deep learning. Because 2D images are highly distinguishable, constructing a 3D model from multiple 2D views is one of the most common methods of 3D model retrieval. Normalization is typically challenging and time-consuming for view-based retrieval methods; however, this work utilized an unsupervised deep learning technique, called Autoencoder, to refine compact view-based features. Therefore, the proposed method is rotation-invariant, requiring only the normalization of the translation and the scale of the 3D models in the dataset. For robustness, we applied Fourier descriptors and Zernike moments to represent the 2D features. The experimental results testing our method on the online Princeton Shape Benchmark Dataset demonstrate more accurate retrieval performance than other existing methods.

[1]  Edward K. Wong,et al.  Deepshape: Deep learned shape descriptor for 3D shape matching and retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Kai-Lung Hua,et al.  Edge-Preserving Depth Map Upsampling by Joint Trilateral Filter , 2018, IEEE Transactions on Cybernetics.

[3]  Berthold K. P. Horn Extended Gaussian images , 1984, Proceedings of the IEEE.

[4]  Meng Wang,et al.  3D deep shape descriptor , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Thomas A. Funkhouser,et al.  The Princeton Shape Benchmark , 2004, Proceedings Shape Modeling Applications, 2004..

[6]  Edward K. Wong,et al.  DeepShape: Deep-Learned Shape Descriptor for 3D Shape Retrieval , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Dejan V. VraniC An improvement of rotation invariant 3D-shape based on functions on concentric spheres , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[8]  Ling Shao,et al.  Learning View-Model Joint Relevance for 3D Object Retrieval , 2015, IEEE Transactions on Image Processing.

[9]  Mei-Chen Yeh,et al.  Artist-based Classification via Deep Learning with Multi-scale Weighted Pooling , 2016, ACM Multimedia.

[10]  Wen-Huang Cheng,et al.  Computer-aided classification of lung nodules on computed tomography images via deep learning technique , 2015, OncoTargets and therapy.

[11]  Ming Ouhyoung,et al.  On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[12]  Wen-Huang Cheng,et al.  A Spatial-Pyramid Scene Categorization Algorithm based on Locality-aware Sparse Coding , 2016, 2016 IEEE Second International Conference on Multimedia Big Data (BigMM).

[13]  Francoise J. Preteux,et al.  3D-shape-based retrieval within the MPEG-7 framework , 2001, IS&T/SPIE Electronic Imaging.

[14]  Bernard Chazelle,et al.  Matching 3D models with shape distributions , 2001, Proceedings International Conference on Shape Modeling and Applications.

[15]  Hans-Peter Kriegel,et al.  Nearest Neighbor Classification in 3D Protein Databases , 1999, ISMB.

[16]  Yue Gao,et al.  3D model comparison using spatial structure circular descriptor , 2010, Pattern Recognit..

[17]  Petros Daras,et al.  A 3D Shape Retrieval Framework Supporting Multimodal Queries , 2010, International Journal of Computer Vision.

[18]  Longin Jan Latecki,et al.  3D Shape Matching via Two Layer Coding , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Dietmar Saupe,et al.  3D Model Retrieval with Spherical Harmonics and Moments , 2001, DAGM-Symposium.

[20]  Wen-Huang Cheng,et al.  Clothing genre classification by exploiting the style elements , 2012, ACM Multimedia.

[21]  Afzal Godil,et al.  Non-rigid 3D shape retrieval using Multidimensional Scaling and Bag-of-Features , 2010, 2010 IEEE International Conference on Image Processing.

[22]  Min-Chun Hu,et al.  Locality Constrained Sparse Representation for Cat Recognition , 2016, MMM.

[23]  Dengsheng Zhang,et al.  A comparative study on shape retrieval using Fourier descriptiors with different shape signatures , 2001 .

[24]  Ralph Roskies,et al.  Fourier Descriptors for Plane Closed Curves , 1972, IEEE Transactions on Computers.

[25]  Szymon Rusinkiewicz,et al.  Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors , 2003, Symposium on Geometry Processing.

[26]  Wen-Huang Cheng,et al.  A robust tracking algorithm for 3D hand gesture with rapid hand motion through deep learning , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[27]  Wen-Huang Cheng,et al.  An interactive 3D social media browsing system in a tech-art gallery , 2015, SIGGRAPH Asia Posters.

[28]  Shahriar B. Shokouhi,et al.  Classification of benign and malignant masses based on Zernike moments , 2011, Comput. Biol. Medicine.

[29]  Katsushi Ikeuchi,et al.  Determining 3-D object pose using the complex extended Gaussian image , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[30]  Yunhui Liu,et al.  3D model retrieval using Bag-of-View-Words , 2013, Multimedia Tools and Applications.