Asymmetric Supervised Deep Autoencoder for Depth Image based 3D Model Retrieval

In this paper, we propose a new asymmetric supervised deep autoencoder approach to retrieve 3D shapes based on depth images. The asymmetric supervised autoencoder is trained with real and synthetic depth images together. The novelty of this research lies in the asymmetric structure of a supervised deep autoencoder. The proposed asymmetric deep supervised autoencoder deals with the incompleteness and ambiguity present in the depth images by balancing reconstruction and classification capabilities in a unified way with mixed depth images. We investigate the relationship between the encoder layers and decoder layers, and claim that an asymmetric structure of a supervised deep autoencoder reduces the chance of overfitting by 8% and is capable of extracting more robust features with respect to the variance of input than that of a symmetric structure. The experimental results on the NYUD2 and ModelNet10 datasets demonstrate that the proposed supervised method outperforms the recent approaches for cross modal 3D model retrieval.

[1]  Guoliang Fan,et al.  Supervised Deep-Autoencoder for Depth Image-Based 3D Model Retrieval , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[2]  Nigel Bosch,et al.  Unsupervised Deep Autoencoders for Feature Extraction with Educational Data , 2017 .

[3]  Roger Levy,et al.  A new approach to cross-modal multimedia retrieval , 2010, ACM Multimedia.

[4]  Jeffrey Pennington,et al.  Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions , 2011, EMNLP.

[5]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Paolo Favaro,et al.  Self-Supervised Feature Learning by Learning to Spot Artifacts , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Jing Zhu,et al.  Learning Pairwise Neural Network Encoder for Depth Image-based 3D Model Retrieval , 2015, ACM Multimedia.

[8]  Zhang Yi,et al.  Learning a good representation with unsymmetrical auto-encoder , 2015, Neural Computing and Applications.

[9]  Jiaya Jia,et al.  Reducing boundary artifacts in image deconvolution , 2008, 2008 15th IEEE International Conference on Image Processing.

[10]  Thomas A. Funkhouser,et al.  The Princeton Shape Benchmark , 2004, Proceedings Shape Modeling Applications, 2004..

[11]  Kai-Lung Hua,et al.  3D model retrieval based on deep Autoencoder neural networks , 2017, 2017 International Conference on Signals and Systems (ICSigSys).

[12]  Eunho Yang,et al.  Deep Asymmetric Multi-task Feature Learning , 2017, ICML.

[13]  Song Bai,et al.  Deep learning representation using autoencoder for 3D shape retrieval , 2014, SPAC.

[14]  Angshul Majumdar,et al.  Asymmetric stacked autoencoder , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[15]  Li Liao,et al.  Deep Neural Networks with Parallel Autoencoders for Learning Pairwise Relations: Handwritten Digits Subtraction , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[16]  Jianxiong Xiao,et al.  Sliding Shapes for 3D Object Detection in Depth Images , 2014, ECCV.

[17]  Fang Wang,et al.  Sketch-based 3D shape retrieval using Convolutional Neural Networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[19]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.