Constructing an efficient and adaptive learning model for 3D object generation

Studying representation learning and generative modelling has been at the core of the 3D learning domain. By leveraging the generative adversarial networks and convolutional neural networks for point-cloud representations, we propose a novel framework, which can directly generate 3D objects represented by point clouds. The novelties of the proposed method are threefold. First, the generative adversarial networks are applied to 3D object generation in the point-cloud space, where the model learns object representation from point clouds independently. In this work, we propose a 3D spatial transformer network, and integrate it into a generation model, whose ability for extracting and reconstructing features for 3D objects can be improved. Second, a point-wise approach is developed to reduce the computational complexity of the proposed network. Third, an evaluation system is proposed to measure the performance of our model by employing various categories and methods, and the error, considered as the difference between synthesized objects and raw objects are quantitatively compared, is less than 2.8%. Extensive experiments on benchmark dataset show that this method has a strong ability to generate 3D objects in the point-cloud space, and the synthesized objects have slight differences with man-made 3D objects.

[1]  Ceren Guzel Turhan,et al.  Class-aware single image to 3D object translational autoencoder , 2020, IET Image Process..

[2]  Haibin Lv,et al.  BIM Big Data Storage in WebVRGIS , 2020, IEEE Transactions on Industrial Informatics.

[3]  Bin Li,et al.  3D-ReConstnet: A Single-View 3D-Object Point Cloud Reconstruction Network , 2020, IEEE Access.

[4]  Dan Song,et al.  Multi-View Hierarchical Fusion Network for 3D Object Retrieval and Classification , 2019, IEEE Access.

[5]  Mun-Taek Choi,et al.  Trilateral convolutional neural network for 3D shape reconstruction of objects from a single depth view , 2019, IET Image Process..

[6]  Young Min Kim,et al.  RL-GAN-Net: A Reinforcement Learning Agent Controlled GAN Network for Real-Time Point Cloud Shape Completion , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Yan Zhou,et al.  3D shape classification and retrieval based on polar view , 2019, Inf. Sci..

[8]  Yang Zhang,et al.  Point Cloud GAN , 2018, DGS@ICLR.

[9]  Jianru Xue,et al.  Building discriminative CNN image representations for object retrieval using the replicator equation , 2018, Pattern Recognit..

[10]  Abdul Nurunnabi,et al.  Robust statistical approaches for circle fitting in laser scanning three-dimensional point cloud data , 2018, Pattern Recognit..

[11]  Hui Wei,et al.  Understanding of indoor scenes based on projection of spatial rectangles , 2018, Pattern Recognit..

[12]  Ioannis A. Kakadiaris,et al.  Monocular 3D facial shape reconstruction from a single 2D image with coupled-dictionary learning and sparse coding , 2018, Pattern Recognit..

[13]  Rehab F. Abdel-Kader,et al.  Plane detection in 3D point cloud using octree-balanced density down-sampling and iterative adaptive plane extraction , 2018, IET Image Process..

[14]  Junsong Yuan,et al.  Multi-view Harmonized Bilinear Network for 3D Object Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[15]  Michael I. Jordan,et al.  Information Constraints on Auto-Encoding Variational Bayes , 2018, NeurIPS.

[16]  Tao Zhang,et al.  BRoPH: An efficient and compact binary descriptor for 3D point clouds , 2018, Pattern Recognit..

[17]  Dong Tian,et al.  FoldingNet: Point Cloud Auto-Encoder via Deep Grid Deformation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[18]  Ioannis Pratikakis,et al.  Ensemble of PANORAMA-based convolutional neural networks for 3D model classification and retrieval , 2017, Comput. Graph..

[19]  Leonidas J. Guibas,et al.  Learning Representations and Generative Models for 3D Point Clouds , 2017, ICML.

[20]  Ting Liu,et al.  Recent advances in convolutional neural networks , 2015, Pattern Recognit..

[21]  Ludovico Minto,et al.  Deep Learning for 3D Shape Classification based on Volumetric Density and Surface Approximation Clues , 2018, VISIGRAPP.

[22]  Songyang Lao,et al.  Two-view underwater 3D reconstruction for cameras with unknown poses under flat refractive interfaces , 2017, Pattern Recognit..

[23]  Bo Yang,et al.  3D Object Reconstruction from a Single Depth View with Adversarial Learning , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[24]  Gene Cheung,et al.  Estimating Heart Rate and Rhythm via 3D Motion Tracking in Depth Video , 2017, IEEE Transactions on Multimedia.

[25]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[26]  Nadir Farah,et al.  Localisation of topological features using 3D object representations , 2017, IET Image Process..

[27]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[28]  Vaibhava Goel,et al.  McGan: Mean and Covariance Feature Matching GAN , 2017, ICML.

[29]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Ping Tan,et al.  An example-based approach to 3D man-made object reconstruction from line drawings , 2016, Pattern Recognit..

[31]  Jiajun Wu,et al.  Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling , 2016, NIPS.

[32]  Leonidas J. Guibas,et al.  FPNN: Field Probing Neural Networks for 3D Data , 2016, NIPS.

[33]  Oliver Grau,et al.  VConv-DAE: Deep Volumetric Shape Learning Without Object Labels , 2016, ECCV Workshops.

[34]  Leonidas J. Guibas,et al.  Volumetric and Multi-view CNNs for Object Classification on 3D Data , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Abhinav Gupta,et al.  Learning a Predictable and Generative Vector Representation for Objects , 2016, ECCV.

[36]  Zhihan Lv,et al.  Virtual Reality Smart City Based on WebVRGIS , 2016, IEEE Internet of Things Journal.

[37]  Jinxing Hu,et al.  Managing Big City Information Based on WebVRGIS , 2016, IEEE Access.

[38]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[39]  Zhichao Zhou,et al.  DeepPano: Deep Panoramic Representation for 3-D Shape Recognition , 2015, IEEE Signal Processing Letters.

[40]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[41]  Evangelos Kalogerakis,et al.  Eurographics Symposium on Geometry Processing 2015 Analysis and Synthesis of 3d Shape Families via Deep-learned Generative Models of Surfaces , 2022 .

[42]  Andrew Zisserman,et al.  Spatial Transformer Networks , 2015, NIPS.

[43]  Leonidas J. Guibas,et al.  Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[44]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[45]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Chong Wang,et al.  Superpixel-Based Hand Gesture Recognition With Kinect Depth Camera , 2015, IEEE Transactions on Multimedia.

[47]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[48]  Siddhartha Chaudhuri,et al.  A probabilistic model for component-based shape synthesis , 2012, ACM Trans. Graph..