论文信息 - Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era

Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era

3D reconstruction is a longstanding ill-posed problem, which has been explored for decades by the computer vision, computer graphics, and machine learning communities. Since 2015, image-based 3D reconstruction using convolutional neural networks (CNN) has attracted increasing interest and demonstrated an impressive performance. Given this new era of rapid evolution, this article provides a comprehensive survey of the recent developments in this field. We focus on the works which use deep learning techniques to estimate the 3D shape of generic objects either from a single or multiple RGB images. We organize the literature based on the shape representations, the network architectures, and the training mechanisms they use. While this survey is intended for methods which reconstruct generic objects, we also review some of the recent works which focus on specific object classes such as human body shapes and faces. We provide an analysis and comparison of the performance of some key papers, summarize some of the open problems in this field, and discuss promising directions for future research.

[1] Anders P. Eriksson,et al. Compact Model Representation for 3D Reconstruction , 2017, 2017 International Conference on 3D Vision (3DV).

[2] Pieter Peers,et al. Synthesizing 3D Shapes From Silhouette Image Collections Using Multi-Projection Generative Adversarial Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Karthik Ramani,et al. SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Ian D. Reid,et al. Optimizable Object Reconstruction from a Single View , 2018, ArXiv.

[5] Leonidas J. Guibas,et al. Render for CNN: Viewpoint Estimation in Images Using CNNs Trained with Rendered 3D Model Views , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[6] Hao Zhang,et al. Learning Implicit Fields for Generative Shape Modeling , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Trevor Darrell,et al. Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Marcus A. Magnor,et al. Tex2Shape: Detailed Full Human Body Geometry From a Single Image , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[9] Alexey Dosovitskiy,et al. Unsupervised Learning of Shape and Pose with Differentiable Point Clouds , 2018, NeurIPS.

[10] Jiajun Wu,et al. MarrNet: 3D Shape Reconstruction via 2.5D Sketches , 2017, NIPS.

[11] Max Jaderberg,et al. Unsupervised Learning of 3D Structure from Images , 2016, NIPS.

[12] Jiajun Wu,et al. Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling , 2016, NIPS.

[13] Hamid Laga,et al. A Survey on Deep Learning Architectures for Image-based Depth Reconstruction , 2019, ArXiv.

[14] Jitendra Malik,et al. Learning Category-Specific Deformable 3D Models for Object Reconstruction , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15] Pietro Perona,et al. Caltech-UCSD Birds 200 , 2010 .

[16] R. Venkatesh Babu,et al. 3D-PSRNet: Part Segmented 3D Point Cloud Reconstruction From a Single Image , 2018, ECCV Workshops.

[17] Alla Sheffer,et al. Fundamentals of spherical parameterization for 3D meshes , 2003, ACM Trans. Graph..

[18] Patrick Pérez,et al. MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[19] Abhinav Gupta,et al. Learning a Predictable and Generative Vector Representation for Objects , 2016, ECCV.

[20] Jitendra Malik,et al. Factoring Shape, Pose, and Layout from the 2D Image of a 3D Scene , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[21] Daniel Cohen-Or,et al. Pix2Vex: Image-to-Geometry Reconstruction using a Smooth Differentiable Renderer , 2019, ArXiv.

[22] Matan Sela,et al. 3D Face Reconstruction by Learning from Synthetic Data , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[23] Xiaojuan Qi,et al. GAL: Geometric Adversarial Loss for Single-View 3D-Object Reconstruction , 2018, ECCV.

[24] William E. Lorensen,et al. Marching cubes: A high resolution 3D surface construction algorithm , 1987, SIGGRAPH.

[25] Chongyang Ma,et al. Deep Volumetric Video From Very Sparse Multi-view Performance Capture , 2018, ECCV.

[26] Antonio Torralba,et al. Parsing IKEA Objects: Fine Pose Estimation , 2013, 2013 IEEE International Conference on Computer Vision.

[27] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Jitendra Malik,et al. End-to-End Recovery of Human Shape and Pose , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29] Andreas Geiger,et al. Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[30] A. Laurentini,et al. The Visual Hull Concept for Silhouette-Based Image Understanding , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[31] Michael J. Black,et al. SMPL: A Skinned Multi-Person Linear Model , 2023 .

[32] Anders P. Eriksson,et al. Image2Mesh: A Learning Framework for Single Image 3D Reconstruction , 2017, ACCV.

[33] Silvio Savarese,et al. 3D Semantic Parsing of Large-Scale Indoor Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Michael J. Black,et al. OpenDR: An Approximate Differentiable Renderer , 2014, ECCV.

[35] Jitendra Malik,et al. Hierarchical Surface Prediction , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36] Subhransu Maji,et al. 3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks , 2017, 2017 International Conference on 3D Vision (3DV).

[37] Frédéric Maire,et al. Learning Free-Form Deformations for 3D Object Reconstruction , 2018, ACCV.

[38] Gregory D. Hager,et al. Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39] Thomas Brox,et al. Multi-view 3D Models from Single Images with a Convolutional Network , 2015, ECCV.

[40] Honglak Lee,et al. Perspective Transformer Nets: Learning Single-View 3D Object Reconstruction without 3D Supervision , 2016, NIPS.

[41] Silvio Savarese,et al. 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction , 2016, ECCV.

[42] Yue Wang,et al. PointGrow: Autoregressively Learned Point Cloud Generation with Self-Attention , 2018, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[43] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[44] Subhransu Maji,et al. 3D Shape Induction from 2D Views of Multiple Objects , 2016, 2017 International Conference on 3D Vision (3DV).

[45] Andrew Zisserman,et al. SilNet : Single- and Multi-View Reconstruction by Learning from Silhouettes , 2017, BMVC.

[46] Jiajun Wu,et al. Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[47] Silvio Savarese,et al. DeformNet: Free-Form Deformation Network for 3D Shape Reconstruction from a Single Image , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[48] Thomas A. Funkhouser,et al. Semantic Scene Completion from a Single Depth Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[49] Bernhard Egger,et al. Morphable Face Models - An Open Framework , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[50] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[51] Lourdes Agapito,et al. Reconstructing PASCAL VOC , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[52] Hamid Laga,et al. Landmark‐Guided Elastic Shape Analysis of Spherically‐Parameterized Surfaces , 2013, Comput. Graph. Forum.

[53] Peter V. Gehler,et al. Keep It SMPL: Automatic Estimation of 3D Human Pose and Shape from a Single Image , 2016, ECCV.

[54] Alexei A. Efros,et al. 3D Sketching using Multi-View Deep Volumetric Prediction , 2017, PACMCGIT.

[55] Jonathan Krause,et al. 3D Object Representations for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[56] Michael M. Kazhdan,et al. Screened poisson surface reconstruction , 2013, TOGS.

[57] Jitendra Malik,et al. Multi-view Consistency as Supervisory Signal for Learning Shape and Pose Prediction , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[58] Peter V. Gehler,et al. DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59] Richard A. Newcombe,et al. DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[60] Robert B. Fisher,et al. 3D Shape Analysis: Fundamentals, Theory, and Applications , 2019 .

[61] Mathieu Aubry,et al. AtlasNet: A Papier-M\^ach\'e Approach to Learning 3D Surface Generation , 2018, CVPR 2018.

[62] Matthias Nießner,et al. ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[63] Matthew Turk,et al. A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[64] C. Lee Giles,et al. Learning a Hierarchical Latent-Variable Model of 3D Shapes , 2017, 2018 International Conference on 3D Vision (3DV).

[65] Wei Liu,et al. Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images , 2018, ECCV.

[66] Chao Yang,et al. Shape Inpainting Using 3D Generative Adversarial Network and Recurrent Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[67] Alexander M. Bronstein,et al. Deformable Shape Completion with Graph Convolutional Autoencoders , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[68] William T. Freeman,et al. Unsupervised Training for 3D Morphable Model Regression , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[69] Jun Li,et al. Im2Struct: Recovering 3D Shape Structure from a Single RGB Image , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[70] Fabio Remondino,et al. Image-to-Voxel Model Translation with Conditional Adversarial Networks , 2018, ECCV Workshops.

[71] John S. Zelek,et al. Point Cloud Completion of Foot Shape from a Single Depth Map for Fit Matching Using Deep Learning View Synthesis , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[72] Sebastian Nowozin,et al. Occupancy Networks: Learning 3D Reconstruction in Function Space , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[73] Silvio Savarese,et al. Weakly Supervised Generative Adversarial Networks for 3D Reconstruction , 2017, ArXiv.

[74] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.

[75] Hugues Hoppe,et al. Spherical parametrization and remeshing , 2003, ACM Trans. Graph..

[76] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.

[77] Vittorio Ferrari,et al. Learning to Generate and Reconstruct 3D Meshes with only 2D Supervision , 2018, BMVC.

[78] Gustavo Carneiro,et al. Scaling CNNs for High Resolution Volumetric Reconstruction from a Single Image , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[79] Jitendra Malik,et al. Learning Category-Specific Mesh Reconstruction from Image Collections , 2018, ECCV.

[80] R. Venkatesh Babu,et al. Dense 3D Point Cloud Reconstruction Using a Deep Pyramid Network , 2019, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[81] P. Hu,et al. Method for registration of 3D shapes without overlap for known 3D priors , 2021, Electronics Letters.

[82] Paul J. Besl,et al. A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[83] Jiajun Wu,et al. Learning Shape Priors for Single-View 3D Completion and Reconstruction , 2018, ECCV.

[84] Stefan Roth,et al. Matryoshka Networks: Predicting 3D Geometry via Nested Shape Layers , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[85] Gérard G. Medioni,et al. Object modelling by registration of multiple range images , 1992, Image Vis. Comput..

[86] Marcus A. Magnor,et al. Learning to Reconstruct People in Clothing From a Single RGB Camera , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[87] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[88] Hamid Laga,et al. The Shape Space of 3D Botanical Tree Models , 2018, ACM Trans. Graph..

[89] Markus H. Gross,et al. HS-Nets: Estimating Human Body Shape from Silhouettes with Convolutional Neural Networks , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[90] Ron Kimmel,et al. Unrestricted Facial Geometry Reconstruction Using Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[91] Leonidas J. Guibas,et al. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[92] David Meger,et al. Improved Adversarial Systems for 3D Object Generation and Reconstruction , 2017, CoRL.

[93] Thomas Brox,et al. Octree Generating Networks: Efficient Convolutional Architectures for High-resolution 3D Outputs , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[94] Vincent Lepetit,et al. Geometry-Aware Network for Non-rigid Shape Prediction from a Single View , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[95] Luc Van Gool,et al. Learning Where to Classify in Multi-view Semantic Segmentation , 2014, ECCV.

[96] Leonidas J. Guibas,et al. ObjectNet3D: A Large Scale Database for 3D Object Recognition , 2016, ECCV.

[97] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[98] Peter V. Gehler,et al. Neural Body Fitting: Unifying Deep Learning and Model Based Human Pose and Shape Estimation , 2018, 2018 International Conference on 3D Vision (3DV).

[99] Hamid Laga,et al. Numerical Inversion of SRNF Maps for Elastic Shape Analysis of Genus-Zero Surfaces , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[100] Bo Yang,et al. Dense 3D Object Reconstruction from a Single Depth View , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[101] Ersin Yumer,et al. 3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[102] Subhransu Maji,et al. Multiresolution Tree Networks for 3D Point Cloud Processing , 2018, ECCV.

[103] David Meger,et al. Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation , 2018, NeurIPS.

[104] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[105] Georgios Tzimiropoulos,et al. Large Pose 3D Face Reconstruction from a Single Image via Direct Volumetric CNN Regression , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[106] Jonathan Masci,et al. Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[107] Matthias Nießner,et al. Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[108] Marcel van Gerven,et al. Deep disentangled representations for volumetric reconstruction , 2016, ECCV Workshops.

[109] Cordelia Schmid,et al. BodyNet: Volumetric Inference of 3D Human Body Shapes , 2018, ECCV.

[110] Hamid Laga,et al. A Survey on Non-rigid 3D Shape Analysis , 2018, ArXiv.

[111] Leonidas J. Guibas,et al. ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.

[112] Chad DeChant,et al. Shape completion enabled robotic grasping , 2016, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[113] Shi-Min Hu,et al. Learning to Reconstruct High-Quality 3D Shapes with Cascaded Fully Convolutional Networks , 2018, ECCV.

[114] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[115] Jan-Michael Frahm,et al. Structure-from-Motion Revisited , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[116] Hamid Izadinia,et al. IM2CAD , 2016, 1608.05137.

[117] Ming Cai,et al. Single-view Object Shape Reconstruction Using Deep Shape Prior and Silhouette , 2018, BMVC.

[118] Jitendra Malik,et al. Multi-view Supervision for Single-View Reconstruction via Differentiable Ray Consistency , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[119] Silvio Savarese,et al. Beyond PASCAL: A benchmark for 3D object detection in the wild , 2014, IEEE Winter Conference on Applications of Computer Vision.

[120] Jiajun Wu,et al. Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[121] Leonidas J. Guibas,et al. A concise and provably informative multi-scale signature based on heat diffusion , 2009 .

[122] Simon Lucey,et al. Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction from a Single Image , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[123] Mohammed Bennamoun,et al. RGB-D image-based Object Detection: from Traditional Methods to Deep Learning Techniques , 2019, RGB-D Image Analysis and Processing.

[124] Ian D. Reid,et al. Efficient Dense Point Cloud Object Reconstruction Using Deformation Vector Fields , 2018, ECCV.

[125] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[126] Jiajun Wu,et al. Learning to Reconstruct Shapes from Unseen Classes , 2018, NeurIPS.

[127] R. Venkatesh Babu,et al. 3D-LMNet: Latent Embedding Matching for Accurate and Diverse 3D Point Cloud Reconstruction from a Single Image , 2018, BMVC.

[128] Dragomir Anguelov,et al. SCAPE: shape completion and animation of people , 2005, ACM Trans. Graph..

[129] Markus H. Gross,et al. Human Shape from Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[130] Hamid Laga,et al. Statistical Modeling of the 3D Geometry and Topology of Botanical Trees , 2018, Comput. Graph. Forum.

[131] James M. Rehg,et al. 3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[132] Yang Zhang,et al. Point Cloud GAN , 2018, DGS@ICLR.

[133] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[134] Shengping Zhang,et al. Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[135] Jianxiong Xiao,et al. 3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[136] Silvio Savarese,et al. Joint 2D-3D-Semantic Data for Indoor Scene Understanding , 2017, ArXiv.

[137] Marc Pollefeys,et al. Learning Priors for Semantic 3D Reconstruction , 2018, ECCV.

[138] Zoran Popovic,et al. The space of human body shapes: reconstruction and parameterization from range scans , 2003, ACM Trans. Graph..

[139] Yiyi Liao,et al. Deep Marching Cubes: Learning Explicit Surface Representations , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[140] Timothy F. Cootes,et al. Active Appearance Models , 1998, ECCV.

[141] Christian Theobalt,et al. Multi-Garment Net: Learning to Dress 3D People From Images , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[142] Serge J. Belongie,et al. Learning Single-View 3D Reconstruction with Limited Pose Supervision , 2018, ECCV.

[143] T. Gevers,et al. Inferring Point Clouds from Single Monocular Images by Depth Intermediation , 2018, ArXiv.

[144] Chen Kong,et al. Learning Efficient Point Cloud Generation for Dense 3D Object Reconstruction , 2017, AAAI.

[145] Zhen Li,et al. High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[146] J. Tenenbaum,et al. MarrNet : 3 D Shape Reconstruction via 2 . 5 D Sketches , 2017 .

[147] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[148] Yaonan Wang,et al. 3D Face Reconstruction from Light Field Images: A Model-free Approach , 2017, ECCV.

[149] Roberto Cipolla,et al. PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[150] Gabriel Taubin,et al. SSD: Smooth Signed Distance Surface Reconstruction , 2011, Comput. Graph. Forum.

[151] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[152] Tatsuya Harada,et al. Neural 3D Mesh Renderer , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[153] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[154] Tatsuya Harada,et al. Learning View Priors for Single-View 3D Reconstruction , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[155] Leonidas J. Guibas,et al. GRASS: Generative Recursive Autoencoders for Shape Structures , 2017, ACM Trans. Graph..

[156] Thomas Brox,et al. What Do Single-View 3D Reconstruction Networks Learn? , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[157] Andrea Vedaldi,et al. Capturing the Geometry of Object Categories from Video Supervision , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[158] Matan Sela,et al. Learning Detailed Face Reconstruction from a Single Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[159] Tal Hassner,et al. Regressing Robust and Discriminative 3D Morphable Models with a Very Deep Neural Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[160] Yang Liu,et al. Adaptive O-CNN: A Patch-based Deep Representation of 3D Shapes , 2018 .

[161] Hao Su,et al. A Point Set Generation Network for 3D Object Reconstruction from a Single Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[162] Bo Yang,et al. 3D Object Reconstruction from a Single Depth View with Adversarial Learning , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[163] Gernot Riegler,et al. OctNet: Learning Deep 3D Representations at High Resolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[164] Marc Levoy,et al. A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[165] Yan Lu,et al. MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image , 2018, AAAI.