论文信息 - Neural Fields as Learnable Kernels for 3D Reconstruction

Neural Fields as Learnable Kernels for 3D Reconstruction

We present Neural Kernel Fields: a novel method for reconstructing implicit 3D shapes based on a learned kernel ridge regression. Our technique achieves state-of-the-art results when reconstructing 3D objects and large scenes from sparse oriented points, and can reconstruct shape categories outside the training set with almost no drop in accuracy. The core insight of our approach is that kernel methods are extremely effective for reconstructing shapes when the chosen kernel has an appropriate inductive bias. We thus factor the problem of shape reconstruction into two parts: (1) a backbone neural network which learns kernel parameters from data, and (2) a kernel ridge regression that fits the input points on-the-fly by solving a simple positive definite linear system using the learned kernel. As a result of this factorization, our reconstruction gains the benefits of datadriven methods under sparse point density while maintaining interpolatory behavior, which converges to the ground truth shape as input sampling density increases. Our experiments demonstrate a strong generalization capability to objects outside the train-set category and scanned scenes. Source code and pretrained models are available at https:// nv-tlabs.github.io/nkf.

[1] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2] Gordon Wetzstein,et al. MetaSDF: Meta-learning Signed Distance Functions , 2020, NeurIPS.

[3] Simon Lucey,et al. SDF-SRN: Learning Signed Distance 3D Object Reconstruction from Static Images , 2020, NeurIPS.

[4] Daniel Cohen-Or,et al. Point2Mesh , 2020, ACM Trans. Graph..

[5] Joan Bruna,et al. Gradient Dynamics of Shallow Univariate ReLU Networks , 2019, NeurIPS.

[6] Leonidas J. Guibas,et al. CaSPR: Learning Canonical Spatiotemporal Point Cloud Representations , 2020, NeurIPS.

[7] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Alexander M. Bronstein,et al. Deformable Shape Completion with Graph Convolutional Autoencoders , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9] Denis Zorin,et al. Neural Splines: Fitting 3D Surfaces with Infinitely-Wide Neural Networks , 2020, ArXiv.

[10] Jitendra Malik,et al. Mesh R-CNN , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[11] Daniel Cohen-Or,et al. EC-Net: an Edge-aware Point set Consolidation Network , 2018, ECCV.

[12] Yinda Zhang,et al. DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Jonathan T. Barron,et al. Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains , 2020, NeurIPS.

[14] Leonidas J. Guibas,et al. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[15] Jiajun Wu,et al. MarrNet: 3D Shape Reconstruction via 2.5D Sketches , 2017, NIPS.

[16] Ron Kimmel,et al. The Whole Is Greater Than the Sum of Its Nonrigid Parts , 2020, ArXiv.

[17] Sanja Fidler,et al. Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis , 2021, NeurIPS.

[18] Andrea Tagliasacchi,et al. NASA: Neural Articulated Shape Approximation , 2020, ECCV.

[19] Leonidas J. Guibas,et al. Object-Centric Multi-View Aggregation , 2020, ArXiv.

[20] Joan Bruna,et al. Deep Geometric Prior for Surface Reconstruction , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Lorenzo Rosasco,et al. FALKON: An Optimal Large Scale Kernel Method , 2017, NIPS.

[22] Thomas A. Funkhouser,et al. Learning Shape Templates With Structured Implicit Functions , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[23] Hao Su,et al. A Point Set Generation Network for 3D Object Reconstruction from a Single Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Ronen Basri,et al. Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance , 2020, NeurIPS.

[25] Gerard Pons-Moll,et al. Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Marc Pollefeys,et al. Convolutional Occupancy Networks , 2020, ECCV.

[27] Federico Tombari,et al. 3D Point Capsule Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Andrea Tagliasacchi,et al. Vector Neurons: A General Framework for SO(3)-Equivariant Networks , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[29] Silvio Savarese,et al. 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction , 2016, ECCV.

[30] Jitendra Malik,et al. Multi-view Supervision for Single-View Reconstruction via Differentiable Ray Consistency , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31] Leonidas J. Guibas,et al. ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.

[32] Mathieu Aubry,et al. A Papier-Mache Approach to Learning 3D Surface Generation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33] Xiaoguang Han,et al. Deep Mesh Reconstruction From Single RGB Images via Topology Modification Networks , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[34] Hao Zhang,et al. Learning Implicit Fields for Generative Shape Modeling , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Yaron Lipman,et al. SAL: Sign Agnostic Learning of Shapes From Raw Data , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36] Geoffrey E. Hinton,et al. Using Deep Belief Nets to Learn Covariance Kernels for Gaussian Processes , 2007, NIPS.

[37] Sebastian Nowozin,et al. Occupancy Networks: Learning 3D Reconstruction in Function Space , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Andrew Gordon Wilson,et al. Deep Kernel Learning , 2015, AISTATS.

[39] Leonidas Guibas,et al. StrobeNet: Category-Level Multiview Reconstruction of Articulated Objects , 2021, ArXiv.

[40] B. Li,et al. A Survey of Methods for Moving Least Squares Surfaces , 2008, VG/PBG@SIGGRAPH.

[41] Abhinav Gupta,et al. Learning a Predictable and Generative Vector Representation for Objects , 2016, ECCV.

[42] Yaron Lipman,et al. Implicit Geometric Regularization for Learning Shapes , 2020, ICML.

[43] Chen Kong,et al. Learning Efficient Point Cloud Generation for Dense 3D Object Reconstruction , 2017, AAAI.

[44] Thomas Funkhouser,et al. Local Deep Implicit Functions for 3D Shape , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Daniel Cohen-Or,et al. PU-Net: Point Cloud Upsampling Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[46] Geoffrey E. Hinton,et al. Canonical Capsules: Unsupervised Capsules in Canonical Pose , 2020, ArXiv.

[47] Hao Zhang,et al. BSP-Net: Generating Compact Meshes via Binary Space Partitioning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Wei Hu,et al. Diffusion Probabilistic Models for 3D Point Cloud Generation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[49] Daniele Panozzo,et al. VoronoiNet : General Functional Approximators with Local Support , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[50] Eddy Ilg,et al. Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction , 2020, ECCV.

[51] Thomas Funkhouser,et al. Local Implicit Grid Representations for 3D Scenes , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[52] Linqi Zhou,et al. 3D Shape Generation and Completion through Point-Voxel Diffusion , 2021, ArXiv.

[53] Andrea Tagliasacchi,et al. CvxNet: Learnable Convex Decomposition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[54] Sanja Fidler,et al. Learning Deformable Tetrahedral Meshes for 3D Reconstruction , 2020, NeurIPS.

[55] Subhransu Maji,et al. Deep Manifold Prior , 2020, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[56] Elliot J. Crowley,et al. Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels , 2020, NeurIPS.

[57] Marc Pollefeys,et al. Shape As Points: A Differentiable Poisson Solver , 2021, NeurIPS.

[58] Richard A. Newcombe,et al. DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[59] Nathan Srebro,et al. A Function Space View of Bounded Norm Infinite Width ReLU Nets: The Multivariate Case , 2019, ICLR.

[60] Charles T. Loop,et al. Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[61] Michael M. Kazhdan,et al. Screened poisson surface reconstruction , 2013, TOGS.

[62] Jan Kautz,et al. Meshlet Priors for 3D Mesh Reconstruction , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[63] Mathieu Aubry,et al. Learning elementary structures for 3D shape generation and matching , 2019, NeurIPS.

[64] Matthias Nießner,et al. ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[65] Thomas Brox,et al. What Do Single-View 3D Reconstruction Networks Learn? , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[66] Jitendra Malik,et al. Hierarchical Surface Prediction for 3D Object Reconstruction , 2017, 2017 International Conference on 3D Vision (3DV).

[67] Yaron Lipman,et al. SAL++: Sign Agnostic Learning with Derivatives , 2020, ICLR.

[68] Gordon Wetzstein,et al. Implicit Neural Representations with Periodic Activation Functions , 2020, NeurIPS.

[69] Zihao Wang,et al. Secrets of 3D Implicit Object Shape Reconstruction in the Wild , 2021, ArXiv.