Paper Information - Extracting Triangular 3D Models, Materials, and Lighting From Images

We present an efficient method for joint optimization of topology, materials and lighting from multi-view image observations. Unlike recent multi-view reconstruction approaches, which typically produce entangled 3D representations encoded in neural networks, we output triangle meshes with spatially-varying materials and environment lighting that can be deployed unmodified in any traditional graphics engine. We leverage recent work in differentiable rendering, coordinate-based networks to compactly represent volumetric texturing, and differentiable marching tetrahedrons to enable gradient-based optimization directly on the surface mesh. Finally, we introduce a differentiable formulation of the split sum approximation of environment lighting to efficiently recover all-frequency lighting. Experiments show our extracted models being used in advanced scene editing, material decomposition, and high quality view interpolation, all running at interactive rates in triangle-based renderers (rasterizers and path tracers).
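
The abstract mentions differentiable marching tetrahedrons without showing why the step is differentiable. The following is a minimal sketch of the edge interpolation commonly used in marching tetrahedra, not the authors' implementation: on a tetrahedron edge whose endpoints carry signed distance values of opposite sign, the surface vertex is placed at the linearly interpolated zero crossing. The function name `zero_crossing` and its argument names are hypothetical, chosen for illustration only.

```python
from typing import Sequence, Tuple

Vec3 = Tuple[float, ...]


def zero_crossing(va: Sequence[float], vb: Sequence[float],
                  sa: float, sb: float) -> Vec3:
    """Return the point on edge (va, vb) where a linear SDF reaches zero.

    va, vb: endpoint positions (hypothetical names for this sketch).
    sa, sb: signed distance values at va and vb; they must differ in sign.
    """
    if sa * sb >= 0:
        raise ValueError("edge does not cross the surface: SDF signs must differ")
    # Linear interpolation of the zero level set:
    #   v = (va * sb - vb * sa) / (sb - sa)
    denom = sb - sa
    return tuple((a * sb - b * sa) / denom for a, b in zip(va, vb))


# Usage: SDF is -1 at the origin and +3 at x = 1, so the surface
# crosses the edge a quarter of the way along it.
vertex = zero_crossing((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), -1.0, 3.0)
```

Because the vertex position is a smooth function of `sa` and `sb` wherever the signs differ, an automatic differentiation framework could propagate image-space gradients back to the signed distance values, which is the property that gradient-based optimization on the surface mesh relies on.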
