Extracting Triangular 3D Models, Materials, and Lighting From Images

We present an efficient method for joint optimization of topology, materials and lighting from multi-view image observations. Unlike recent multi-view reconstruction approaches, which typically produce entangled 3D representations encoded in neural networks, we output triangle meshes with spatially-varying materials and environment lighting that can be deployed in any traditional graphics engine unmodified. We leverage recent work in differentiable rendering, coordinate-based networks to compactly represent volumetric texturing, alongside differentiable marching tetrahedrons to enable gradient-based optimization directly on the surface mesh. Finally, we introduce a differentiable formulation of the split sum approximation of environment lighting to efficiently recover all-frequency lighting. Experiments show our extracted models used in advanced scene editing, material decomposition, and high quality view interpolation, all running at interactive rates in triangle-based renderers (rasterizers and path tracers).

[1]  Supasorn Suwajanakorn,et al.  NeX: Real-time View Synthesis with Neural Basis Expansion , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Jonathan T. Barron,et al.  NeRD: Neural Reflectance Decomposition from Image Collections , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[3]  James T. Kajiya,et al.  The rendering equation , 1986, SIGGRAPH.

[4]  Jonathan T. Barron,et al.  Neural-PIL: Neural Pre-Integrated Lighting for Reflectance Decomposition , 2021, NeurIPS.

[5]  Michael M. Kazhdan,et al.  Screened poisson surface reconstruction , 2013, TOGS.

[6]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Kavita Bala,et al.  Unified Shape and SVBRDF Recovery using Differentiable Monte Carlo Rendering , 2021, Comput. Graph. Forum.

[8]  Ronen Basri,et al.  Multiview Neural Surface Reconstruction by Disentangling Geometry and Appearance , 2020, NeurIPS.

[9]  R. Giryes,et al.  SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization , 2021, NeurIPS.

[10]  Giuseppe Claudio Guarnera,et al.  BRDF Representation and Acquisition , 2016, Comput. Graph. Forum.

[11]  Kalyan Sunkavalli,et al.  MaterialGAN , 2020, ACM Trans. Graph..

[12]  Kalyan Sunkavalli,et al.  Inverse Rendering for Complex Indoor Scenes: Shape, Spatially-Varying Lighting and SVBRDF From a Single Image , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Olga Sorkine-Hornung,et al.  Laplacian Mesh Processing , 2005, Eurographics.

[14]  Robert L. Cook,et al.  A Reflectance Model for Computer Graphics , 1987, TOGS.

[15]  Andrew Gardner,et al.  Linear light source reflectometry , 2003, ACM Trans. Graph..

[16]  Steve Marschner,et al.  Microfacet Models for Refraction through Rough Surfaces , 2007, Rendering Techniques.

[17]  Brent Burley Physically-Based Shading at Disney , 2012 .

[18]  Hans-Peter Seidel,et al.  Planned Sampling of Spatially Varying BRDFs , 2003, Comput. Graph. Forum.

[19]  Pratul P. Srinivasan,et al.  NeRF , 2020, ECCV.

[20]  Jaakko Lehtinen,et al.  Modular primitives for high-performance differentiable rendering , 2020, ACM Trans. Graph..

[21]  Sanja Fidler,et al.  DIB-R++: Learning to Predict Lighting and Material with a Hybrid Differentiable Renderer , 2021, NeurIPS.

[22]  Paul Debevec,et al.  NeRFactor , 2021, ACM Trans. Graph..

[23]  Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer , 2019, NeurIPS.

[24]  Ross B. Girshick,et al.  PointRend: Image Segmentation As Rendering , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Jonathan T. Barron,et al.  NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Tomas Akenine-Möller,et al.  FLIP: A Difference Evaluator for Alternating Images , 2020, Proc. ACM Comput. Graph. Interact. Tech..

[27]  Brian E. Smits,et al.  Practical physically-based shading in film and game production , 2012, SIGGRAPH '12.

[28]  Noah Snavely,et al.  PhySG: Inverse Rendering with Spherical Gaussians for Physics-based Material Editing and Relighting , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Kalyan Sunkavalli,et al.  Learning to reconstruct shape and spatially-varying reflectance from a single image , 2018, ACM Trans. Graph..

[30]  Matthew Anderson,et al.  Proposal for a Standard Default Color Space for the Internet - sRGB , 1996, CIC.

[31]  Taku Komura,et al.  NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction , 2021, ArXiv.

[32]  Ren Ng,et al.  PlenOctrees for Real-time Rendering of Neural Radiance Fields , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[33]  Reinhard Klein,et al.  Advances in geometry and reflectance acquisition (course notes) , 2015, SIGGRAPH Asia Courses.

[34]  K. Schindler,et al.  Gipuma: Massively Parallel Multi-view Stereo Reconstruction , 2016 .

[35]  Kai Zhang,et al.  NeRF++: Analyzing and Improving Neural Radiance Fields , 2020, ArXiv.

[36]  V. Prisacariu,et al.  NeRF--: Neural Radiance Fields Without Known Camera Parameters , 2021, 2102.07064.

[37]  T. Müller,et al.  Instant neural graphics primitives with a multiresolution hash encoding , 2022, ACM Trans. Graph..

[38]  Jan Kautz,et al.  Two-Shot Spatially-Varying BRDF and Shape Estimation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Jonathan T. Barron,et al.  NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Francesc Moreno-Noguer,et al.  D-NeRF: Neural Radiance Fields for Dynamic Scenes , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Andreas Geiger,et al.  GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[42]  Gernot Riegler,et al.  On Joint Estimation of Pose, Geometry and svBRDF From a Handheld Scanner , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Henrik Aanæs,et al.  Large Scale Multi-view Stereopsis Evaluation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[44]  Yiyi Liao,et al.  Deep Marching Cubes: Learning Explicit Surface Representations , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[45]  Pieter Peers,et al.  Deep inverse rendering for high-resolution SVBRDF estimation from an arbitrary number of images , 2019, ACM Trans. Graph..

[46]  Hao Li,et al.  Soft Rasterizer: A Differentiable Renderer for Image-Based 3D Reasoning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[47]  Hujun Bao,et al.  Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[48]  Sanja Fidler,et al.  Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis , 2021, NeurIPS.

[49]  Brian Karis,et al.  Real Shading in Unreal Engine 4 by , 2013 .

[50]  Yannick Hold-Geoffroy,et al.  Neural Reflectance Fields for Appearance Acquisition , 2020, ArXiv.

[51]  Richard Szeliski,et al.  Building Rome in a day , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[52]  Jeremy S. De Bonet,et al.  Poxels: Probabilistic Voxelized Volume Reconstruction , 1999 .

[53]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[54]  Sanja Fidler,et al.  Learning Deformable Tetrahedral Meshes for 3D Reconstruction , 2020, NeurIPS.

[55]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[56]  Stephan J. Garbin,et al.  FastNeRF: High-Fidelity Neural Rendering at 200FPS , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[57]  Jirí Filip,et al.  Visual Texture , 2013, Advances in Computer Vision and Pattern Recognition.

[58]  Sanja Fidler,et al.  Kaolin: A PyTorch Library for Accelerating 3D Deep Learning Research , 2019, ArXiv.

[59]  Merlin Nimier-David,et al.  Radiative Backpropagation: An Adjoint Method for Lightning-Fast Differentiable Rendering , 2020 .

[60]  Jan-Michael Frahm,et al.  Pixelwise View Selection for Unstructured Multi-View Stereo , 2016, ECCV.

[61]  Steven M. Seitz,et al.  Photorealistic Scene Reconstruction by Voxel Coloring , 1997, International Journal of Computer Vision.

[62]  Pieter Peers,et al.  Estimating Specular Roughness and Anisotropy from Second Order Spherical Gradient Illumination , 2009, Comput. Graph. Forum.

[63]  William E. Lorensen,et al.  Marching cubes: A high resolution 3D surface construction algorithm , 1987, SIGGRAPH.

[64]  Jaakko Lehtinen,et al.  Appearance-Driven Automatic 3D Model Simplification , 2021, EGSR.

[65]  Andreas Geiger,et al.  Differentiable Volumetric Rendering: Learning Implicit 3D Representations Without 3D Supervision , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[66]  Luc Van Gool,et al.  From images to 3D models , 2002, CACM.

[67]  Andreas Geiger,et al.  UNISURF: Unifying Neural Implicit Surfaces and Radiance Fields for Multi-View Reconstruction , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[68]  Matthias Zwicker,et al.  SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[69]  Merlin Nimier-David,et al.  Material and Lighting Reconstruction for Complex Indoor Scenes with Texture-space Differentiable Rendering , 2021, EGSR.