论文信息 - NeLF: Neural Light-transport Field for Portrait View Synthesis and Relighting

NeLF: Neural Light-transport Field for Portrait View Synthesis and Relighting

Human portraits exhibit various appearances when observed from different views under different lighting conditions. We can easily imagine how the face will look like in another setup, but computer algorithms still fail on this problem given limited observations. To this end, we present a system for portrait view synthesis and relighting: given multiple portraits, we use a neural network to predict the light-transport field in 3D space, and from the predicted Neural Light-transport Field (NeLF) produce a portrait from a new camera view under a new environmental lighting. Our system is trained on a large number of synthetic models, and can generalize to different synthetic and real portraits under various lighting conditions. Our method achieves simultaneous view synthesis and relighting given multi-view portraits as the input, and achieves state-of-the-art results. CCS Concepts • Computing methodologies → Image-based rendering; Computational photography;

[1] Wojciech Matusik,et al. Progressively-Refined Reflectance Functions from Natural Illumination , 2004 .

[2] Yun-Ta Tsai,et al. Neural Light Transport for Relighting and View Synthesis , 2021, ACM Transactions on Graphics.

[3] Lance Williams,et al. View Interpolation for Image Synthesis , 1993, SIGGRAPH.

[4] James T. Kajiya,et al. The rendering equation , 1986, SIGGRAPH.

[5] Leonard McMillan,et al. Plenoptic Modeling: An Image-Based Rendering System , 2023 .

[6] Rynson W. H. Lau,et al. Is a Green Screen Really Necessary for Real-Time Human Matting? , 2020 .

[7] Xiaoming Liu,et al. Nonlinear 3D Face Morphable Model , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8] Gang Yu,et al. BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation , 2018, ECCV.

[9] Graham Fyffe,et al. Stereo Magnification: Learning View Synthesis using Multiplane Images , 2018, ArXiv.

[10] Xin Tong,et al. Deferred Neural Lighting: Free-viewpoint Relighting from Unstructured Photographs , 2020 .

[11] Ira Kemelmacher-Shlizerman,et al. Face Reconstruction from a Single Image using a Single Reference Face Shape , 2009 .

[12] Peiran REN,et al. Image based relighting using neural networks , 2015, ACM Trans. Graph..

[13] Yun-Ta Tsai,et al. Light stage super-resolution , 2020, ACM Trans. Graph..

[14] Ravi Ramamoorthi,et al. Local Light Field Fusion: Practical View Synthesis with Prescriptive Sampling Guidelines , 2019 .

[15] Georgios Tzimiropoulos,et al. How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[16] Kyaw Zaw Lin,et al. Neural Sparse Voxel Fields , 2020, NeurIPS.

[17] Matthew Turk,et al. A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[18] Marc Levoy,et al. Light field rendering , 1996, SIGGRAPH.

[19] John Flynn,et al. Deep Stereo: Learning to Predict New Views from the World's Imagery , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Kalyan Sunkavalli,et al. Deep image-based relighting from optimal sparse samples , 2018, ACM Trans. Graph..

[21] Kalyan Sunkavalli,et al. Deep view synthesis from sparse photometric images , 2019, ACM Trans. Graph..

[22] Matthew Tancik,et al. pixelNeRF: Neural Radiance Fields from One or Few Images , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Yu-Ting Tsai,et al. All-frequency precomputed radiance transfer using spherical radial basis functions and clustered tensor approximation , 2006, SIGGRAPH '06.

[24] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Kalyan Sunkavalli,et al. Deep 3D Capture: Geometry and Reflectance From Sparse Multi-View Images , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Paul E. Debevec,et al. Acquiring the reflectance field of a human face , 2000, SIGGRAPH.

[27] Rynson W. H. Lau,et al. Is a Green Screen Really Necessary for Real-Time Portrait Matting? , 2020, ArXiv.

[28] Jan-Michael Frahm,et al. Pixelwise View Selection for Unstructured Multi-View Stereo , 2016, ECCV.

[29] Gordon Wetzstein,et al. State of the Art on Neural Rendering , 2020, Comput. Graph. Forum.

[30] Jonathan T. Barron,et al. NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31] Pieter Peers,et al. Compressive light transport sensing , 2009, ACM Trans. Graph..

[32] Jonathan T. Barron,et al. NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33] Pratul P. Srinivasan,et al. NeRF , 2020, ECCV.

[34] R. Howe,et al. 17th International Conference on Medical Image Computing and Computer-Assisted Intervention. , 2014, Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention.

[35] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[36] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[37] Richard Szeliski,et al. Piecewise planar stereo for image-based rendering , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[38] Yannick Hold-Geoffroy,et al. Neural Reflectance Fields for Appearance Acquisition , 2020, ArXiv.

[39] Justus Thies,et al. Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40] Mark Pauly,et al. Realtime performance-based facial animation , 2011, ACM Trans. Graph..

[41] Carlos D. Castillo,et al. SfSNet: Learning Shape, Reflectance and Illuminance of Faces 'in the Wild' , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Patrick Pérez,et al. Automatic Face Reenactment , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[43] Michael Zollhöfer,et al. Learning Compositional Radiance Fields of Dynamic Human Heads , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Jitendra Malik,et al. Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[45] Yannick Hold-Geoffroy,et al. Deep Reflectance Volumes: Relightable Reconstructions from Multi-View Photometric Images , 2020, ECCV.

[46] Jaakko Lehtinen,et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[47] Michael Bosse,et al. Unstructured lumigraph rendering , 2001, SIGGRAPH.

[48] David W. Jacobs,et al. Deep Single-Image Portrait Relighting , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[49] Jan Kautz,et al. Precomputed radiance transfer for real-time rendering in dynamic, low-frequency lighting environments , 2002 .

[50] Ruigang Yang,et al. FaceScape: A Large-Scale High Quality 3D Face Dataset and Detailed Riggable 3D Face Prediction , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[51] Richard Szeliski,et al. The lumigraph , 1996, SIGGRAPH.

[52] Jitendra Malik,et al. Shape, Illumination, and Reflectance from Shading , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[53] Patrick Pérez,et al. State of the Art on Monocular 3D Face Reconstruction, Tracking, and Applications , 2018, Comput. Graph. Forum.

[54] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[55] Lingyun Wu,et al. MaskGAN: Towards Diverse and Interactive Facial Image Manipulation , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[56] Jan-Michael Frahm,et al. Structure-from-Motion Revisited , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57] Yaser Sheikh,et al. Mixture of volumetric primitives for efficient neural rendering , 2021, ACM Transactions on Graphics.

[58] Jonathan T. Barron,et al. IBRNet: Learning Multi-View Image-Based Rendering , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[59] Jean-François Lalonde,et al. Learning Physics-Guided Face Relighting Under Directional Light , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[60] Jonathan T. Barron,et al. Deformable Neural Radiance Fields , 2020, ArXiv.

[61] Jiaolong Yang,et al. Deep 3D Portrait From a Single Image , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[62] Yun-Ta Tsai,et al. Single image portrait relighting , 2019, ACM Trans. Graph..

[63] Pat Hanrahan,et al. All-frequency shadows using non-linear wavelet lighting approximation , 2003, ACM Trans. Graph..