Paper information - FLNeRF: 3D Facial Landmarks Estimation in Neural Radiance Fields

This paper presents the first significant work on directly predicting 3D face landmarks on neural radiance fields (NeRFs). Our 3D coarse-to-fine Face Landmarks NeRF (FLNeRF) model efficiently samples from a given face NeRF with individual facial features for accurate landmark detection. Expression augmentation is applied to facial features at a fine scale to simulate a large range of emotions, including exaggerated facial expressions (e.g., cheek blowing, wide mouth opening, eye blinking), for training FLNeRF. Qualitative and quantitative comparisons with related state-of-the-art 3D facial landmark estimation methods demonstrate the efficacy of FLNeRF, which contributes to downstream tasks such as high-quality face editing and swapping with direct control using our NeRF landmarks. Code and data will be available. GitHub link: https://github.com/ZHANG1023/FLNeRF.

Chi-Keung Tang | Yu-Wing Tai | Hao Zhang | Tianyuan Dai
