Paper information - Modeling Facial Geometry Using Compositional VAEs

We propose a method for learning non-linear face geometry representations using deep generative models. Our model is a variational autoencoder with multiple levels of hidden variables, where lower layers capture global geometry and higher ones encode more local deformations. Building on this model, we propose a new parameterization of facial geometry that naturally decomposes the structure of the human face into a set of semantically meaningful levels of detail. This parameterization enables us to perform model fitting while capturing varying levels of detail under different types of geometrical constraints.
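The global/local split described above can be illustrated with a toy sketch. This is not the authors' architecture: the linear decoders are random and untrained, the partition of the mesh into equal vertex regions is an assumption made for illustration, and every name (`make_decoder`, `decode`, `N_REGIONS`, the latent sizes) is hypothetical. It shows only the compositional idea, in which one global code sets the coarse shape and per-region local codes add small offsets.

```python
import numpy as np

N_VERTICES, N_REGIONS = 12, 3   # toy mesh, split into equal vertex regions (assumption)
D_GLOBAL, D_LOCAL = 4, 2        # latent sizes, chosen arbitrarily
PER_REGION = N_VERTICES // N_REGIONS

def make_decoder(rng):
    """Random, untrained linear maps standing in for learned decoder networks."""
    w_global = rng.normal(scale=0.1, size=(D_GLOBAL, N_VERTICES * 3))
    w_local = rng.normal(scale=0.01, size=(N_REGIONS, D_LOCAL, PER_REGION * 3))
    return w_global, w_local

def decode(z_global, z_local, w_global, w_local):
    """Coarse shape from the global code plus per-region offsets from the local codes."""
    coarse = (z_global @ w_global).reshape(N_VERTICES, 3)
    # For each region r, compute z_local[r] @ w_local[r]; region r owns a
    # contiguous block of PER_REGION vertices after the reshape.
    offsets = np.einsum("rd,rdk->rk", z_local, w_local).reshape(N_VERTICES, 3)
    return coarse + offsets

rng = np.random.default_rng(0)
w_global, w_local = make_decoder(rng)
z_global = rng.normal(size=D_GLOBAL)
z_local = rng.normal(size=(N_REGIONS, D_LOCAL))
face = decode(z_global, z_local, w_global, w_local)

# Perturb only region 0's local code; vertices outside region 0 do not move.
z_edit = z_local.copy()
z_edit[0] += 1.0
edited = decode(z_global, z_edit, w_global, w_local)
```

In this sketch, changing `z_global` moves every vertex, while changing one row of `z_local` moves only that region's vertices. A trained model would replace the random matrices with learned networks and would typically let the local levels depend on the global one.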
