Dynamic 3D avatar creation from hand-held video input

We present a complete pipeline for creating fully rigged, personalized 3D facial avatars from hand-held video. Our system faithfully recovers facial expression dynamics of the user by adapting a blendshape template to an image sequence of recorded expressions using an optimization that integrates feature tracking, optical flow, and shape from shading. Fine-scale details such as wrinkles are captured separately in normal maps and ambient occlusion maps. From this user- and expression-specific data, we learn a regressor for on-the-fly detail synthesis during animation to enhance the perceptual realism of the avatars. Our system demonstrates that the use of appropriate reconstruction priors yields compelling face rigs even with a minimalistic acquisition system and limited user assistance. This facilitates a range of new applications in computer animation and consumer-level online communication based on personalized avatars. We present realtime application demos to validate our method.

[1]  Ken-ichi Anjyo,et al.  Practice and Theory of Blendshape Facial Models , 2014, Eurographics.

[2]  Simon Lucey,et al.  Face alignment through subspace constrained mean-shifts , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Mark Pauly,et al.  Realtime performance-based facial animation , 2011, ACM Trans. Graph..

[4]  Ronen Basri,et al.  Accuracy of Spherical Harmonic Approximations for Images of Lambertian Objects under Far and Near Lighting , 2004, ECCV.

[5]  Pierre Alliez,et al.  Polygon Mesh Processing , 2010 .

[6]  Kun Zhou,et al.  3D shape regression for real-time facial animation , 2013, ACM Trans. Graph..

[7]  Ira Kemelmacher-Shlizerman,et al.  Face Reconstruction from a Single Image using a Single Reference Face Shape , 2009 .

[8]  Markus H. Gross,et al.  Pose-space animation and transfer of facial details , 2008, SCA '08.

[9]  Horst Bischof,et al.  A Duality Based Approach for Realtime TV-L1 Optical Flow , 2007, DAGM-Symposium.

[10]  Xin Tong,et al.  Automatic acquisition of high-fidelity facial performances using monocular videos , 2014, ACM Trans. Graph..

[11]  Jian Sun,et al.  Face Alignment by Explicit Shape Regression , 2012, International Journal of Computer Vision.

[12]  Yiying Tong,et al.  FaceWarehouse: A 3D Facial Expression Database for Visual Computing , 2014, IEEE Transactions on Visualization and Computer Graphics.

[13]  Leonidas J. Guibas,et al.  Robust single-view geometry and motion reconstruction , 2009, ACM Trans. Graph..

[14]  Jun Li,et al.  Lightweight wrinkle synthesis for 3D facial modeling and animation , 2015, Comput. Aided Des..

[15]  Thabo Beeler,et al.  Facial performance enhancement using dynamic shape space analysis , 2014, TOGS.

[16]  Robert M. Gray,et al.  Toeplitz and Circulant Matrices: A Review , 2005, Found. Trends Commun. Inf. Theory.

[17]  Robert M. Gray,et al.  Toeplitz And Circulant Matrices: A Review (Foundations and Trends(R) in Communications and Information Theory) , 2006 .

[18]  Nadia Magnenat-Thalmann,et al.  Simulation of static and dynamic wrinkles of skin , 1996, Proceedings Computer Animation '96.

[19]  Hao Li,et al.  Realtime performance-based facial animation , 2011, ACM Trans. Graph..

[20]  Xin Tong,et al.  Leveraging motion capture and 3D scanning for high-fidelity facial performance acquisition , 2011, ACM Trans. Graph..

[21]  Michael Todd Bunnell,et al.  Dynamic Ambient Occlusion and Indirect Lighting , 2005 .

[22]  Wotao Yin,et al.  Iteratively reweighted algorithms for compressive sensing , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[23]  Paul Debevec,et al.  Creating a Photoreal Digital Actor: The Digital Emily Project , 2009 .

[24]  Paul A. Beardsley,et al.  Coupled 3D reconstruction of sparse facial hair and skin , 2012, ACM Trans. Graph..

[25]  Wolfgang Engel GPU Pro 2 , 2016 .

[26]  Jovan Popovic,et al.  Deformation transfer for triangle meshes , 2004, ACM Trans. Graph..

[27]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[28]  Christopher Oat Animated wrinkle maps , 2007, SIGGRAPH '07.

[29]  Andrew Jones,et al.  Digital Ira: creating a real-time photoreal digital actor , 2013, SIGGRAPH '13.

[30]  Kun Zhou,et al.  Displaced dynamic expression regression for real-time facial tracking and animation , 2014, ACM Trans. Graph..

[31]  Yangang Wang,et al.  Online modeling for realtime facial animation , 2013, ACM Trans. Graph..

[32]  Derek Bradley,et al.  High-quality capture of eyes , 2014, ACM Trans. Graph..

[33]  Raghu Raghavan,et al.  A kinematic-variational model for animating skin with wrinkles , 2005, Comput. Graph..

[34]  SunJian,et al.  Face Alignment by Explicit Shape Regression , 2014 .

[35]  Diego Gutierrez,et al.  Practical and Realistic Facial Wrinkles Animation , 2010 .

[36]  Wenjiang J. Fu Penalized Regressions: The Bridge versus the Lasso , 1998 .

[37]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  A. Chambolle,et al.  An introduction to Total Variation for Image Analysis , 2009 .

[39]  Andrew W. Fitzgibbon,et al.  Reconstructing High Quality Face-Surfaces using Model Based Stereo , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[40]  Luc Van Gool,et al.  Face/Off: live facial puppetry , 2009, SCA '09.

[41]  John Hart,et al.  ACM Transactions on Graphics , 2004, SIGGRAPH 2004.

[42]  Shahram Izadi,et al.  Real-time shading-based refinement for consumer depth cameras , 2014, ACM Trans. Graph..

[43]  Jihun Yu,et al.  Realtime facial animation with on-the-fly correctives , 2013, ACM Trans. Graph..

[44]  Andrea Tagliasacchi,et al.  Dynamic 2D/3D Registration , 2014, Eurographics.

[45]  Thabo Beeler,et al.  High-quality single-shot capture of facial geometry , 2010, ACM Trans. Graph..

[46]  Rafael C. González,et al.  Digital image processing, 3rd Edition , 2008 .

[47]  PaulyMark,et al.  Dynamic 3D avatar creation from hand-held video input , 2015 .

[48]  Christian Theobalt,et al.  Reconstructing detailed dynamic face geometry from monocular video , 2013, ACM Trans. Graph..

[49]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[50]  Simon Lucey,et al.  Deformable Model Fitting by Regularized Landmark Mean-Shift , 2010, International Journal of Computer Vision.

[51]  Jovan Popović,et al.  Deformation transfer for triangle meshes , 2004, SIGGRAPH 2004.

[52]  Changchang Wu,et al.  Towards Linear-Time Incremental Structure from Motion , 2013, 2013 International Conference on 3D Vision.

[53]  Leonidas J. Guibas,et al.  Robust single-view geometry and motion reconstruction , 2009, SIGGRAPH 2009.

[54]  Chongyang Ma,et al.  Robust hair capture using simulated examples , 2014, ACM Trans. Graph..

[55]  Derek Bradley,et al.  High-quality passive facial performance capture using anchor frames , 2011, ACM Trans. Graph..

[56]  Mark Pauly,et al.  Dynamic 2D/3D registration for the Kinect , 2013, SIGGRAPH '13.

[57]  Kun Zhou,et al.  A reduced model for interactive hairs , 2014, ACM Trans. Graph..

[58]  Pieter Peers,et al.  Facial performance synthesis using deformation-driven polynomial displacement maps , 2008, SIGGRAPH Asia '08.

[59]  P. Debevec,et al.  Creating a Photoreal Digital Actor: The Digital Emily Project , 2009, 2009 Conference for Visual Media Production.

[60]  Paul E. Debevec,et al.  Multiview face capture using polarized spherical gradient illumination , 2011, ACM Trans. Graph..

[61]  Hanspeter Pfister,et al.  Face transfer with multilinear models , 2005, ACM Trans. Graph..

[62]  Matthew Turk,et al.  A Morphable Model For The Synthesis Of 3D Faces , 1999, SIGGRAPH.

[63]  Thomas Vetter,et al.  A morphable model for the synthesis of 3D faces , 1999, SIGGRAPH.

[64]  Hans-Peter Seidel,et al.  Lightweight binocular facial performance capture under uncontrolled lighting , 2012, ACM Trans. Graph..

[65]  Li Zhang,et al.  Spacetime faces: high resolution capture for modeling and animation , 2004, SIGGRAPH 2004.

[66]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.