Constraining dense hand surface tracking with elasticity

Many of the actions that we take with our hands involve self-contact and occlusion: shaking hands, making a fist, or interlacing our fingers while thinking. This use of of our hands illustrates the importance of tracking hands through self-contact and occlusion for many applications in computer vision and graphics, but existing methods for tracking hands and faces are not designed to treat the extreme amounts of self-contact and self-occlusion exhibited by common hand gestures. By extending recent advances in vision-based tracking and physically based animation, we present the first algorithm capable of tracking high-fidelity hand deformations through highly self-contacting and self-occluding hand gestures, for both single hands and two hands. By constraining a vision-based tracking algorithm with a physically based deformable model, we obtain an algorithm that is robust to the ubiquitous self-interactions and massive self-occlusions exhibited by common hand gestures, allowing us to track two hand interactions and some of the most difficult possible configurations of a human hand.

[1]  Andrew Jones,et al.  Driving High-Resolution Facial Scans with Video Performance Capture , 2014, ACM Trans. Graph..

[2]  Varun Ramakrishna,et al.  Convolutional Pose Machines , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Nicolas Stoiber,et al.  BlendForces: A Dynamic Framework for Facial Animation , 2016, Comput. Graph. Forum.

[4]  David Kim,et al.  Articulated distance fields for ultra-fast tracking of hands interacting , 2017, ACM Trans. Graph..

[5]  Ken-ichi Anjyo,et al.  Practice and Theory of Blendshape Facial Models , 2014, Eurographics.

[6]  Andrew Jones,et al.  Multi‐View Stereo on Consistent Face Topology , 2017, Comput. Graph. Forum.

[7]  Ingo Wald,et al.  Embree: a kernel framework for efficient CPU ray tracing , 2014, ACM Trans. Graph..

[8]  M. Otaduy,et al.  Capture and modeling of non-linear heterogeneous soft tissue , 2009, ACM Trans. Graph..

[9]  Stefan Jeschke,et al.  Non-smooth Newton Methods for Deformable Multi-body Dynamics , 2019, ACM Trans. Graph..

[10]  Olga Sorkine-Hornung,et al.  Deformation Capture via Soft and Stretchable Sensor Arrays , 2018, ACM Trans. Graph..

[11]  KohliPushmeet,et al.  Efficient and precise interactive hand tracking through joint, continuous optimization of pose and correspondences , 2016 .

[12]  Yin Yang,et al.  Descent methods for elastic body simulation on the GPU , 2016, ACM Trans. Graph..

[13]  Henry Fuchs,et al.  An implicit finite element method for elastic solids in contact , 2001, Proceedings Computer Animation 2001. Fourteenth Conference on Computer Animation (Cat. No.01TH8596).

[14]  Demetri Terzopoulos,et al.  Constraints on Deformable Models: Recovering 3D Shape and Nonrigid Motion , 1988, Artif. Intell..

[15]  Marc Alexa,et al.  As-rigid-as-possible surface modeling , 2007, Symposium on Geometry Processing.

[16]  Antonis A. Argyros,et al.  Full DOF tracking of a hand interacting with an object by modeling occlusions and physical constraints , 2011, 2011 International Conference on Computer Vision.

[17]  Antonis A. Argyros,et al.  Efficient model-based 3D tracking of hand articulations using Kinect , 2011, BMVC.

[18]  Andrew W. Fitzgibbon,et al.  Fits Like a Glove: Rapid and Reliable Hand Shape Personalization , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Daniele Panozzo,et al.  Tetrahedral meshing in the wild , 2018, ACM Trans. Graph..

[20]  Georgios D. Evangelidis,et al.  Parametric Image Alignment Using Enhanced Correlation Coefficient Maximization , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jianfei Cai,et al.  3D Hand Shape and Pose Estimation from a Single RGB Image (Supplementary Material) , 2019 .

[22]  Theodore Kim,et al.  Stable Neo-Hookean Flesh Simulation , 2018, ACM Trans. Graph..

[23]  Demetri Terzopoulos,et al.  Symmetry-seeking models and 3D object reconstruction , 1988, International Journal of Computer Vision.

[24]  Daniel Cremers,et al.  Correspondence-Free Material Reconstruction using Sparse Surface Constraints , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Robert Bridson,et al.  Blended cured quasi-newton for distortion optimization , 2018, ACM Trans. Graph..

[26]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[27]  Shuangshuang Jin,et al.  A comparison of algorithms for vertex normal computation , 2005, The Visual Computer.

[28]  Andrew W. Fitzgibbon,et al.  Real-time non-rigid reconstruction using an RGB-D camera , 2014, ACM Trans. Graph..

[29]  Marc Pollefeys,et al.  Capturing Hands in Action Using Discriminative Salient Points and Physics Simulation , 2015, International Journal of Computer Vision.

[30]  Demetri Terzopoulos,et al.  Snakes: Active contour models , 2004, International Journal of Computer Vision.

[31]  Ronald Fedkiw,et al.  Invertible finite elements for robust simulation of large deformation , 2004, SCA '04.

[32]  Jérémie Allard,et al.  Image-based collision detection and response between arbitrary volume objects , 2008, SCA '08.

[33]  Jernej Barbic,et al.  FEM simulation of 3D deformable solids: a practitioner's guide to theory, discretization and model reduction , 2012, SIGGRAPH '12.

[34]  Kun Zhou,et al.  An asymptotic numerical method for inverse elastic shape design , 2014, ACM Trans. Graph..

[35]  Bin Wang,et al.  Deformation capture and modeling of soft objects , 2015, ACM Trans. Graph..

[36]  Yaser Sheikh,et al.  Deep incremental learning for efficient high-fidelity face tracking , 2018, ACM Trans. Graph..

[37]  Hang Si,et al.  TetGen, a Delaunay-Based Quality Tetrahedral Mesh Generator , 2015, ACM Trans. Math. Softw..

[38]  Ronald Fedkiw,et al.  Fully automatic generation of anatomical face simulation models , 2015, Symposium on Computer Animation.

[39]  Konrad Schindler,et al.  Massively Parallel Multiview Stereopsis by Surface Normal Diffusion , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[40]  Olga Sorkine-Hornung,et al.  Deformation Capture via Self-Sensing Capacitive Arrays (Video) , 2018, 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[41]  Andrew P. Witkin,et al.  Untangling cloth , 2003, ACM Trans. Graph..

[42]  Mariette Yvinec,et al.  Variational tetrahedral meshing , 2005, ACM Trans. Graph..

[43]  Dimitris N. Metaxas,et al.  Shape and Nonrigid Motion Estimation Through Physics-Based Synthesis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[44]  Antti Oulasvirta,et al.  Interactive Markerless Articulated Hand Motion Tracking Using RGB and Depth Data , 2013, 2013 IEEE International Conference on Computer Vision.

[45]  Derek Bradley,et al.  An anatomically-constrained local deformation model for monocular face capture , 2016, ACM Trans. Graph..

[46]  Scott Schaefer,et al.  Bijective parameterization with free boundaries , 2015, ACM Trans. Graph..

[47]  Derek Bradley,et al.  High-quality passive facial performance capture using anchor frames , 2011, ACM Trans. Graph..

[48]  Parimal Aswani,et al.  Tracking of deformable objects , 2000, IS&T/SPIE Electronic Imaging.

[49]  Cordelia Schmid,et al.  Learning Joint Reconstruction of Hands and Manipulated Objects , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Maud Marchal,et al.  Simultaneous Tracking and Elasticity Parameter Estimation of Deformable Objects , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[51]  Andrew W. Fitzgibbon,et al.  Efficient and precise interactive hand tracking through joint, continuous optimization of pose and correspondences , 2016, ACM Trans. Graph..

[52]  Jaroslav Krivánek,et al.  Reconstructing personalized anatomical models for physics-based body animation , 2016, ACM Trans. Graph..

[53]  Ronald Fedkiw,et al.  A Crystalline, Red Green Strategy for Meshing Highly Deformable Objects with Tetrahedra , 2003, IMR.

[54]  Sergio Escalera,et al.  Depth-Based 3D Hand Pose Estimation: From Current Achievements to Future Goals , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[55]  Jochen Lang,et al.  Finite element based tracking of deforming surfaces , 2013, Graph. Model..

[56]  Hujun Bao,et al.  Subspace dynamic simulation using rotation-strain coordinates , 2015, ACM Trans. Graph..

[57]  Yaser Sheikh,et al.  Hand Keypoint Detection in Single Images Using Multiview Bootstrapping , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[58]  Ronald Fedkiw,et al.  Volume conserving finite element simulations of deformable models , 2007, ACM Trans. Graph..

[59]  Ronald Fedkiw,et al.  Robust quasistatic finite elements and flesh simulation , 2005, SCA '05.

[60]  Paul G. Kry,et al.  Embedded thin shells for wrinkle simulation , 2013, ACM Trans. Graph..

[61]  Dinesh K. Pai,et al.  The human touch , 2018, ACM Trans. Graph..

[62]  Ladislav Kavan,et al.  Building Accurate Physics-based Face Models from Data , 2019, PACMCGIT.

[63]  Pieter Abbeel,et al.  Tracking deformable objects with point clouds , 2013, 2013 IEEE International Conference on Robotics and Automation.

[64]  David Harmon,et al.  Asynchronous contact mechanics , 2009, SIGGRAPH 2009.

[65]  Leonard McMillan,et al.  Stable real-time deformations , 2002, SCA '02.

[66]  Andrea Tagliasacchi,et al.  Sphere-meshes for real-time hand modeling and tracking , 2016, ACM Trans. Graph..

[67]  Richard Szeliski,et al.  Physically based and probabilistic models for computer vision , 1991, Optics & Photonics.

[68]  Hans-Peter Seidel,et al.  Performance capture from sparse multi-view video , 2008, ACM Trans. Graph..

[69]  Theodore Kim,et al.  Analytic Eigensystems for Isotropic Distortion Energies , 2019, ACM Trans. Graph..

[70]  Jernej Barbic,et al.  Hand modeling and simulation using stabilized magnetic resonance imaging , 2019, ACM Trans. Graph..

[71]  Miguel A. Otaduy,et al.  Real-time pose and shape reconstruction of two interacting hands with a single depth camera , 2019, ACM Trans. Graph..

[72]  Andrew W. Fitzgibbon,et al.  Online generative model personalization for hand tracking , 2017, ACM Trans. Graph..

[73]  James C. Bezdek,et al.  Convergence of Alternating Optimization , 2003, Neural Parallel Sci. Comput..

[74]  Tae-Kyun Kim,et al.  Pushing the Envelope for RGB-Based Dense 3D Hand Pose Estimation via Neural Rendering , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[75]  Fernando De la Torre,et al.  Interactive region-based linear 3D face models , 2011, ACM Trans. Graph..

[76]  Markus H. Gross,et al.  Consistent penetration depth estimation for deformable collision response , 2004, VMV.

[77]  Eftychios Sifakis,et al.  Efficient elasticity for character skinning with contact and collisions , 2011, ACM Trans. Graph..

[78]  Olga Sorkine-Hornung,et al.  Interactive hand pose estimation using a stretch-sensing soft glove , 2019, ACM Trans. Graph..