Learning 3D Articulation and Deformation using 2D Images

With the rise of Augmented Reality, Virtual Reality and 3D printing, methods for acquiring 3D models from the real world are more important then ever. One approach to generate 3D models is by modifying an existing template 3D mesh to fit the pose and shape of similar objects in images. To model the pose of an highly articulated and deformable object, it is essential to understand how an object class can articulate and deform. In this paper we propose to learn a class model of articulation and deformation from a set of annotated Internet images. To do so, we incorporate the idea of local stiffness, which specifies the amount of distortion allowed for a local region. Our system jointly learns the stiffness as it deforms a template 3D mesh to fit the pose of the objects in images. We show that this seemingly complex task can be solved with a sequence of convex optimization programs. We demonstrate our approach on two highly articulated and deformable animals, cats and horses. Our approach obtains significantly more realistic deformations compared to other related approaches.

[1]  Lourdes Agapito,et al.  Balloon Shapes: Reconstructing and Deforming Objects with Volume from Images , 2013, 2013 International Conference on 3D Vision.

[2]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Lourdes Agapito,et al.  Automated articulated structure and 3D shape recovery from point correspondences , 2011, 2011 International Conference on Computer Vision.

[4]  Hang Si,et al.  TetGen, a Delaunay-Based Quality Tetrahedral Mesh Generator , 2015, ACM Trans. Math. Softw..

[5]  Kevin G. Der,et al.  Inverse kinematics for reduced deformable models , 2006, SIGGRAPH 2006.

[6]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[7]  Hans-Peter Seidel,et al.  Performance capture from sparse multi-view video , 2008, ACM Trans. Graph..

[8]  Markus H. Gross,et al.  PriMo: coupled prisms for intuitive surface modeling , 2006, SGP '06.

[9]  Marc Alexa,et al.  As-rigid-as-possible shape interpolation , 2000, SIGGRAPH.

[10]  Hans-Peter Seidel,et al.  A Statistical Model of Human Pose and Body Shape , 2009, Comput. Graph. Forum.

[11]  John C. Platt,et al.  Elastically deformable models , 1987, SIGGRAPH.

[12]  Jonathan T. Barron,et al.  Boundary Cues for 3D Object Shape Recovery , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Michael J. Black,et al.  Coregistration: Simultaneous Alignment and Modeling of Articulated 3D Shape , 2012, ECCV.

[14]  M. Pauly,et al.  Embedded deformation for shape manipulation , 2007, SIGGRAPH 2007.

[15]  Jovan Popović,et al.  Mesh-based inverse kinematics , 2005, SIGGRAPH 2005.

[16]  Alexander M. Bronstein,et al.  Numerical Geometry of Non-Rigid Shapes , 2009, Monographs in Computer Science.

[17]  Olga Sorkine-Hornung,et al.  On Linear Variational Surface Deformation Methods , 2008, IEEE Transactions on Visualization and Computer Graphics.

[18]  Adrien Bartoli,et al.  Monocular Template-based Reconstruction of Inextensible Surfaces , 2011, International Journal of Computer Vision.

[19]  Alec Jacobson,et al.  Skinning: real-time shape deformation , 2014, SIGGRAPH ASIA Courses.

[20]  Henning Biermann,et al.  Recovering non-rigid 3D shape from image streams , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[21]  Zhigang Deng,et al.  Robust and accurate skeletal rigging from mesh sequences , 2014, ACM Trans. Graph..

[22]  Antonio Torralba,et al.  Parsing IKEA Objects: Fine Pose Estimation , 2013, 2013 IEEE International Conference on Computer Vision.

[23]  Marc Alexa,et al.  As-rigid-as-possible surface modeling , 2007, Symposium on Geometry Processing.

[24]  Alla Sheffer,et al.  Material-Aware Mesh Deformations , 2006, SMI.

[25]  Ronen Basri,et al.  Large-scale bounded distortion mappings , 2015, ACM Trans. Graph..

[26]  Larry S. Davis,et al.  Structure of Applicable Surfaces from Single Views , 2004, ECCV.

[27]  Michael J. Black,et al.  Pose-conditioned joint angle limits for 3D human pose reconstruction , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Eitan Grinspun,et al.  Example-based elastic materials , 2011, ACM Trans. Graph..

[29]  Johan Löfberg,et al.  YALMIP : a toolbox for modeling and optimization in MATLAB , 2004 .

[30]  John M. Snyder,et al.  Large mesh deformation using the volumetric graph Laplacian , 2005, SIGGRAPH '05.

[31]  Knud D. Andersen,et al.  The Mosek Interior Point Optimizer for Linear Programming: An Implementation of the Homogeneous Algorithm , 2000 .

[32]  Sebastian Thrun,et al.  Recovering Articulated Object Models from 3D Range Data , 2004, UAI.

[33]  Yaron Lipman,et al.  Bounded distortion mapping spaces for triangular meshes , 2012, ACM Trans. Graph..

[34]  Andrew W. Fitzgibbon,et al.  Finding nemo: Deformable object class modelling using curve matching , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[35]  Alexei A. Efros,et al.  Seeing 3D Chairs: Exemplar Part-Based 2D-3D Alignment Using a Large Dataset of CAD Models , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Jitendra Malik,et al.  Virtual view networks for object reconstruction , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  B. D. Adelstein,et al.  Calculus of Nonrigid Surfaces for Geometry and Texture Manipulation , 2007 .

[38]  Jovan Popović,et al.  Deformation transfer for triangle meshes , 2004, SIGGRAPH 2004.

[39]  Ruigang Yang,et al.  Real-Time Simultaneous Pose and Shape Estimation for Articulated Objects Using a Single Depth Camera , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Alla Sheffer,et al.  Modeling from contour drawings , 2009, SBIM '09.

[41]  Hongdong Li,et al.  A simple prior-free method for non-rigid structure-from-motion factorization , 2012, CVPR.

[42]  Scott Schaefer,et al.  Example-based skeleton extraction , 2007, Symposium on Geometry Processing.

[43]  Manolis I. A. Lourakis,et al.  SBA: A software package for generic sparse bundle adjustment , 2009, TOMS.

[44]  Christian Rössl,et al.  Differential coordinates for interactive mesh editing , 2004, Proceedings Shape Modeling Applications, 2004..

[45]  Jitendra Malik,et al.  Category-specific object reconstruction from a single image , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Jitendra Malik,et al.  Viewpoints and keypoints , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Yu Chen,et al.  Inferring 3D Shapes and Deformations from Single Views , 2010, ECCV.

[48]  L B Jeffcott,et al.  Natural rigaidity of the horse's backbone. , 1980, Equine veterinary journal.

[49]  Ligang Liu,et al.  Image driven shape deformation using styles , 2009, Journal of Zhejiang University SCIENCE C.

[50]  Marc Pollefeys,et al.  A Factorization-Based Approach for Articulated Nonrigid Shape, Motion and Kinematic Chain Recovery From Video , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[51]  Ligang Liu,et al.  Photo-inspired model-driven 3D object modeling , 2011, ACM Trans. Graph..

[52]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[53]  Varun Ramakrishna,et al.  User-Specific Hand Modeling from Monocular Depth Sequences , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[54]  Hans-Peter Seidel,et al.  Automatic Conversion of Mesh Animations into Skeleton‐based Animations , 2008, Comput. Graph. Forum.

[55]  Antonio Torralba,et al.  FPM: Fine Pose Parts-Based Model with 3D CAD Models , 2014, ECCV.

[56]  Daniel Pizarro-Perez,et al.  Shape-from-Template , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  Sebastian Thrun,et al.  SCAPE: shape completion and animation of people , 2005, SIGGRAPH 2005.

[58]  David Levin,et al.  The approximation power of moving least-squares , 1998, Math. Comput..

[59]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[60]  Thomas Vetter,et al.  A morphable model for the synthesis of 3D faces , 1999, SIGGRAPH.

[61]  Andrew W. Fitzgibbon,et al.  What Shape Are Dolphins? Building 3D Morphable Models from 2D Images , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[63]  Luc Van Gool,et al.  Motion Capture of Hands in Action Using Discriminative Salient Points , 2012, ECCV.

[64]  Michael J. Black,et al.  Estimating human shape and pose from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[65]  Ronen Basri,et al.  Controlling singular values with semidefinite programming , 2014, ACM Trans. Graph..

[66]  Yaser Sheikh,et al.  3D object manipulation in a single photograph using stock 3D models , 2014, ACM Trans. Graph..