Paper information - Hierarchical Data-Driven Descent for Efficient Optimal Deformation Estimation


Real-world surfaces such as clothing, water, and the human body deform in complex ways. The observed image distortions are high-dimensional and non-linear, making it hard to estimate these deformations accurately. The recent data-driven descent approach applies Nearest Neighbor estimators iteratively on a particular distribution of training samples to obtain a globally optimal and dense deformation field between a template and a distorted image. In this work, we develop a hierarchical structure for the Nearest Neighbor estimators, each of which can have only a local image support. We demonstrate in both theory and practice that this algorithm has several advantages over the non-hierarchical version: it guarantees global optimality with significantly fewer training samples, is several orders of magnitude faster, provides a metric to decide whether a given image is "hard" (or "easy"), requiring more (or fewer) samples, and can handle more complex scenes that include both global motion and local deformation. The proposed algorithm successfully tracks a broad range of non-rigid scenes including water, clothing, and medical images, and compares favorably against several other deformation estimation and tracking approaches that do not provide optimality guarantees.
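To make the iterative Nearest Neighbor idea concrete, here is a minimal toy sketch of the non-hierarchical loop the abstract refers to. It is not the authors' implementation. It assumes a 1-D signal whose only deformation is a translation, and the names `template`, `render`, and `data_driven_descent`, the shrink factor, and the sample counts are all hypothetical choices made for illustration. The hierarchical, local-support estimators that the paper contributes are not modeled here.

```python
import numpy as np

def template(x):
    # Hypothetical 1-D "template image": two smooth bumps of different widths.
    return np.exp(-((x - 0.3) / 0.05) ** 2) + 0.5 * np.exp(-((x - 0.7) / 0.1) ** 2)

def render(shift, xs):
    # Synthesize the template deformed by a known parameter (here a translation).
    return template(xs - shift)

def data_driven_descent(observed, xs, radius=0.2, shrink=0.5, n_samples=9, n_iters=8):
    """Estimate the shift of `observed` relative to the template.

    Assumption: the true shift lies within `radius` of zero.
    """
    estimate = 0.0
    for _ in range(n_iters):
        # Training set: template rendered at shifts spread over the current radius.
        candidates = np.linspace(-radius, radius, n_samples)
        train = np.stack([render(c, xs) for c in candidates])
        # Undo the current estimate so only the residual shift remains.
        rectified = np.interp(xs + estimate, xs, observed)
        # Nearest Neighbor lookup: the closest training image gives the residual.
        nearest = np.argmin(np.sum((train - rectified) ** 2, axis=1))
        estimate += candidates[nearest]
        # Shrink the sampling radius for the next, finer iteration.
        radius *= shrink
    return estimate

xs = np.linspace(-0.5, 1.5, 801)
observed = render(0.137, xs)
print(data_driven_descent(observed, xs))
```

Each pass only needs enough samples to bring the residual inside the next, smaller radius, which is why a handful of samples per iteration suffices in this toy setting. A single-pass Nearest Neighbor search would need a dense grid over the whole range to reach the same precision.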

Yuandong Tian | Srinivasa G. Narasimhan
