Accelerated Optimization in the PDE Framework: Formulations for the Manifold of Diffeomorphisms

We consider the problem of optimizing cost functionals on the infinite-dimensional manifold of diffeomorphisms. We present a new class of optimization methods, valid for any optimization problem set up on the space of diffeomorphisms, by generalizing Nesterov accelerated optimization to this manifold. While our framework is general for infinite-dimensional manifolds, we specifically treat the case of diffeomorphisms, motivated by optical flow problems in computer vision. We build on a recent variational approach to a general class of accelerated optimization methods by Wibisono, Wilson and Jordan, which applies in finite dimensions, and generalize that approach to infinite-dimensional manifolds. We derive the surprisingly simple continuum evolution equations, which are partial differential equations, for accelerated gradient descent, and relate them to simple mechanical principles from fluid mechanics. Our approach has natural connections to the optimal mass transport problem, because it can be viewed as the evolution of an infinite number of particles endowed with mass (represented by a mass density) moving in an energy landscape. The mass evolves with the optimization variable and endows the particles with dynamics. This differs from the finite-dimensional case, where only a single particle moves and hence the dynamics do not depend on the mass. We derive the theory, compute the PDEs for accelerated optimization, and illustrate the behavior of these new accelerated optimization schemes.
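For orientation, the finite-dimensional starting point that the abstract generalizes can be sketched in a few lines. The example below is only an illustration under stated assumptions, not the paper's PDE scheme: it compares plain gradient descent with a standard form of Nesterov's accelerated gradient method on a small ill-conditioned quadratic. The test function, step size, iteration count, and all names (`gradient_descent`, `nesterov`, `A`, `f_gd`, `f_agd`) are hypothetical choices made for this sketch.

```python
import numpy as np

# Hypothetical test problem: f(x) = 0.5 * x^T A x with an ill-conditioned A.
A = np.diag([1e-3, 1.0])

def f(x):
    return 0.5 * x @ A @ x

def grad(x):
    return A @ x

def gradient_descent(grad, x0, step, iters):
    """Plain gradient descent: x_{k} = x_{k-1} - step * grad(x_{k-1})."""
    x = x0.copy()
    for _ in range(iters):
        x = x - step * grad(x)
    return x

def nesterov(grad, x0, step, iters):
    """Nesterov's accelerated gradient with momentum weight (k-1)/(k+2).

    x_k = y_{k-1} - step * grad(y_{k-1})
    y_k = x_k + (k-1)/(k+2) * (x_k - x_{k-1})
    """
    x = x0.copy()
    y = x0.copy()
    for k in range(1, iters + 1):
        x_next = y - step * grad(y)                       # gradient step at the look-ahead point
        y = x_next + (k - 1) / (k + 2) * (x_next - x)     # extrapolate using the previous iterate
        x = x_next
    return x

x0 = np.array([1.0, 1.0])
step = 1.0      # 1/L, where L = 1 is the largest eigenvalue of A
iters = 200

f_gd = f(gradient_descent(grad, x0, step, iters))
f_agd = f(nesterov(grad, x0, step, iters))
print(f"gradient descent:     f = {f_gd:.3e}")
print(f"accelerated gradient: f = {f_agd:.3e}")
```

With the same step size and iteration budget, the accelerated iteration reaches a lower cost on this problem, which is the behavior the PDE formulations aim to carry over to the manifold of diffeomorphisms.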
