On the Convergence of Gradient Descent for Finding the Riemannian Center of Mass

We study the problem of finding the global Riemannian center of mass of a set of data points on a Riemannian manifold. Specifically, we investigate the convergence of constant step-size gradient descent algorithms for solving this problem. The challenge is that the underlying cost function is often neither globally differentiable nor convex, yet one would still like guaranteed convergence to the global minimizer. After some necessary preparations, we state a conjecture which we argue is the best (in a sense described) convergence condition one can hope for. The conjecture specifies conditions on the spread of the data points, the step-size range, and the location of the initial condition (i.e., the region of convergence) of the algorithm. These conditions depend on the topology and the curvature of the manifold and can be conveniently described in terms of the injectivity radius and the sectional curvatures of the manifold. For manifolds of constant nonnegative curvature (e.g., the sphere and the rotation group in $\mathbb{R}^{3}$), we show that the conjecture holds true; we do this by proving and using a comparison theorem which seems to be of a different nature from the standard comparison theorems in Riemannian geometry. For manifolds of arbitrary curvature, we prove convergence results which are weaker than the conjectured one but still superior to the available results. We also briefly study the effect of the configuration of the data points on the speed of convergence.
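
As an illustration of the kind of algorithm studied here, the following is a minimal sketch of constant step-size gradient descent for the center of mass on the unit sphere, using the cost $f(x) = \frac{1}{2N}\sum_{i} d^{2}(x, x_{i})$, whose Riemannian gradient is $-\frac{1}{N}\sum_{i} \log_{x}(x_{i})$. The function names (`sphere_log`, `sphere_exp`, `riemannian_mean`), the step size of 1, the tolerances, and the sample data are assumptions made for this example and are not taken from the paper. The sketch does not check any condition on the spread of the data or on the initial point; it simply assumes the points lie in a small cap, where convergence is unproblematic.

```python
import numpy as np

def sphere_log(x, y):
    """Log map on the unit sphere: tangent vector at x pointing toward y,
    with length equal to the geodesic distance between x and y."""
    c = np.clip(np.dot(x, y), -1.0, 1.0)
    theta = np.arccos(c)
    if theta < 1e-12:
        return np.zeros_like(x)
    v = y - c * x  # component of y orthogonal to x
    return theta * v / np.linalg.norm(v)

def sphere_exp(x, v):
    """Exp map on the unit sphere: follow the geodesic from x along v."""
    n = np.linalg.norm(v)
    if n < 1e-12:
        return x
    return np.cos(n) * x + np.sin(n) * v / n

def riemannian_mean(points, x0, step=1.0, iters=100, tol=1e-12):
    """Constant step-size gradient descent for f(x) = (1/2N) sum d^2(x, x_i).
    The step size and stopping rule are illustrative choices only."""
    x = x0 / np.linalg.norm(x0)
    for _ in range(iters):
        grad = -np.mean([sphere_log(x, p) for p in points], axis=0)
        if np.linalg.norm(grad) < tol:
            break
        x = sphere_exp(x, -step * grad)
        x = x / np.linalg.norm(x)  # guard against numerical drift off the sphere
    return x

# Hypothetical data: four points placed symmetrically around the north pole,
# each at geodesic distance 0.3 from it.
angles = [0.0, np.pi / 2, np.pi, 3 * np.pi / 2]
points = [np.array([np.sin(0.3) * np.cos(a), np.sin(0.3) * np.sin(a), np.cos(0.3)])
          for a in angles]
mean = riemannian_mean(points, x0=points[0])
```

By symmetry, the center of mass of this configuration is the north pole, so `mean` should be close to `(0, 0, 1)`. For widely spread data or a poorly chosen initial point, the same iteration may fail to reach the global minimizer, which is the situation the convergence conditions discussed above are meant to rule out.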
