Anisotropic Diffusion in Consensus-Based Optimization on the Sphere

In this paper we are concerned with the global minimization of a possibly non-smooth and non-convex objective function constrained on the unit hypersphere by means of a multi-agent derivative-free method. The proposed algorithm falls into the class of the recently introduced Consensus-Based Optimization. In fact, agents move on the sphere driven by a drift towards an instantaneous consensus point, which is computed as a convex combination of agent locations, weighted by the cost function according to Laplace’s principle, and it represents an approximation to a global minimizer. The dynamics is further perturbed by an anisotropic random vector field to favor exploration. The main results of this paper are about the proof of convergence of the numerical scheme to global minimizers provided conditions of well-preparation of the initial datum. The proof of convergence combines a mean-field limit result with a novel asymptotic analysis, and classical convergence results of numerical methods for SDE. The main innovation with respect to previous work is the introduction of an anisotropic stochastic term, which allows us to ensure the independence of the parameters of the algorithm from the dimension and to scale the method to work in very high dimension. We present several numerical experiments, which show that the algorithm proposed in the present paper is extremely versatile and outperforms previous formulations with isotropic stochastic noise.

[1]  Marie-Therese Wolfram,et al.  Consensus-based global optimization with personal best. , 2020, Mathematical biosciences and engineering : MBE.

[2]  Levent Tunçel,et al.  Optimization algorithms on matrix manifolds , 2009, Math. Comput..

[3]  John L. Nazareth,et al.  Introduction to derivative-free optimization , 2010, Math. Comput..

[4]  Shi Jin,et al.  A consensus-based global optimization method for high dimensional machine learning problems , 2019 .

[5]  I. Couzin,et al.  Collective memory and spatial sorting in animal groups. , 2002, Journal of theoretical biology.

[6]  M. Fornasier,et al.  Consensus-based optimization on hypersurfaces: Well-posedness and mean-field limit , 2020, Mathematical Models and Methods in Applied Sciences.

[7]  Jos'e A. Carrillo,et al.  An analytical framework for a consensus-based global optimization method , 2016, 1602.00220.

[8]  Lei Li,et al.  Random Batch Methods (RBM) for interacting particle systems , 2018, J. Comput. Phys..

[9]  Massimo Fornasier,et al.  Stable Recovery of Entangled Weights: Towards Robust Identification of Deep Neural Networks from Minimal Samples , 2021, Applied and Computational Harmonic Analysis.

[10]  B Y Gu,et al.  Gerchberg-Saxton and Yang-Gu algorithms for phase retrieval in a nonunitary transform system: a comparison. , 1994, Applied optics.

[11]  Marco Dorigo,et al.  Ant colony optimization theory: A survey , 2005, Theor. Comput. Sci..

[12]  Gilad Lerman,et al.  A Well-Tempered Landscape for Non-convex Robust Subspace Recovery , 2017, J. Mach. Learn. Res..

[13]  Jorge Nocedal,et al.  Wedge trust region methods for derivative free optimization , 2002, Math. Program..

[14]  E. Platen An introduction to numerical methods for stochastic differential equations , 1999, Acta Numerica.

[15]  Flagot Yohannes Derivative free optimization methods , 2012 .

[16]  M. J. D. Powell,et al.  UOBYQA: unconstrained optimization by quadratic approximation , 2002, Math. Program..

[17]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[18]  D. Stroock,et al.  Simulated annealing via Sobolev inequalities , 1988 .

[19]  Xiaodong Li,et al.  Phase Retrieval via Wirtinger Flow: Theory and Algorithms , 2014, IEEE Transactions on Information Theory.

[20]  Vicsek,et al.  Novel type of phase transition in a system of self-driven particles. , 1995, Physical review letters.

[21]  Seung-Yeal Ha,et al.  A Stochastic Consensus Method for Nonconvex Optimization on the Stiefel Manifold , 2020, 2020 59th IEEE Conference on Decision and Control (CDC).

[22]  Gilad Lerman,et al.  Fast, Robust and Non-convex Subspace Recovery , 2014, 1406.6145.

[23]  Pierre Degond,et al.  Continuum limit of self-driven particles with orientation interaction , 2007, 0710.0293.

[24]  Zaikun Zhang,et al.  Sobolev seminorm of quadratic functions with applications to derivative-free optimization , 2011, Math. Program..

[25]  Pierre-Antoine Absil,et al.  A Collection of Nonsmooth Riemannian Optimization Problems , 2019, Nonsmooth Optimization and Its Applications.

[26]  Xiaojun Chen,et al.  Spherical Designs and Nonconvex Minimization for Recovery of Sparse Signals on the Sphere , 2018, SIAM J. Imaging Sci..

[27]  Yonina C. Eldar,et al.  Phase Retrieval via Matrix Completion , 2011, SIAM Rev..

[28]  R. Fetecau,et al.  Propagation of chaos for the Keller–Segel equation over bounded domains , 2018, Journal of Differential Equations.

[29]  Lorenzo Pareschi,et al.  From particle swarm optimization to consensus based optimization: stochastic modeling and mean-field limit , 2020, Mathematical Models and Methods in Applied Sciences.

[30]  Jian-Guo Liu,et al.  Dynamics in a Kinetic Model of Oriented Particles with Phase Transition , 2011, SIAM J. Math. Anal..

[31]  Doheon Kim,et al.  Convergence and error estimates for time-discrete consensus-based optimization algorithms , 2020, Numerische Mathematik.

[32]  Joel A. Tropp,et al.  Robust Computation of Linear Models by Convex Relaxation , 2012, Foundations of Computational Mathematics.

[33]  J R Fienup,et al.  Phase retrieval algorithms: a comparison. , 1982, Applied optics.

[34]  Joao M. Pereira,et al.  Subspace power method for symmetric tensor decomposition and generalized PCA , 2019, ArXiv.

[35]  R. Pinnau,et al.  A consensus-based model for global optimization and its mean-field limit , 2016, 1604.05648.

[36]  Katya Scheinberg,et al.  Computation of sparse low degree interpolating polynomials and their application to derivative-free optimization , 2012, Mathematical Programming.

[37]  Lorenzo Pareschi,et al.  Binary Interaction Algorithms for the Simulation of Flocking and Swarming Dynamics , 2012, Multiscale Model. Simul..

[38]  G. Marsaglia Choosing a Point from the Surface of a Sphere , 1972 .

[39]  A. Sznitman Topics in propagation of chaos , 1991 .

[40]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[41]  Andrew M. Stuart,et al.  Strong Convergence of Euler-Type Methods for Nonlinear Stochastic Differential Equations , 2002, SIAM J. Numer. Anal..

[42]  Shai Dekel,et al.  Super-Resolution on the Sphere Using Convex Optimization , 2014, IEEE Transactions on Signal Processing.

[43]  R. Durrett Stochastic Calculus: A Practical Introduction , 1996 .

[44]  Mervin E. Muller,et al.  A note on a method for generating points uniformly on n-dimensional spheres , 1959, CACM.

[45]  Yuxin Chen,et al.  Gradient descent with random initialization: fast global convergence for nonconvex phase retrieval , 2018, Mathematical Programming.

[46]  Xin-She Yang,et al.  A literature survey of benchmark functions for global optimisation problems , 2013, Int. J. Math. Model. Numer. Optimisation.

[47]  P. Bassanini,et al.  Elliptic Partial Differential Equations of Second Order , 1997 .

[48]  R. Gerchberg A practical algorithm for the determination of phase from image and diffraction plane pictures , 1972 .

[49]  L. Cooper,et al.  Optimal location on a sphere , 1980 .

[50]  Liqun Qi,et al.  Eigenvalues of a real supersymmetric tensor , 2005, J. Symb. Comput..

[51]  Wilma A. Bainbridge,et al.  The intrinsic memorability of face photographs. , 2013, Journal of experimental psychology. General.