论文信息 - Smoothed Functional Algorithms for Stochastic Optimization Using q-Gaussian Distributions

Smoothed Functional Algorithms for Stochastic Optimization Using q-Gaussian Distributions

Smoothed functional (SF) schemes for gradient estimation are known to be efficient in stochastic optimization algorithms, especially when the objective is to improve the performance of a stochastic system. However, the performance of these methods depends on several parameters, such as the choice of a suitable smoothing kernel. Different kernels have been studied in the literature, which include Gaussian, Cauchy, and uniform distributions, among others. This article studies a new class of kernels based on the q-Gaussian distribution, which has gained popularity in statistical physics over the last decade. Though the importance of this family of distributions is attributed to its ability to generalize the Gaussian distribution, we observe that this class encompasses almost all existing smoothing kernels. This motivates us to study SF schemes for gradient estimation using the q-Gaussian distribution. Using the derived gradient estimates, we propose two-timescale algorithms for optimization of a stochastic objective function in a constrained setting with a projected gradient search approach. We prove the convergence of our algorithms to the set of stationary points of an associated ODE. We also demonstrate their performance numerically through simulations on a queuing model.

[1] Claude E. Shannon,et al. A Mathematical Theory of Communications , 1948 .

[2] J. Kiefer,et al. Stochastic Estimation of the Maximum of a Regression Function , 1952 .

[3] I. M. Pyshik,et al. Table of integrals, series, and products , 1965 .

[4] Jan Havrda,et al. Quantification method of classification processes. Concept of structural a-entropy , 1967, Kybernetika.

[5] P. Schweitzer. Perturbation theory and finite Markov chains , 1968 .

[6] Zoltán Daróczy,et al. Generalized Information Functions , 1970, Inf. Control..

[7] Harold J. Kushner,et al. wchastic. approximation methods for constrained and unconstrained systems , 1978 .

[8] V. Nollau. Kushner, H. J./Clark, D. S., Stochastic Approximation Methods for Constrained and Unconstrained Systems. (Applied Mathematical Sciences 26). Berlin‐Heidelberg‐New York, Springer‐Verlag 1978. X, 261 S., 4 Abb., DM 26,40. US $ 13.20 , 1980 .

[9] Reuven Y. Rubinstein,et al. Simulation and the Monte Carlo method , 1981, Wiley series in probability and mathematical statistics.

[10] Reuven Y. Rubinstein,et al. Simulation and the Monte Carlo Method , 1981 .

[11] D. Ruppert. A Newton-Raphson Version of the Multivariate Robbins-Monro Procedure , 1985 .

[12] Rajan Suri,et al. Infinitesimal perturbation analysis for general discrete event systems , 1987, JACM.

[13] R. Rubinstein,et al. Smoothed functionals and constrained stochastic approximation , 1988 .

[14] C. Tsallis. Possible generalization of Boltzmann-Gibbs statistics , 1988 .

[15] G. Rappl. On Linear Convergence of a Class of Random Search Algorithms , 1989 .

[16] Morris W. Hirsch,et al. Convergent activation dynamics in continuous time networks , 1989, Neural Networks.

[17] M. A. Styblinski,et al. Experiments in nonconvex optimization: Stochastic approximation with function smoothing and simulated annealing , 1990, Neural Networks.

[18] David Williams,et al. Probability with Martingales , 1991, Cambridge mathematical textbooks.

[19] H. Kushner,et al. Estimation of the derivative of a stationary measure with respect to a control parameter , 1992 .

[20] Reuven Y. Rubinstein,et al. Nondifferentiable optimization via smooth approximation: General analytical approach , 1992, Ann. Oper. Res..

[21] J. Spall. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation , 1992 .

[22] P. Glynn,et al. Stochastic Optimization by Simulation: Convergence Proofs for the GI/G/1 Queue in Steady-State , 1994 .

[23] C. Tsallis. Some comments on Boltzmann-Gibbs statistical mechanics , 1995 .

[24] C. Tsallis,et al. The role of constraints within generalized nonextensive statistics , 1998 .

[25] Odile Brandière,et al. Some Pathological Traps for Stochastic Approximation , 1998 .

[26] S. Bhatnagar,et al. A two timescale stochastic approximation scheme for simulation-based parametric optimization , 1998 .

[27] C. Tsallis,et al. Nonextensive foundation of Lévy distributions. , 1999, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[28] Albert,et al. Emergence of scaling in random networks , 1999, Science.

[29] James C. Spall,et al. Adaptive stochastic approximation by the simultaneous perturbation method , 2000, IEEE Trans. Autom. Control..

[30] Michael C. Fu,et al. Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences , 2003, TOMC.

[31] Jose A. Costa,et al. On Solutions to Multivariate Maximum α-Entropy Problems , 2003 .

[32] Vivek S. Borkar,et al. Multiscale Chaotic SPSA and Smoothed Functional Algorithms for Simulation Optimization , 2003, Simul..

[33] S. Abe,et al. Itineration of the Internet over nonequilibrium stationary states in Tsallis statistics. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34] Hiroki Suyari. Generalization of Shannon-Khinchin axioms to nonextensive systems and the uniqueness theorem for the nonextensive entropy , 2004, IEEE Transactions on Information Theory.

[35] Funabashi,et al. Scale-free statistics of time interval between successive earthquakes , 2004, cond-mat/0410123.

[36] Hiroki Suyari,et al. Law of error in Tsallis statistics , 2005, IEEE Transactions on Information Theory.

[37] Michael C. Fu,et al. Chapter 19 Gradient Estimation , 2006, Simulation.

[38] A. Plastino,et al. Poincaré's observation and the origin of Tsallis generalized canonical distributions , 2005, cond-mat/0509689.

[39] Geir Storvik,et al. Simulation and Monte Carlo Methods , 2006 .

[40] Kenric P. Nelson,et al. Generalized Box–MÜller Method for Generating $q$-Gaussian Random Deviates , 2006, IEEE Transactions on Information Theory.

[41] H. Robbins. A Stochastic Approximation Method , 1951 .

[42] M. N. Murty,et al. On measure-theoretic aspects of nonextensive entropy functionals and corresponding maximum entropy prescriptions , 2007 .

[43] Shalabh Bhatnagar,et al. Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization , 2007, TOMC.

[44] C. Tsallis,et al. Multivariate Generalizations of the q--Central Limit Theorem , 2007, cond-mat/0703533.

[45] A. Plastino,et al. Central limit theorem and deformed exponentials , 2007 .

[46] V. Borkar. Stochastic Approximation: A Dynamical Systems Viewpoint , 2008 .

[47] A. Sato. q-Gaussian distributions and multiplicative stochastic processes for analysis of multiple financial time series , 2010 .

[48] Dirk P. Kroese,et al. Handbook of Monte Carlo Methods , 2011 .

[49] Evgueni A. Haroutunian,et al. Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[50] Shalabh Bhatnagar,et al. q-Gaussian based Smoothed Functional algorithms for stochastic optimization , 2012, 2012 IEEE International Symposium on Information Theory Proceedings.

[51] G. Crooks. On Measures of Entropy and Information , 2015 .

[52] Kerstin Vogler,et al. Table Of Integrals Series And Products , 2016 .