On sampling controlled stochastic approximation

The authors examine a novel class of stochastic approximation procedures which are based on carefully controlling the number of observations or measurements taken before each computational iteration. This method, referred to as sampling controlled stochastic approximation, has advantages over standard stochastic approximation such as requiring less computation and the ability to handle bias in estimation. The authors address the growth rate required of the number of samples and prove a general convergence theorem for the proposed stochastic approximation method. In addition, they present applications to optimize and also derive a sampling controlled version of the classic Robbins-Munro algorithm. >

[1]  H. D. Block Estimates of Error for Two Modifications of the Robbins-Monro Stochastic Approximation Process , 1957 .

[2]  Kumpati S. Narendra,et al.  Adaptive, learning, and pattern recognition systems: Theory and applications , 1972 .

[3]  T. Stern,et al.  A Class of Decentralized Routing Algorithms Using Relaxation , 1977, IEEE Trans. Commun..

[4]  Robert G. Gallager,et al.  A Minimum Delay Routing Algorithm Using Distributed Computation , 1977, IEEE Trans. Commun..

[5]  Hisashi Kobayashi,et al.  Modeling and analysis , 1978 .

[6]  A. Cohen An Introduction to Probability Theory and Mathematical Statistics , 1979 .

[7]  A. P. Korostelev,et al.  Convergence of recursive stochastic algorithms under Gaussian disturbances , 1979 .

[8]  G. Pflug Stochastic Approximation Methods for Constrained and Unconstrained Systems - Kushner, HJ.; Clark, D.S. , 1980 .

[9]  S. Lakshmivarahan,et al.  Learning Algorithms Theory and Applications , 1981 .

[10]  Xi-Ren Cao,et al.  Perturbation analysis and optimization of queueing networks , 1983 .

[11]  Dimitri P. Bertsekas,et al.  Second Derivative Algorithms for Minimum Delay Distributed Routing in Networks , 1984, IEEE Trans. Commun..

[12]  D. Bertsekas,et al.  Distributed asynchronous optimal routing in data networks , 1984, The 23rd IEEE Conference on Decision and Control.

[13]  David G. Luenberger,et al.  Linear and nonlinear programming , 1984 .

[14]  R. Ellis,et al.  LARGE DEVIATIONS FOR A GENERAL-CLASS OF RANDOM VECTORS , 1984 .

[15]  P. Kumar,et al.  Adaptive control with the stochastic approximation algorithm: Geometry and convergence , 1985 .

[16]  Bruce E. Hajek Stochastic approximation methods for decentralized control of multiaccess communications , 1985, IEEE Trans. Inf. Theory.

[17]  Peter W. Glynn,et al.  Proceedings of Ihe 1986 Winter Simulation , 2022 .

[18]  Peter W. Glynn,et al.  Optimization of stochastic systems , 1986, WSC '86.

[19]  Fu Chang,et al.  An optimal adaptive routing algorithm , 1986 .

[20]  Anurag Kumar,et al.  Adaptive Load Balancing in a Multi-Processor System with a Central Job Scheduler , 1987, Computer Performance and Reliability.

[21]  Marc S. Meketon,et al.  Optimization in simulation: a survey of recent results , 1987, WSC '87.

[22]  Rajan Suri,et al.  Convergence of a Single Run Simulation Optimization Algorithm , 1988, 1988 American Control Conference.

[23]  Masaaki Kijima,et al.  THEORY AND ALGORITHMS OF , 1988 .

[24]  R. Suri,et al.  Perturbation analysis gives strongly consistent sensitivity estimates for the M/G/ 1 queue , 1988 .

[25]  Michael C. Fu,et al.  Using perturbation analysis for gradient estimation, averaging and updating in a stochastic approximation algorithm , 1988, WSC '88.

[26]  P. Heidelberger,et al.  Sensitivity analysis from sample paths using likelihoods , 1989 .

[27]  RAHUL SIMHA,et al.  Relative reward strength algorithms for learning automata , 1989, IEEE Trans. Syst. Man Cybern..

[28]  R. Suri,et al.  Perturbation analysis: the state of the art and research issues explained via the GI/G/1 queue , 1989, Proc. IEEE.

[29]  P. Dupuis,et al.  Stochastic approximation and large deviations: upper bounds and w.p.1 convergence , 1989 .

[30]  Alan Weiss,et al.  Sensitivity Analysis for Simulations via Likelihood Ratios , 1989, Oper. Res..

[31]  金 宗根 Load balancing in distributed computer systems , 1991 .