Paper information - Continuous action set learning automata for stochastic optimization

Continuous action set learning automata for stochastic optimization

The problem of optimization with noisy measurements is common in many areas of engineering. The only available information is the noise-corrupted value of the objective function at any chosen point in the parameter space. One well-known method for solving this problem is the stochastic approximation procedure. In this paper we consider an adaptive random search procedure based on the reinforcement-learning paradigm. The learning model presented here generalizes the traditional model of a learning automaton [Narendra and Thathachar, Learning Automata: An Introduction, Prentice Hall, Englewood Cliffs, 1989]. This procedure requires fewer function evaluations at each step than stochastic approximation. The convergence properties of the algorithm are investigated theoretically. Simulation results are presented to show the efficacy of the learning method.
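The setting described in the abstract can be illustrated with a minimal Python sketch. It is an assumption-laden illustration, not the paper's algorithm as stated here: the abstract gives no update equations, so the Gaussian action distribution, the update rule, the test function `noisy_objective`, and all constants (`step_size`, `sigma_floor`, `penalty`) are hypothetical choices. The sketch keeps a mean and a spread, samples one action per step, and compares the noisy reward at the sampled action with the noisy reward at the current mean, so each step uses two function evaluations.

```python
import random


def noisy_objective(x, rng, noise_std=0.1):
    """Hypothetical test function: maximum at x = 2, observed with Gaussian noise."""
    return -(x - 2.0) ** 2 + rng.gauss(0.0, noise_std)


def gaussian_automaton_search(objective, steps=5000, mu=0.0, sigma=1.0,
                              step_size=0.01, sigma_floor=0.1, penalty=5.0,
                              seed=0):
    """Adaptive random search with a Gaussian action distribution (illustrative).

    All parameter names and default values are assumptions made for this sketch.
    Returns the final mean and spread parameter.
    """
    rng = random.Random(seed)
    for _ in range(steps):
        # Never sample with a spread below the floor, so exploration continues.
        spread = max(sigma, sigma_floor)
        x = rng.gauss(mu, spread)
        z = (x - mu) / spread  # normalized deviation of the sampled action

        # Two noisy evaluations per step: at the sampled action and at the mean.
        reward_x = objective(x, rng)
        reward_mu = objective(mu, rng)
        signal = (reward_x - reward_mu) / spread

        # Move the mean toward actions that did better than the mean itself.
        mu += step_size * signal * z
        # Adapt the spread, with a penalty term pulling it toward the floor.
        sigma += step_size * signal * (z * z - 1.0)
        sigma -= step_size * penalty * (sigma - sigma_floor)
    return mu, sigma


if __name__ == "__main__":
    final_mu, final_sigma = gaussian_automaton_search(noisy_objective)
    print(final_mu, final_sigma)
```

In this sketch the number of evaluations per step does not depend on the dimension of the parameter space, whereas a finite-difference gradient estimate needs evaluations along every coordinate. That contrast is one plausible reading of the abstract's claim about function evaluations.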

P. S. Sastry | G. Santharam | M. A. L. Thathachar

[1] J. Spall. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation, 1992.

[2] Roger J.-B. Wets, et al. Minimization by Random Search Techniques, 1981, Math. Oper. Res.

[3] R. Ge, et al. The globally convexized filled functions for global optimization, 1990.

[4] Mandayam A. L. Thathachar, et al. Learning Optimal Discriminant Functions through a Cooperative Game of Automata, 1987, IEEE Transactions on Systems, Man, and Cybernetics.

[5] R. L. Anderson, et al. Recent Advances in Finding Best Operating Conditions, 1953.

[6] J. Blum. Multidimensional Stochastic Approximation Methods, 1954.

[7] Vijaykumar Gullapalli, et al. A stochastic reinforcement learning algorithm for learning real-valued functions, 1990, Neural Networks.