Paper information - Model-free control of nonlinear stochastic systems with discrete-time measurements


Consider the problem of developing a controller for general (nonlinear and stochastic) systems where the equations governing the system are unknown. Using discrete-time measurements, this paper presents an approach for estimating a controller without building or assuming a model of the system. Such an approach has potential advantages in accommodating complex systems with possibly time-varying dynamics. The controller is constructed using a function approximator, such as a neural network or polynomial. The paper considers the simultaneous perturbation stochastic approximation (SPSA) algorithm, which requires only system measurements. A convergence result is established for stochastic approximation algorithms with time-varying objective functions and feedback. It is shown that this algorithm can be substantially more efficient than more standard stochastic approximation algorithms based on finite-difference gradient approximations.
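
To make the efficiency comparison in the abstract concrete, the following is a minimal Python sketch, not the paper's controller. It contrasts a simultaneous perturbation gradient estimate, which uses two loss measurements regardless of the number of parameters, with a two-sided finite-difference estimate, which uses two measurements per parameter. The function names, the static loss function, and the gain constants (a, c, A, alpha = 0.602, gamma = 0.101) are illustrative assumptions. In the paper's setting the parameters would be the weights of the function approximator and the loss would be measured from the system and vary with time.

```python
import numpy as np


def spsa_gradient(loss, theta, c, rng):
    """Simultaneous perturbation estimate: 2 loss measurements in total."""
    # Perturb every component at once with random +/-1 signs.
    delta = rng.choice([-1.0, 1.0], size=theta.shape)
    y_plus = loss(theta + c * delta)
    y_minus = loss(theta - c * delta)
    # Elementwise division gives one gradient component per parameter.
    return (y_plus - y_minus) / (2.0 * c * delta)


def fd_gradient(loss, theta, c):
    """Two-sided finite-difference estimate: 2 * len(theta) measurements."""
    grad = np.zeros_like(theta)
    for i in range(theta.size):
        e = np.zeros_like(theta)
        e[i] = c  # perturb one component at a time
        grad[i] = (loss(theta + e) - loss(theta - e)) / (2.0 * c)
    return grad


def spsa_minimize(loss, theta0, n_iter=2000, a=0.2, c=0.1, A=100.0,
                  alpha=0.602, gamma=0.101, seed=0):
    """Stochastic approximation recursion driven by the SPSA estimate.

    The decaying gain sequences and their constants are assumed values
    chosen for illustration, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    theta = np.asarray(theta0, dtype=float).copy()
    for k in range(n_iter):
        a_k = a / (k + 1 + A) ** alpha  # step-size gain
        c_k = c / (k + 1) ** gamma      # perturbation size
        theta = theta - a_k * spsa_gradient(loss, theta, c_k, rng)
    return theta
```

With p parameters, each iteration of the recursion above costs 2 measurements, whereas swapping in `fd_gradient` would cost 2p, which is where the per-iteration saving comes from when p is large (for example, the weights of a neural network).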

J. Spall | John A. Cristion
