Information bounds, certainty equivalence and learning in asymptotically efficient adaptive control of time-invariant stochastic systems

[1] T. Lai, et al. Parallel recursive algorithms in asymptotically efficient adaptive control of linear stochastic systems, 1991.

[2] T. Lai, et al. Optimal stopping and dynamic allocation, 1987, Advances in Applied Probability.

[3] J. Walrand, et al. Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards, 1987.

[4] T. Lai. Adaptive treatment allocation and the multi-armed bandit problem, 1987.

[5] C. Z. Wei. Multivariate Adaptive Stochastic Approximation, 1987.

[6] T. Lai, et al. Asymptotically efficient self-tuning regulators, 1987.

[7] T. Lai, et al. Extended least squares and their applications to adaptive control and prediction in linear systems, 1986.

[8] T. Lai, et al. On the concept of excitation in least squares identification and adaptive control, 1986.

[9] T. Lai. Asymptotically efficient adaptive control in stochastic regression models, 1986.

[10] Patchigolla Kiran Kumar, et al. A Survey of Some Results in Stochastic Adaptive Control, 1985.

[11] H. Robbins, et al. Asymptotically efficient adaptive allocation rules, 1985.

[12] Lennart Ljung, et al. Theory and Practice of Recursive Identification, 1983.

[13] Karl Johan Åström, et al. Theory and applications of adaptive control - A survey, 1983, Autom.

[14] P. Caines, et al. Adaptive control with recursive identification for stochastic linear systems: Multivariable case, 1982, 1982 21st IEEE Conference on Decision and Control.

[15] H. Robbins, et al. Iterated least squares in multiperiod control, 1982.

[16] T. Lai, et al. Least Squares Estimates in Stochastic Regression Models with Applications to Identification and Control of Dynamic Systems, 1982.

[17] V. Solo. The convergence of AML, 1979.

[18] H. Robbins, et al. Adaptive Design and Stochastic Approximation, 1979.

[19] T. W. Anderson, et al. Some Experimental Results on the Statistical Properties of Least Squares Estimates in Control Problems, 1976.

[20] J. Gani, et al. Progress in statistics, 1975.

[21] Arnold Zellner, et al. An Introduction to Bayesian Inference in Econometrics, 1974.

[22] Björn Wittenmark, et al. On Self Tuning Regulators, 1973.

[23] E. Prescott. The Multi-Period Control Problem Under Uncertainty, 1972.

[24] D. Berry. A Bernoulli Two-armed Bandit, 1972.

[25] W. V. Zwet, et al. Some Remarks on the Two-Armed Bandit, 1970.

[26] Herman Chernoff, et al. A Bayes Sequential Sampling Inspection Plan, 1965.

[27] Dorian Feldman. Contributions to the "Two-Armed Bandit" Problem, 1962.

[28] H. Robbins. Some aspects of the sequential design of experiments, 1952.

[29] Tze Leung Lai, et al. Asymptotic Solutions of Bandit Problems, 1988.

[30] Wendell H. Fleming, et al. Stochastic differential systems, stochastic control theory and applications, 1988.

[31] D. Teneketzis, et al. Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost, 1988.

[32] Graham C. Goodwin, et al. Adaptive filtering prediction and control, 1984.

[33] H. Robbins, et al. Adaptive Design and the Multiperiod Control Problem, 1982.

[34] P. Whittle. Multi-Armed Bandits and the Gittins Index, 1980.

[35] J. Gittins. Bandit processes and dynamic allocation indices, 1979.

[36] P. Ramadge, et al. Discrete time stochastic adaptive control, 1979, 1979 18th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[37] G. Chow. Analysis and control of dynamic economic systems, 1975.

[38] Masanao Aoki, et al. On Some Price Adjustment Schemes, 1974.

[39] Herman Chernoff, et al. Sequential models for clinical trials, 1967.