Information bounds, certainty equivalence and learning in asymptotically efficient adaptive control of time-invariant stochastic systems

[1]  T. Lai,et al.  Parallel recursive algorithms in asymptotically efficient adaptive control of linear stochastic systems , 1991 .

[2]  T. Lai,et al.  Optimal stopping and dynamic allocation , 1987, Advances in Applied Probability.

[3]  J. Walrand,et al.  Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards , 1987 .

[4]  T. Lai.  Adaptive treatment allocation and the multi-armed bandit problem , 1987 .

[5]  C. Z. Wei.  Multivariate Adaptive Stochastic Approximation , 1987 .

[6]  T. Lai,et al.  Asymptotically efficient self-tuning regulators , 1987 .

[7]  T. Lai,et al.  Extended least squares and their applications to adaptive control and prediction in linear systems , 1986 .

[8]  T. Lai,et al.  On the concept of excitation in least squares identification and adaptive control , 1986 .

[9]  T. Lai.  Asymptotically efficient adaptive control in stochastic regression models , 1986 .

[10]  Patchigolla Kiran Kumar,et al.  A Survey of Some Results in Stochastic Adaptive Control , 1985 .

[11]  H. Robbins,et al.  Asymptotically efficient adaptive allocation rules , 1985 .

[12]  Lennart Ljung,et al.  Theory and Practice of Recursive Identification , 1983 .

[13]  Karl Johan Åström,et al.  Theory and applications of adaptive control - A survey , 1983, Automatica.

[14]  P. Caines,et al.  Adaptive control with recursive identification for stochastic linear systems: Multivariable case , 1982, 1982 21st IEEE Conference on Decision and Control.

[15]  H. Robbins,et al.  Iterated least squares in multiperiod control , 1982 .

[16]  T. Lai,et al.  Least Squares Estimates in Stochastic Regression Models with Applications to Identification and Control of Dynamic Systems , 1982 .

[17]  V. Solo.  The convergence of AML , 1979 .

[18]  H. Robbins,et al.  Adaptive Design and Stochastic Approximation , 1979 .

[19]  T. W. Anderson,et al.  Some Experimental Results on the Statistical Properties of Least Squares Estimates in Control Problems , 1976 .

[20]  J. Gani,et al.  Progress in statistics , 1975 .

[21]  Arnold Zellner,et al.  An Introduction to Bayesian Inference in Econometrics , 1974 .

[22]  Björn Wittenmark,et al.  On Self Tuning Regulators , 1973 .

[23]  E. Prescott.  The Multi-Period Control Problem Under Uncertainty , 1972 .

[24]  D. Berry.  A Bernoulli Two-armed Bandit , 1972 .

[25]  W. V. Zwet,et al.  Some Remarks on the Two-Armed Bandit , 1970 .

[26]  Herman Chernoff,et al.  A Bayes Sequential Sampling Inspection Plan , 1965 .

[27]  Dorian Feldman.  Contributions to the "Two-Armed Bandit" Problem , 1962 .

[28]  H. Robbins.  Some aspects of the sequential design of experiments , 1952 .

[29]  Tze Leung Lai,et al.  Asymptotic Solutions of Bandit Problems , 1988 .

[30]  Wendell H. Fleming,et al.  Stochastic differential systems, stochastic control theory and applications , 1988 .

[31]  D. Teneketzis,et al.  Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost , 1988 .

[32]  Graham C. Goodwin,et al.  Adaptive filtering prediction and control , 1984 .

[33]  H. Robbins,et al.  Adaptive Design and the Multiperiod Control Problem , 1982 .

[34]  P. Whittle.  Multi‐Armed Bandits and the Gittins Index , 1980 .

[35]  J. Gittins.  Bandit processes and dynamic allocation indices , 1979 .

[36]  P. Ramadge,et al.  Discrete time stochastic adaptive control , 1979, 1979 18th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes.

[37]  G. Chow.  Analysis and control of dynamic economic systems , 1975 .

[38]  Masanao Aoki,et al.  On Some Price Adjustment Schemes , 1974 .

[39]  Herman Chernoff,et al.  Sequential models for clinical trials , 1967 .