Multi-armed bandits with switching penalties
暂无分享,去创建一个
[1] R. Agrawal,et al. Multi-armed bandit problems with multiple plays and switching cost , 1990 .
[2] D. Teneketzis,et al. Optimality of index policies for stochastic scheduling with switching penalties , 1992, Journal of Applied Probability.
[3] Jean Walrand,et al. Extensions of the multiarmed bandit problem: The discounted case , 1985 .
[4] A. Mandelbaum. Discrete multi-armed bandits and multi-parameter processes , 1986 .
[5] J. Tsitsiklis. A lemma on the multiarmed bandit problem , 1986 .
[6] R. Agrawal,et al. Asymptotically efficient adaptive allocation schemes for controlled Markov chains: finite parameter space , 1989 .
[7] K. Glazebrook. On a sufficient condition for superprocesses due to whittle , 1982, Journal of Applied Probability.
[8] J. Banks,et al. Switching Costs and the Gittins Index , 1994 .
[9] Kevin D. Glazebrook,et al. Methods for the Evaluation of Permutations as Strategies in Stochastic Scheduling Problems , 1983 .
[10] P. Whittle. Arm-Acquiring Bandits , 1981 .
[11] K. Glazebrook. On stochastic scheduling with precedence relations and switching costs , 1980, Journal of Applied Probability.
[12] D. Teneketzis,et al. Asymptotically Efficient Adaptive Allocation Schemes for Controlled I.I.D. Processes: Finite Paramet , 1988 .
[13] D. Teneketzis,et al. Asymptotically efficient adaptive allocation rules for the multiarmed bandit problem with switching cost , 1988 .
[14] M. Weitzman. Optimal search for the best alternative , 1978 .
[15] J. Gittins. Bandit processes and dynamic allocation indices , 1979 .
[16] Michael N. Katehakis,et al. COMPUTING OPTIMAL SEQUENTIAL ALLOCATION RULES IN CLINICAL TRIALS , 1986 .
[17] J. Banks,et al. Denumerable-Armed Bandits , 1992 .
[18] I. Karatzas. Gittins Indices in the Dynamic Allocation Problem for Diffusion Processes , 1984 .
[19] Michael N. Katehakis,et al. The Multi-Armed Bandit Problem: Decomposition and Computation , 1987, Math. Oper. Res..
[20] F. Kelly. Multi-Armed Bandits with Discount Factor Near One: The Bernoulli Case , 1981 .
[21] P. Whittle. Multi‐Armed Bandits and the Gittins Index , 1980 .
[22] Kevin D. Glazebrook. On the evaluation of suboptimal strategies for families of alternative bandit processes , 1982 .
[23] R. Weber. On the Gittins Index for Multiarmed Bandits , 1992 .
[24] Dale T. Mortensen,et al. Chapter 15 Job search and labor market analysis , 1986 .