Reinforcement learning and evolutionary algorithms for non-stationary multi-armed bandit problems
[1] B. McCall,et al. Systematic search, belated information, and the Gittins' index , 1981 .
[2] Dirk Thierens,et al. An Adaptive Pursuit Strategy for Allocating Operator Probabilities , 2005, BNAIC.
[3] Jean Walrand,et al. Extensions of the multiarmed bandit problem: The discounted case , 1985 .
[4] Nicolò Cesa-Bianchi,et al. Gambling in a rigged casino: The adversarial multi-armed bandit problem , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.
[5] Benoît Leloup,et al. Dynamic Pricing on the Internet: Theory and Simulations , 2001, Electron. Commer. Res..
[6] Rina Azoulay-Schwartz,et al. Exploitation vs. exploration: choosing a supplier in an environment of incomplete information , 2004, Decis. Support Syst..
[7] Mayur S. Desai,et al. Information technology project failures: Applying the bandit problem to evaluate managerial decision making , 2005, Inf. Manag. Comput. Security.
[8] Tackseung Jun. A Survey on the Bandit Problem with Switching Costs , 2004, De Economist.
[9] P. S. Sastry,et al. A Class of Rapidly Converging Algorithms for Learning Automata , 1984 .
[10] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[11] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[12] P. W. Jones,et al. Bandit Problems, Sequential Allocation of Experiments , 1987 .
[13] A. Mandelbaum,et al. Multi-armed bandits in discrete and continuous time , 1998 .
[14] M. A. L. Thathachar,et al. A new approach to the design of reinforcement schemes for learning automata , 1985, IEEE Transactions on Systems, Man, and Cybernetics.
[15] D B Fogel,et al. Do evolutionary processes minimize expected losses? , 2000, Journal of theoretical biology.
[16] J. Banks,et al. Switching Costs and the Gittins Index , 1994 .
[17] Irene Valsecchi,et al. Job assignment and bandit problems , 2003 .
[18] J. Bather,et al. Multi‐Armed Bandit Allocation Indices , 1990 .