About an initial value of Q-value in Profit Sharing
暂无分享,去创建一个
[1] Shoji Tatsumi,et al. About the Reinforcement Function for Profit Sharing , 2004 .
[2] John J. Grefenstette,et al. Credit assignment in rule discovery systems based on genetic algorithms , 1988, Machine Learning.
[3] Uemura Wataru,et al. SAPS: The Exploitation Reinforcement Learning Method on POMDPs , 2004 .
[4] Kwang Soon Lee,et al. Successive Linearization-based Repetitive Control of Simulated Moving Bed Process , 2006, 2006 SICE-ICASE International Joint Conference.
[5] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[6] K.S. Lee,et al. Model Predictive Control of Condensate Recycle Process in a Cogeneration Power Station , 2007, 2007 American Control Conference.
[7] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.
[8] Richard S. Sutton,et al. Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming , 1990, ML.