Can Reinforcement Learning Always Provide the Best Policy
暂无分享,去创建一个
[1] H. Vincent Poor,et al. An Introduction to Signal Detection and Estimation , 1994, Springer Texts in Electrical Engineering.
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[4] Anthony Kuh,et al. Temporal difference learning applied to sequential detection , 1997, IEEE Trans. Neural Networks.
[5] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..
[6] Richard S. Sutton,et al. Open Theoretical Questions in Reinforcement Learning , 1999, EuroCOLT.
[7] J. Andel. Sequential Analysis , 2022, The SAGE Encyclopedia of Research Design.
[8] R. Khan,et al. Sequential Tests of Statistical Hypotheses. , 1972 .
[9] Iain Murray,et al. Solution of a Toy Problem by Reinforcement Learning , 2006 .
[10] J. G. Gander,et al. An introduction to signal detection and estimation , 1990 .
[11] Michael L. Littman,et al. Algorithms for Sequential Decision Making , 1996 .
[12] J. Wolfowitz,et al. Optimum Character of the Sequential Probability Ratio Test , 1948 .