W. Lu
发表
J. Hou,
X. Wang,
B. Wang,
2007
.
Statistical inference of the value function for reinforcement learning in infinite‐horizon settings
pdf
S. Zhang,
C. Shi,
W. Lu,
2020,
Journal of the Royal Statistical Society: Series B (Statistical Methodology).
Alan H. Greenaway,
S. Zhang,
Weiping Lu,
2006
.