Huizhen Yu
Publications
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2009,
IEEE Transactions on Automatic Control.
Huizhen Yu,
2010,
ICML.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2010,
2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2004,
UAI.
Huizhen Yu,
2012,
SIAM J. Control. Optim.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2013,
Math. Oper. Res.
Huizhen Yu,
2015,
J. Mach. Learn. Res.
Huizhen Yu,
2015,
COLT.
A Function Approximation Approach to Estimation of Policy Gradient for POMDP with Structured Policies
Huizhen Yu,
2005,
UAI.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2010,
Math. Oper. Res.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2013,
Math. Oper. Res.
Huizhen Yu,
2017,
ArXiv.
Huizhen Yu,
Huizhen Yu U.H,
L. Sucar,
2006,
Encyclopedia of Social Network Analysis and Mining.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2009,
2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2013.
Huizhen Yu,
2016,
ArXiv.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2010,
49th IEEE Conference on Decision and Control (CDC).
Huizhen Yu,
2006.
Martha White,
Richard S. Sutton,
Huizhen Yu,
2015,
ArXiv.
Richard S. Sutton,
Huizhen Yu,
Ashique Rupam Mahmood,
2017,
Canadian Conference on AI.
Huizhen Yu,
2020,
SIAM J. Control. Optim.
Huizhen Yu,
2014,
SIAM J. Control. Optim.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2008,
2008 46th Annual Allerton Conference on Communication, Control, and Computing.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2012,
Annals of Operations Research.
Huizhen Yu,
2020,
SIAM J. Control. Optim.
Richard S. Sutton,
Huizhen Yu,
Ashique Rupam Mahmood,
2017,
ArXiv.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2011,
SIAM J. Optim.
Dimitri P. Bertsekas,
Huizhen Yu,
D. Bertsekas,
2008,
Math. Oper. Res.
Juho Rousu,
Huizhen Yu,
Tietojenkäsittelytieteen laitos,
2008.
W. Eric L. Grimson,
Huizhen Yu,
2001,
IEEE Pacific Rim Conference on Multimedia.
Two geometric input transformation methods for fast online reinforcement learning with neural nets
Richard S. Sutton,
Huizhen Yu,
Sina Ghiassian,
2018,
ArXiv.
Dimitri P. Bertsekas,
Huizhen Yu,
2010,
CDC.
Huizhen Yu,
2019,
Math. Oper. Res.
Huizhen Yu,
2021,
arXiv:2104.00181.