Daniel Visentin
发表
Geraint Rees,
Demis Hassabis,
Olaf Ronneberger,
2018,
Nature Medicine.
Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
pdf
Peter Sunehag,
Gabriel Dulac-Arnold,
Yori Zwols,
2015,
ArXiv.