P. Wiering

发表

University of Groningen Approximating two value functions instead of one: towards characterizing a new family of Deep Reinforcement Learning algorithms Sabatelli,

Gilles Louppe, Matthia Loupe, Gilles Geurts, 2019 .