Sreejith Balakrishnan
发表
Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization
pdf
Harold Soh,
Quoc Phong Nguyen,
Bryan Kian Hsiang Low,
2020,
NeurIPS.