论文信息 - The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications - 字舞流文

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications

W. B. Knox | S. Niekum | J. Shah | Peter Stone | A. Allievi | S. Booth | Bosch