论文信息 - How to Train Your Agent: Active Learning from Human Preferences and Justifications in Safety-critical Environments - 字舞流文

How to Train Your Agent: Active Learning from Human Preferences and Justifications in Safety-critical Environments

T. Norman | Yali Du | Ilias Kazantzidis | Christopher T. Freeman