论文信息 - The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models - 字舞流文

The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models

J. Steinhardt | K. Bhatia | Alexander Pan