论文信息 - Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning - 字舞流文

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

Alexander H. Miller | Athul Paul Jacob | Adam Lerer | A. Bakhtin | Noam Brown | Jonathan Gray | Gabriele Farina | David J. Wu