论文信息 - Few-Shot Preference Learning for Human-in-the-Loop RL - 字舞流文

Few-Shot Preference Learning for Human-in-the-Loop RL

Dorsa Sadigh | Divyansh Garg | D. Hejna | Ashwin Vangipuram | Joey Hejna