UvA-DARE (Digital Academic Repository)

Rethinking Supervised Learning and Reinforcement Learning in Task-Oriented Dialogue Systems