MDP Playground: A Design and Debug Testbed for Reinforcement Learning