The dynamics of reinforcement learning agents