Dynamic energy scheduling and routing of a large fleet of electric vehicles using multi-agent reinforcement learning