An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems