Learning Multiple Coordinated Agents under Directed Acyclic Graph Constraints