UPD E T: U NIVERSAL M ULTI - AGENT R EINFORCEMENT L EARNING VIA P OLICY D ECOUPLING WITH T RANS FORMERS