Multi-Agent Q-Learning for Power Allocation in Interference Channel