Multi-agent Deep Reinforcement Learning Based Adaptive User Association in Heterogeneous Networks

Nowadays, lots of technical challenges emerge focusing on user association in ever-increasingly complicated 5G heterogeneous networks. With distributed multiple attribute decision making (MADM) algorithm, users tend to maximize their utilities selfishly for lack of cooperation, leading to congestion. Therefore, it is efficient to apply artificial intelligence to deal with these emerging problems, which enables users to learn with incomplete environment information. In this paper, we propose an adaptive user association approach based on multi-agent deep reinforcement learning (RL), considering various user equipment types and femtocell access mechanisms. It aims to achieve a desirable trade-off between Quality of Experience (QoE) and load balancing. We formulate user association as a Markov Decision Process. And a deep RL approach, semi-distributed deep Q-network (DQN), is exploited to get the optimal strategy. Individual reward is defined as a function of transmission rate and base station load, which are adaptively balanced by a designed weight. Simulation results reveal that DQN with adaptive weight achieves the highest average reward compared with DQN with fixed weight and MADM, which indicates it obtains the best trade-off between QoE and load balancing. Compared with MADM, our approach improves by \({4\%\sim 11\%}\), \({32\%\sim 40\%}\), \({99\%}\) in terms of QoE, load balancing and blocking probability, respectively. Furthermore, semi-distributed framework reduces computational complexity.

[1]  Jie Zhang,et al.  Access control mechanisms for femtocells , 2010, IEEE Communications Magazine.

[2]  Lusheng Wang,et al.  Mathematical Modeling for Network Selection in Heterogeneous Wireless Networks — A Tutorial , 2013, IEEE Communications Surveys & Tutorials.

[3]  Gang Feng,et al.  Multi-RAT Access Based on Multi-Agent Reinforcement Learning , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[4]  Xing Zhang,et al.  Context-aware multi-RAT connection with bi-level decision in 5G heterogeneous networks , 2017, 2017 IEEE/CIC International Conference on Communications in China (ICCC).

[5]  Wan Choi,et al.  Optimal Access in OFDMA Multi-RAT Cellular Networks With Stochastic Geometry: Can a Single RAT Be Better? , 2016, IEEE Transactions on Wireless Communications.

[6]  Zhu Han,et al.  Cell selection in two-tier femtocell networks with open/closed access using evolutionary game , 2013, 2013 IEEE Wireless Communications and Networking Conference (WCNC).

[7]  Bernard Cousin,et al.  A Network-Assisted Approach for RAT Selection in Heterogeneous Cellular Networks , 2015, IEEE Journal on Selected Areas in Communications.

[8]  Shin-Ming Cheng,et al.  eNB Selection for Machine Type Communications Using Reinforcement Learning Based Markov Decision Process , 2017, IEEE Transactions on Vehicular Technology.

[9]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[10]  Jeffrey G. Andrews,et al.  Femtocell networks: a survey , 2008, IEEE Communications Magazine.