Multiple Access in Dynamic Cell-Free Networks: Outage Performance and Deep Reinforcement Learning-Based Design

In future cell-free (or cell-less) wireless networks, a large number of devices in a geographical area will be served simultaneously, in non-orthogonal multiple access scenarios, by a large number of distributed access points (APs) that coordinate with a centralized processing pool. For such a centralized cell-free network with a static, predefined beamforming design, we first derive a closed-form expression for the uplink per-user outage probability. To significantly reduce the complexity of jointly processing users' signals in the presence of a large number of devices and APs, we propose a novel dynamic cell-free network architecture. In this architecture, the distributed APs are partitioned (i.e., clustered) into a set of subgroups, with each subgroup acting as a virtual AP equipped with a distributed antenna system (DAS). The conventional static cell-free network is a special case of this dynamic cell-free network when the cluster size is one. For this dynamic cell-free network, we propose a successive interference cancellation (SIC)-enabled signal detection method and an inter-user-interference (IUI)-aware receive diversity combining scheme for the DAS. We then formulate the general problem of clustering the APs and designing the beamforming vectors with the objective of maximizing either the sum rate or the minimum rate. To solve this optimization problem online with low complexity, we propose a hybrid deep reinforcement learning (DRL) model, namely a deep deterministic policy gradient (DDPG)-deep double Q-network (DDQN) model. In terms of average per-user rate, the DRL model for sum-rate optimization significantly outperforms the one for maximizing the minimum rate. Also, in our system setting, the proposed DDPG-DDQN scheme is found to achieve around $78\%$ of the rate achievable through an exhaustive search-based design.
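To give a rough sense of why an exhaustive search over AP clusterings becomes costly, the sketch below enumerates every way to partition a small set of APs into non-empty clusters and counts them. It is only an illustration under simplifying assumptions: the function names `partitions` and `count_clusterings` are hypothetical, clusters are treated as unordered sets of AP indices, and the beamforming design and rate evaluation that the search would also need are omitted entirely.

```python
from typing import Iterator, List


def partitions(items: List[int]) -> Iterator[List[List[int]]]:
    """Yield every split of `items` into non-empty, unordered clusters."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for smaller in partitions(rest):
        # Option 1: add `first` to each existing cluster in turn.
        for i in range(len(smaller)):
            yield smaller[:i] + [[first] + smaller[i]] + smaller[i + 1:]
        # Option 2: place `first` in a new cluster of its own.
        yield [[first]] + smaller


def count_clusterings(num_aps: int) -> int:
    """Count all candidate clusterings of `num_aps` APs (hypothetical helper)."""
    return sum(1 for _ in partitions(list(range(num_aps))))


if __name__ == "__main__":
    # The count grows as the Bell numbers: 1, 2, 5, 15, 52, 203, 877, 4140, ...
    for m in range(1, 9):
        print(m, count_clusterings(m))
```

Even before any beamforming vectors are considered, the number of candidate clusterings grows faster than exponentially with the number of APs, which is consistent with the motivation for a low-complexity learned design.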
