High-Reliability Multi-Agent Q-Learning-Based Scheduling for D2D Microgrid Communications

This paper proposes a multi-agent Q-learning-based resource allocation algorithm that allows long-term evolution (LTE)-enabled device-to-device (D2D) communication agents to generate the orthogonal transmission schedules outside the network coverage. This algorithm reduces packet drop rates (PDR) in distributed D2D communication networks to meet the quality-of-service requirements of the microgrid communications. The data traffic characteristics of three archetypal smart grid applications, namely demand response, solar, and generation forecasting, and synchrophasor communications, were simulated under seven different traffic congestion scenarios, where the total aggregate throughput of users ranged from 50% to 140% channel utilization. The PDR and latency performance of the proposed algorithm were compared with the existing random self-allocation mechanism introduced under the Third-Generation Partnership Project’s LTE Release 12 standard for such scenarios. Our algorithm outperformed the LTE algorithm for all tested scenarios, demonstrating 20%–40% absolute reductions in PDR and 10–20-ms reductions in latency for all microgrid applications. The use of our algorithm in a simulated D2D-enabled demand response application resulted in a hundredfold reduction in power oscillations about the desired power flows.

[1]  Jan Markendahl,et al.  Device-to-device communications and small cells: enabling spectrum reuse for dense networks , 2014, IEEE Wireless Communications.

[2]  Xi Fang,et al.  3. Full Four-channel 6.3-gb/s 60-ghz Cmos Transceiver with Low-power Analog and Digital Baseband Circuitry 7. Smart Grid — the New and Improved Power Grid: a Survey , 2022 .

[3]  Ethem Alpaydin,et al.  Introduction to machine learning , 2004, Adaptive computation and machine learning.

[4]  Hung-Yu Wei,et al.  UE autonomous resource selection for D2D communications: Explicit vs. implicit approaches , 2016, 2016 IEEE Conference on Standards for Communications and Networking (CSCN).

[5]  Peng Yong Kong,et al.  Effects of Communication Network Performance on Dynamic Pricing in Smart Power Grid , 2014, IEEE Systems Journal.

[6]  Geoffrey Ye Li,et al.  Deep Reinforcement Learning for Resource Allocation in V2V Communications , 2017, 2018 IEEE International Conference on Communications (ICC).

[7]  Vincenzo Mancuso,et al.  QoS Requirements For Multimedia Services , 2007 .

[8]  Nadjib Aitsaadi,et al.  Joint Routing and Wireless Resource Allocation in Multihop LTE-D2D Communications , 2018, 2018 IEEE 43rd Conference on Local Computer Networks (LCN).

[9]  Jesus Alonso-Zarate,et al.  Cellular Communications for Smart Grid Neighborhood Area Networks: A Survey , 2016, IEEE Access.

[10]  H. T. Mouftah,et al.  Energy-Efficient Information and Communication Infrastructures in the Smart Grid: A Survey on Interactions and Open Issues , 2015, IEEE Communications Surveys & Tutorials.

[11]  Muhammad Ali Imran,et al.  Delay-optimal mode selection in device-to-device communications for smart grid , 2017, 2017 IEEE International Conference on Smart Grid Communications (SmartGridComm).

[12]  H. T. Mouftah,et al.  Delay Critical Smart Grid Applications and Adaptive QoS Provisioning , 2015, IEEE Access.

[13]  Javier Gozalvez,et al.  LTE-V for Sidelink 5G V2X Vehicular Communications: A New 5G Technology for Short-Range Vehicle-to-Everything Communications , 2017, IEEE Vehicular Technology Magazine.

[14]  Hichem Besbes,et al.  Radio resource allocation scheme for intra-inter-cell D2D communications in LTE-A , 2015, 2015 IEEE 26th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC).

[15]  Luigi Vanfretti,et al.  An IEC 61850-90-5 gateway for IEEE C37.118.2 synchrophasor data transfer , 2016, 2016 IEEE Power and Energy Society General Meeting (PESGM).

[16]  Muhammad Ali Imran,et al.  Joint Resource Allocation and Power Control in Heterogeneous Cellular Networks for Smart Grids , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[17]  Dacheng Yang,et al.  QoS-aware mode selection and resource allocation scheme for Device-to-Device (D2D) communication in cellular networks , 2013, 2013 IEEE International Conference on Communications Workshops (ICC).

[18]  K. Shamganth,et al.  A survey on relay selection in cooperative device-to-device (D2D) communication for 5G cellular networks , 2017, 2017 International Conference on Energy, Communication, Data Analytics and Soft Computing (ICECDS).

[19]  Zhao Ming,et al.  Q-learning based power control algorithm for D2D communication , 2016 .

[20]  Tao Jiang,et al.  Device-to-Device Communications for Energy Management: A Smart Grid Case , 2016, IEEE Journal on Selected Areas in Communications.

[21]  Setareh Maghsudi,et al.  Joint channel allocation and power control for underlay D2D transmission , 2015, 2015 IEEE International Conference on Communications (ICC).

[22]  Yoshikazu Miyanaga,et al.  An Autonomous Learning-Based Algorithm for Joint Channel and Power Level Selection by D2D Pairs in Heterogeneous Cellular Networks , 2016, IEEE Transactions on Communications.

[23]  Taskin Koçak,et al.  A Survey on Smart Grid Potential Applications and Communication Requirements , 2013, IEEE Transactions on Industrial Informatics.