Reinforcement Learning for Energy-Efficient Delay-Sensitive CSMA/CA Scheduling

We study learning-based energy-efficient multi- user scheduling of delay-sensitive data over fading channels. To tradeoff energy and delay, we combine adaptive rate transmission at the physical layer with a rate-adaptive medium access control (MAC) protocol based on carrier sense multiple access with collision avoidance (CSMA/CA). We formulate the multi-user scheduling problem as a constrained Markov decision process (CMDP). We show that the multi-user problem is intractable and propose to decompose it into multiple (coupled) single-user problems. We design a reinforcement learning algorithm to solve the single-user problems online so that users can achieve energy-efficient operation while meeting their delay constraints, even though the channel, traffic, and multi-user dynamics are unknown a priori. Our proposed MAC protocol enables users to meet significantly tighter delay constraints while also consuming less energy than under the 802.11 Distributed Coordination Function (DCF). Moreover, the proposed learning algorithm converges significantly faster than a state-of-the-art solution.

[1]  Jacob Chakareski Uplink Scheduling of Visual Sensors: When View Popularity Matters , 2015, IEEE Transactions on Communications.

[2]  Jacob Chakareski Distributed media cooperation for enhanced video communication , 2006 .

[3]  Martin Heusse,et al.  Performance anomaly of 802.11b , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[4]  Vivek S. Borkar,et al.  A Stable Online Algorithm for Energy-Efficient Multiuser Scheduling , 2010, IEEE Transactions on Mobile Computing.

[5]  Pascal Frossard,et al.  Rate-distortion optimized distributed packet scheduling of multiple video streams over shared communication resources , 2006, IEEE Trans. Multim..

[6]  Yang Xiao Backoff-based priority schemes for IEEE 802.11 , 2003, IEEE International Conference on Communications, 2003. ICC '03..

[7]  Kiseon Kim,et al.  A novel MAC scheme for prioritized services in IEEE 802.11a wireless LAN , 2001, Joint 4th IEEE International Conference on ATM(ICATM'01) and High Speed Intelligent Internet Symposium. ICATM 2001 (Cat. No.00EX486).

[8]  Mihaela van der Schaar,et al.  Optimal Foresighted Multi-User Wireless Video , 2015, IEEE Journal of Selected Topics in Signal Processing.

[9]  Mihaela van der Schaar,et al.  Joint Physical-Layer and System-Level Power Management for Delay-Sensitive Wireless Communications , 2013, IEEE Transactions on Mobile Computing.

[10]  Lei Ding,et al.  Cross-Layer Routing and Dynamic Spectrum Allocation in Cognitive Radio Ad Hoc Networks , 2010, IEEE Transactions on Vehicular Technology.

[11]  Saleem A. Kassam,et al.  Finite-state Markov model for Rayleigh fading channels , 1999, IEEE Trans. Commun..

[12]  E. Altman Constrained Markov Decision Processes , 1999 .

[13]  A. Girotra,et al.  Performance Analysis of the IEEE 802 . 11 Distributed Coordination Function , 2005 .

[14]  Luca Benini,et al.  Policy optimization for dynamic power management , 1999, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[15]  Xuemin Shen,et al.  MAC in Motion: Impact of Mobility on the MAC of Drive-Thru Internet , 2012, IEEE Transactions on Mobile Computing.

[16]  Ruay-Shiung Chang,et al.  A Priority Scheme for IEEE 802. 11 DCF Access Method , 1999 .

[17]  Mihaela van der Schaar,et al.  A systematic framework for dynamically optimizing multi-user wireless video transmission , 2009, IEEE Journal on Selected Areas in Communications.