To Send or Not to Send - Learning MAC Contention

The exponential back-off mechanism, proposed for reducing MAC- layer contention in the 802.11 standard, is sub-optimal in terms of the network throughput. This back-off mechanism and its improved variants are especially inefficient under unknown dynamics such as packet arrivals and user entry/exit. In this paper, we formulate the problem of optimizing this back-off mechanism as a Markov decision process, and propose online learning algorithms to learn the optimal back-off schemes under unknown dynamics. By exploiting the fact that some components of the system dynamics (such as protocol states) are known because the users follow the common 802.11 protocol, we propose a post-decision state (PDS)- based learning algorithm to speed up the learning process. Compared to traditional Q-learning algorithms, the advantages of the proposed online learning algorithm are that 1) it exploits partial information about the system so that less information needs to be learned in comparison to other learning algorithms, and 2) it removes the necessity for action exploration which usually impedes the learning process of conventional learning algorithms (such as Q-Learning). We prove the optimality of the proposed PDS-based learning algorithm and via numerical results demonstrate the improvement over existing protocols and Q-learning in terms of throughput and convergence speed. We first address this problem from a single-user perspective and later describe the challenges involved and present new insights into the multi-user learning scenarios, especially in cases where the MDP models of the users are coupled with each other.

[1]  Yongkang Xiao,et al.  Game theory models for IEEE 802.11 DCF in wireless ad hoc networks , 2005, IEEE Communications Magazine.

[2]  Mihaela van der Schaar,et al.  Winning the Lottery: Learning Perfect Coordination With Minimal Feedback , 2013, IEEE Journal of Selected Topics in Signal Processing.

[3]  Yuguang Fang,et al.  A novel MAC protocol with fast collision resolution for wireless LANs , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[4]  Ahmed Mehaoua,et al.  SCW: sliding contention window for efficient service differentiation in IEEE 802.11 networks , 2005, IEEE Wireless Communications and Networking Conference, 2005.

[5]  Yanghee Choi,et al.  P-DCF: enhanced backoff scheme for the IEEE 802.11 DCF , 2005, 2005 IEEE 61st Vehicular Technology Conference.

[6]  Nah-Oak Song,et al.  Enhancement of IEEE 802.11 distributed coordination function with exponential increase exponential decrease backoff algorithm , 2003, The 57th IEEE Semiannual Vehicular Technology Conference, 2003. VTC 2003-Spring..

[7]  Kamal Jamshidi,et al.  Game Theory Applications in CSMA Methods , 2013, IEEE Communications Surveys & Tutorials.

[8]  David Malone,et al.  Decentralised learning MACs for collision-free access in WLANs , 2010, Wirel. Networks.

[9]  Mohamed Naimi,et al.  Determinist contention window algorithm for IEEE 802.11 , 2005, 2005 IEEE 16th International Symposium on Personal, Indoor and Mobile Radio Communications.

[10]  Cristina Cano,et al.  Learning-BEB: Avoiding Collisions in WLAN , 2008 .

[11]  Jeffrey G. Andrews,et al.  Fundamentals of WiMAX: Understanding Broadband Wireless Networking , 2007 .

[12]  Sagar Naik,et al.  A Localized Adaptive Strategy to Calculate the Backoff Interval in Contention-Based Vehicular Networks , 2014, IEEE Access.

[13]  Abhijeet Bhorkar,et al.  An on-line learning algorithm for energy efficient delay constrained scheduling over a fading channel , 2008, IEEE Journal on Selected Areas in Communications.

[14]  Mihaela van der Schaar,et al.  Structure-Aware Stochastic Control for Transmission Scheduling , 2010, IEEE Transactions on Vehicular Technology.

[15]  Urbashi Mitra,et al.  Cognitive interference management in retransmission-based wireless networks , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[16]  Mihaela van der Schaar,et al.  Joint Physical-Layer and System-Level Power Management for Delay-Sensitive Wireless Communications , 2013, IEEE Transactions on Mobile Computing.

[17]  P. Venkata Krishna,et al.  Virtual Backoff Algorithm: An Enhancement to 802.11 Medium-Access Control to Improve the Performance of Wireless Networks , 2010, IEEE Transactions on Vehicular Technology.

[18]  Gary Shueng Han Chan,et al.  A TCP-like adaptive contention window scheme for WLAN , 2004 .