Exploiting channel memory for multi-user wireless scheduling without channel measurement: Capacity regions and algorithms

We study the fundamental network capacity of a multi-user wireless downlink under two assumptions: (1) Channels are not explicitly measured and thus instantaneous states are unknown, (2) Channels are modeled as ON/OFF Markov chains. This is an important network model to explore because channel probing may be costly or infeasible in some contexts. In this case, we can use channel memory with ACK/NACK feedback from previous transmissions to improve network throughput. Computing in closed form the capacity region of this network is difficult because it involves solving a high dimension partially observed Markov decision problem. Instead, in this paper we construct an inner and outer bound on the capacity region, showing that the bound is tight when the number of users is large and the traffic is symmetric. For the case of heterogeneous traffic and any number of users, we propose a simple queue-dependent policy that can stabilize the network with any data rates strictly within the inner capacity bound. The stability analysis uses a novel frame-based Lyapunov drift argument. The outer-bound analysis uses stochastic coupling and state aggregation to bound the performance of a restless bandit problem using a related multi-armed bandit system. Our results are useful in cognitive radio networks, opportunistic scheduling with delayed/uncertain channel state information, and restless bandit problems.

[1]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[2]  Michael J. Neely,et al.  Energy-optimal scheduling with dynamic channel acquisition in wireless downlinks , 2007, 2007 46th IEEE Conference on Decision and Control.

[3]  Qing Zhao,et al.  Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access , 2008, IEEE Transactions on Information Theory.

[4]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Vol. II , 1976 .

[5]  Mingyan Liu,et al.  Multi-channel opportunistic access: A case of restless bandits with multiple plays , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[6]  P. Whittle Restless Bandits: Activity Allocation in a Changing World , 1988 .

[7]  Christian M. Ernst,et al.  Multi-armed Bandit Allocation Indices , 1989 .

[8]  Hong Shen Wang,et al.  On verifying the first-order Markovian assumption for a Rayleigh fading channel model , 1994, Proceedings of 1994 3rd IEEE International Conference on Universal Personal Communications.

[9]  Robert G. Gallager,et al.  Discrete Stochastic Processes , 1995 .

[10]  Bhaskar Krishnamachari,et al.  On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance , 2007, IEEE Transactions on Wireless Communications.

[11]  Ananthram Swami,et al.  A Decision-Theoretic Framework for Opportunistic Spectrum Access , 2007, IEEE Wireless Communications.

[12]  Mingyan Liu,et al.  Optimality of Myopic Sensing in Multi-Channel Opportunistic Access , 2008, 2008 IEEE International Conference on Communications.

[13]  Michael J. Neely,et al.  Stochastic optimization for Markov modulated networks with application to delay constrained wireless scheduling , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[14]  Prasanna Chaporkar,et al.  Scheduling with limited information in wireless systems , 2009, MobiHoc '09.

[15]  J. Bather,et al.  Multi‐Armed Bandit Allocation Indices , 1990 .

[16]  Laurence B. Milstein,et al.  A Markov model for block errors on fading channels , 1996, Proceedings of PIMRC '96 - 7th International Symposium on Personal, Indoor, and Mobile Communications.

[17]  Michael J. Neely,et al.  Energy-Optimal Scheduling with Dynamic Channel Acquisition in Wireless Downlinks , 2010, IEEE Trans. Mob. Comput..

[18]  Kevin D. Glazebrook,et al.  Multi-Armed Bandit Allocation Indices: Gittins/Multi-Armed Bandit Allocation Indices , 2011 .

[19]  Pao-Chi Chang,et al.  On verifying the first-order Markovian assumption for a Rayleigh fading channel model , 1996 .

[20]  Leandros Tassiulas,et al.  Dynamic server allocation to parallel queues with randomly varying connectivity , 1993, IEEE Trans. Inf. Theory.

[21]  Leandros Tassiulas,et al.  Resource Allocation and Cross-Layer Control in Wireless Networks , 2006, Found. Trends Netw..

[22]  Michael J. Neely,et al.  On Achievable Network Capacity and Throughput-Achieving Policies over Markov ON/OFF Channels , 2010, arXiv.org.

[23]  Anthony Ephremides,et al.  Joint Scheduling and Routing for Ad-hoc Networks Under Channel State Uncertainty , 2007, 2007 5th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks and Workshops.

[24]  Michael J. Neely,et al.  Network utility maximization over partially observable Markovian channels , 2010, 2011 International Symposium of Modeling and Optimization of Mobile, Ad Hoc, and Wireless Networks.

[25]  Lei Ying,et al.  On Throughput Optimality With Delayed Network-State Information , 2008, IEEE Transactions on Information Theory.

[26]  Lei Ying,et al.  Scheduling in Mobile Ad Hoc Networks With Topology and Channel-State Uncertainty , 2012, IEEE Transactions on Automatic Control.