Network utility maximization over partially observable Markovian channels

This paper considers maximizing throughput utility in a multi-user network with partially observable Markov ON/OFF channels. Instantaneous channel states are never known, and all control decisions are based on information provided by ACK/NACK feedback from past transmissions. This system can be viewed as a restless multi-armed bandit problem with a concave objective function of the time average reward vector. Such problems are generally intractable. However, we provide an approximate solution by optimizing the concave objective over a non-trivial inner bound on the network performance region, where the inner bound is constructed by randomizing well-designed stationary policies. Using a new frame-based Lyapunov drift argument, we design a policy of admission control and channel selection that stabilizes the network with throughput utility that can be made arbitrarily close to the optimal in the inner performance region. Our problem has applications in limited channel probing in wireless networks, dynamic spectrum access in cognitive radio networks, and target tracking of unmanned aerial vehicles. Our analysis generalizes the MaxWeight-type scheduling policies in stochastic network optimization theory from time-slotted systems to frame-based systems that have policy-dependent frame sizes.

[1]  Xiaojun Lin,et al.  Joint rate control and scheduling in multihop wireless networks , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[2]  Prasanna Chaporkar,et al.  Optimal joint probing and transmission strategy for maximizing throughput in wireless systems , 2008, IEEE Journal on Selected Areas in Communications.

[3]  Ness B. Shroff,et al.  Multiuser Scheduling in a Markov-Modeled Downlink Using Randomly Delayed ARQ Feedback , 2012, IEEE Transactions on Information Theory.

[4]  R. Srikant,et al.  Fair Resource Allocation in Wireless Networks Using Queue-Length-Based Scheduling and Congestion Control , 2005, IEEE/ACM Transactions on Networking.

[5]  Koushik Kar,et al.  Throughput-Optimal Scheduling in Multichannel Access Point Networks Under Infrequent Channel Measurements , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[6]  David D. Yao,et al.  Dynamic Scheduling via Polymatroid Optimization , 2002, Performance.

[7]  Qing Zhao,et al.  Indexability of Restless Bandit Problems and Optimality of Whittle Index for Dynamic Multichannel Access , 2008, IEEE Transactions on Information Theory.

[8]  Leandros Tassiulas,et al.  Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks , 1992 .

[9]  Michael J. Neely,et al.  Dynamic power allocation and routing for satellite and wireless networks with time varying channels , 2003 .

[10]  Laurence B. Milstein,et al.  A Markov model for block errors on fading channels , 1996, Proceedings of PIMRC '96 - 7th International Symposium on Personal, Indoor, and Mobile Communications.

[11]  Jean C. Walrand,et al.  Fair end-to-end window-based congestion control , 2000, TNET.

[12]  J. Nio-Mora An Index Policy for Dynamic Fading-Channel Allocation to Heterogeneous Mobile Users with Partial Observations , 2008, 2008 Next Generation Internet Networks.

[13]  Atilla Eryilmaz,et al.  Asymptotically optimal downlink scheduling over Markovian fading channels , 2012, 2012 Proceedings IEEE INFOCOM.

[14]  José Niño-Mora,et al.  Dynamic priority allocation via restless bandit marginal productivity indices , 2007, 2304.06115.

[15]  Michael J. Neely,et al.  Exploiting channel memory for multi-user wireless scheduling without channel measurement: Capacity regions and algorithms , 2010, 8th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks.

[16]  Eytan Modiano,et al.  Power allocation and routing in multibeam satellites with time-varying channels , 2003, TNET.

[17]  Alexander L. Stolyar,et al.  Maximizing Queueing Network Utility Subject to Stability: Greedy Primal-Dual Algorithm , 2005, Queueing Syst. Theory Appl..

[18]  Eytan Modiano,et al.  A state action frequency approach to throughput maximization over uncertain wireless channels , 2011, 2011 Proceedings IEEE INFOCOM.

[19]  Leandros Tassiulas,et al.  Resource Allocation and Cross-Layer Control in Wireless Networks , 2006, Found. Trends Netw..

[20]  Michael J. Neely,et al.  Asynchronous scheduling for energy optimality in systems with multiple servers , 2012, 2012 46th Annual Conference on Information Sciences and Systems (CISS).

[21]  Mingyan Liu,et al.  Optimality of Myopic Sensing in Multi-Channel Opportunistic Access , 2008, 2008 IEEE International Conference on Communications.

[22]  R. Weber,et al.  On an index policy for restless bandits , 1990, Journal of Applied Probability.

[23]  Prasanna Chaporkar,et al.  Learning to Optimally Exploit Multi-Channel Diversity in Wireless Systems , 2010, 2010 Proceedings IEEE INFOCOM.

[24]  R. Srikant,et al.  Joint congestion control, routing, and MAC for stability and fairness in wireless networks , 2006, IEEE Journal on Selected Areas in Communications.

[25]  P. Whittle Restless Bandits: Activity Allocation in a Changing World , 1988 .

[26]  Guy Pujolle,et al.  Introduction to queueing networks , 1987 .

[27]  Mingyan Liu,et al.  Optimal Channel Probing and Transmission Scheduling for Opportunistic Spectrum Access , 2007, IEEE/ACM Transactions on Networking.

[28]  D. Bertsekas,et al.  Approximate solution methods for partially observable markov and semi-markov decision processes , 2006 .

[29]  Lei Ying,et al.  Scheduling in Mobile Ad Hoc Networks With Topology and Channel-State Uncertainty , 2012, IEEE Transactions on Automatic Control.

[30]  Michael J. Neely,et al.  Energy optimal control for time-varying wireless networks , 2005, IEEE Transactions on Information Theory.

[31]  John N. Tsitsiklis,et al.  The complexity of optimal queueing network control , 1994, Proceedings of IEEE 9th Annual Conference on Structure in Complexity Theory.

[32]  Mingyan Liu,et al.  Multi-channel opportunistic access: A case of restless bandits with multiple plays , 2009, 2009 47th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[33]  S. Guha,et al.  Jointly optimal transmission and probing strategies for multichannel wireless systems , 2006, 2006 40th Annual Conference on Information Sciences and Systems.

[34]  Zhen Zhang,et al.  Dynamic Index Coding for Wireless Broadcast Networks , 2013, IEEE Transactions on Information Theory.

[35]  Michael J. Neely,et al.  Stability and Capacity Regions or Discrete Time Queueing Networks , 2010, ArXiv.

[36]  Michael J. Neely,et al.  Dynamic optimization and learning for renewal systems , 2010, 2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers.

[37]  Anthony Ephremides,et al.  Joint Scheduling and Routing for Ad-hoc Networks Under Channel State Uncertainty , 2007, 2007 5th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks and Workshops.

[38]  Peng Shi,et al.  Approximation algorithms for restless bandit problems , 2007, JACM.

[39]  Eytan Modiano,et al.  Fairness and Optimal Stochastic Control for Heterogeneous Networks , 2005, IEEE/ACM Transactions on Networking.

[40]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[41]  Lei Ying,et al.  On throughput optimality with delayed network-state information , 2008 .

[42]  Aditya Gopalan,et al.  On Wireless Scheduling With Partial Channel-State Information , 2012, IEEE Transactions on Information Theory.

[43]  Michael J. Neely,et al.  Energy-optimal scheduling with dynamic channel acquisition in wireless downlinks , 2007, 2007 46th IEEE Conference on Decision and Control.

[44]  Sheldon M. Ross,et al.  Stochastic Processes , 2018, Gauge Integral Structures for Stochastic Calculus and Quantum Electrodynamics.

[45]  Prasanna Chaporkar,et al.  Scheduling with limited information in wireless systems , 2009, MobiHoc '09.

[46]  R. Srikant,et al.  A tutorial on cross-layer optimization in wireless networks , 2006, IEEE Journal on Selected Areas in Communications.

[47]  John N. Tsitsiklis,et al.  The Complexity of Optimal Queuing Network Control , 1999, Math. Oper. Res..

[48]  E. Feron,et al.  Multi-UAV dynamic routing with partial observations using restless bandit allocation indices , 2008, 2008 American Control Conference.

[49]  Michael J. Neely,et al.  Delay and rate-optimal control in a multi-class priority queue with adjustable service rates , 2012, 2012 Proceedings IEEE INFOCOM.

[50]  R. Srikant,et al.  Stable scheduling policies for fading wireless channels , 2005, IEEE/ACM Transactions on Networking.

[51]  Brian M. Sadler,et al.  A Survey of Dynamic Spectrum Access , 2007, IEEE Signal Processing Magazine.

[52]  Chih-ping Li,et al.  Stochastic optimization over parallel queues: Channel-blind scheduling, restless bandit, and optimal delay , 2011 .

[53]  Michael J. Neely,et al.  Network utility maximization over partially observable Markovian channels , 2013, Perform. Evaluation.

[54]  Michael J. Neely,et al.  Opportunistic Cooperation in Cognitive Femtocell Networks , 2011, IEEE Journal on Selected Areas in Communications.

[55]  Leandros Tassiulas,et al.  Dynamic server allocation to parallel queues with randomly varying connectivity , 1993, IEEE Trans. Inf. Theory.

[56]  Atilla Eryilmaz,et al.  Scheduling with rate adaptation under incomplete knowledge of channel/estimator statistics , 2010, 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[57]  Frank Kelly,et al.  Charging and rate control for elastic traffic , 1997, Eur. Trans. Telecommun..

[58]  M. Neely,et al.  Dynamic Optimization and Learning for Renewal Systems , 2010, IEEE Transactions on Automatic Control.

[59]  Michael J. Neely,et al.  Stochastic optimization for Markov modulated networks with application to delay constrained wireless scheduling , 2009, Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference.

[60]  Bhaskar Krishnamachari,et al.  On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance , 2007, IEEE Transactions on Wireless Communications.

[61]  Ananthram Swami,et al.  A Decision-Theoretic Framework for Opportunistic Spectrum Access , 2007, IEEE Wireless Communications.