Optimal Scheduling over Time-Varying Channels with Traffic Admission Control: Structural Results and Online Learning Algorithms

This work studies the joint scheduling- admission control (SAC) problem for a single user over a fading channel. Specifically, the SAC problem is formulated as a constrained Markov decision process (MDP) to maximize a utility defined as a function of the throughput and queue size. The optimal throughput- queue size trade-off is investigated. Optimal policies and their structural properties (i.e., monotonicity and convexity) are derived for two models: simultaneous and sequential scheduling and admission control actions. Furthermore, we propose online learning algorithms for the optimal policies for the two models when the statistical knowledge of the time-varying traffic arrival and channel processes is unknown. The analysis and algorithm development are relied on the reformulation of the Bellman's optimality equations using suitably defined state-value functions which can be learned online, at transmission time, using time-averaging. The learning algorithms require less complexity and converge faster than the conventional Q-learning algorithms. This work also builds a connection between the MDP based formulation and the Lyapunov optimization based formulation for the SAC problem. Illustrative results demonstrate the performance of the proposed algorithms in various settings.

[1]  R. Amir Supermodularity and Complementarity in Economics: An Elementary Survey , 2003 .

[2]  V. Borkar Stochastic Approximation: A Dynamical Systems Viewpoint , 2008 .

[3]  Vincent K. N. Lau,et al.  Decentralized Delay Optimal Control for Interference Networks With Limited Renewable Energy Storage , 2012, IEEE Transactions on Signal Processing.

[4]  Leandros Tassiulas,et al.  Resource Allocation and Cross-Layer Control in Wireless Networks , 2006, Found. Trends Netw..

[5]  Mihaela van der Schaar,et al.  Fast Reinforcement Learning for Energy-Efficient Wireless Communication , 2010, IEEE Transactions on Signal Processing.

[6]  Vivek S. Borkar,et al.  Structural Properties of Optimal Transmission Policies Over a Randomly Varying Channel , 2008, IEEE Transactions on Automatic Control.

[7]  Saleem A. Kassam,et al.  Finite-state Markov model for Rayleigh fading channels , 1999, IEEE Trans. Commun..

[8]  Mihaela van der Schaar,et al.  Structure-Aware Stochastic Control for Transmission Scheduling , 2010, IEEE Transactions on Vehicular Technology.

[9]  Heng Wang,et al.  A simple packet-transmission scheme for wireless data over fading channels , 2004, IEEE Transactions on Communications.

[10]  Vikram Krishnamurthy,et al.  MIMO Transmission Control in Fading Channels—A Constrained Markov Decision Process Formulation With Monotone Randomized Policies , 2007, IEEE Transactions on Signal Processing.

[11]  Vikram Krishnamurthy,et al.  ${Q}$-Learning Algorithms for Constrained Markov Decision Processes With Randomized Monotone Policies: Application to MIMO Transmission Control , 2007, IEEE Transactions on Signal Processing.

[12]  Vinod Sharma,et al.  Optimal Cross-Layer Scheduling of Transmissions Over a Fading Multiaccess Channel , 2008, IEEE Transactions on Information Theory.

[13]  S. Shenker Fundamental Design Issues for the Future Internet , 1995 .

[14]  E. K. Gannett,et al.  THE INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS , 1965 .

[15]  Michael J. Neely,et al.  Energy optimal control for time-varying wireless networks , 2005, IEEE Transactions on Information Theory.

[16]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[17]  Abhijeet Bhorkar,et al.  An on-line learning algorithm for energy efficient delay constrained scheduling over a fading channel , 2008, IEEE Journal on Selected Areas in Communications.

[18]  Eytan Modiano,et al.  Optimal energy allocation for delay-constrained data transmission over a time-varying channel , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[19]  Mihaela van der Schaar,et al.  A systematic framework for dynamically optimizing multi-user wireless video transmission , 2009, IEEE Journal on Selected Areas in Communications.

[20]  Mingyan Liu,et al.  Energy-Efficient Transmission Scheduling With Strict Underflow Constraints , 2011, IEEE Transactions on Information Theory.

[21]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.