Neural-based downlink scheduling algorithm for broadband wireless networks

Wireless local area networks are becoming very popular in many scenarios because they are simple, convenient, and cheap. This paper focuses on multimedia traffic management in wireless networks that must provide differentiated Quality of Service (QoS) levels. We address the complex task of scheduling traffic under multi-objective requirements in the presence of errors introduced by the radio channel. In particular, we focus on managing downlink traffic in both wireless ATM and WiFi scenarios, referring to an infrastructure wireless access network in which a central coordinator makes scheduling decisions for the mobile users in its cell. Our scheduler is based on an Artificial Neural Network (ANN) with reinforcement learning. The ANN is trained from examples to behave as an "optimal" scheduler, according to an Actor-Critic model. Results obtained when scheduling concurrent voice, video, and Web traffic classes show the significant capacity improvement that our scheme achieves over other techniques previously proposed in the literature.
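To make the Actor-Critic idea concrete, the following is a minimal sketch and not the scheduler evaluated in this paper. It assumes a toy cell with a few downlink queues, a memoryless good/bad channel per user, and a reward of 1 for each packet delivered on a good channel. For brevity it replaces the ANN with lookup tables: a critic table of state values and an actor table of action preferences, both updated from the same temporal-difference error. All names and parameter values (train_actor_critic, p_good, arrival, alpha, beta, gamma) are illustrative assumptions.

```python
import math
import random


def softmax(prefs):
    """Turn action preferences into selection probabilities."""
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_actor_critic(steps=5000, n_users=2, p_good=0.7, arrival=0.4,
                       max_q=3, alpha=0.1, beta=0.05, gamma=0.9, seed=0):
    """Tabular actor-critic for a toy downlink scheduler (assumed model)."""
    rng = random.Random(seed)
    value = {}  # critic: state -> estimated value
    prefs = {}  # actor: state -> list of per-user preferences
    queues = [0] * n_users
    chan = [rng.random() < p_good for _ in range(n_users)]
    state = (tuple(queues), tuple(chan))

    for _ in range(steps):
        p = prefs.setdefault(state, [0.0] * n_users)
        probs = softmax(p)
        action = rng.choices(range(n_users), weights=probs)[0]

        # Reward 1 only when a queued packet is sent on a good channel.
        delivered = queues[action] > 0 and chan[action]
        reward = 1.0 if delivered else 0.0
        if delivered:
            queues[action] -= 1

        # Assumed traffic and channel model: Bernoulli arrivals and
        # an independent good/bad channel draw in every slot.
        for u in range(n_users):
            if rng.random() < arrival:
                queues[u] = min(max_q, queues[u] + 1)
            chan[u] = rng.random() < p_good
        next_state = (tuple(queues), tuple(chan))

        # Critic: TD(0) error and value update.
        v = value.get(state, 0.0)
        td_error = reward + gamma * value.get(next_state, 0.0) - v
        value[state] = v + alpha * td_error

        # Actor: shift preferences along the softmax policy gradient.
        for a in range(n_users):
            indicator = 1.0 if a == action else 0.0
            p[a] += beta * td_error * (indicator - probs[a])

        state = next_state

    return prefs, value


if __name__ == "__main__":
    prefs, value = train_actor_critic()
    print(len(prefs), "states visited")
```

In this sketch the critic only judges whether an outcome was better or worse than expected, and the actor uses that signal to favor or disfavor the user it just served. A neural version would replace both tables with function approximators over the queue and channel state.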
