A Game Theoretic Framework for Multipath Optimal Data Transfer in Multiuser Overlay Networks

In this paper, we study the problem of optimal data transfer over multiple overlay paths. Instead of solving the problem from the single controller point of view, we adopt the game theory perspective to consider the problem from a more realistic view where multiple traffic controllers competing for the shared bandwidth. We formulate the problem as a general- sum stochastic game, and a reinforcement learning technique namely Correlated-Q Learning is implemented to derive the best- possible strategy, i.e. the strategy to play correlated equilibrium (CE) for each controller. Through a proof-of-concept simulation scenario with 2 overlay paths and 2 controllers, we show that by playing cooperative strategies, e.g. CE, the controllers can achieve superior performance compared to acting selfishly. The result emphasizes that considering the problem of optimal multipath data transfer from the single controller perspective is inadequate.

[1]  Fan Yang,et al.  AMTP: a multipath multimedia streaming protocol for mobile ad hoc networks , 2005, IEEE International Conference on Communications, 2005. ICC 2005. 2005.

[2]  Hari Balakrishnan,et al.  Resilient overlay networks , 2001, SOSP.

[3]  Eitan Altman,et al.  A survey on networking games in telecommunications , 2006, Comput. Oper. Res..

[4]  Rauf Izmailov,et al.  Fast replication in content distribution overlays , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[5]  B. Chaib-draa,et al.  Multiagent Q-Learning : Preliminary Study on Dominance between the Nash and Stackelberg Equilibriums , 2005 .

[6]  Michael P. Wellman,et al.  Nash Q-Learning for General-Sum Stochastic Games , 2003, J. Mach. Learn. Res..

[7]  David G. Andersen,et al.  Improving end-to-end availability using overlay networks , 2004 .

[8]  Shang Zhi,et al.  A proof of the queueing formula: L=λW , 2001 .

[9]  Edward W. Knightly,et al.  Opportunistic traffic scheduling over multiple network paths , 2004, IEEE INFOCOM 2004.

[10]  Krishna R. Pattipati,et al.  Application-layer multipath data transfer via TCP: Schemes and performance tradeoffs , 2007, Perform. Evaluation.

[11]  Thomas R. Gross,et al.  Multipath streaming in best-effort networks , 2003, IEEE International Conference on Communications, 2003. ICC '03..

[12]  Keith B. Hall,et al.  Correlated Q-Learning , 2003, ICML.

[13]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[14]  Samir Khuller,et al.  Large-scale data collection: a coordinated approach , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[15]  J. Little A Proof for the Queuing Formula: L = λW , 1961 .

[16]  Weiping Zhu,et al.  Modelling Internet End-to-End Loss Behaviours: A New Approach , 2007, First Asia International Conference on Modelling & Simulation (AMS'07).

[17]  Pascal Frossard,et al.  Video Packet Selection and Scheduling for Multipath Streaming , 2007, IEEE Transactions on Multimedia.

[18]  Michael P. Wellman,et al.  Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[19]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .