Dynamic traffic splitting to parallel wireless networks with partial information: A Bayesian approach

Contemporary wireless networks are based on a wide range of different technologies providing overlapping coverage. This offers users a seamless integration of connectivity by allowing to switch between networks, and opens up a promising area for boosting the performance of wireless networks. Motivated by this, we consider a networking environment in which users are able to select between the available wireless networks to minimize the mean processing times for file downloads in the presence of background traffic. The information available to the user is only the total number of jobs in each network, rather than the per-network numbers of foreground and background jobs. This leads to a complex partial information decision problem which is the focus of this paper. We develop and evaluate a Bayesian learning algorithm that optimally splits a stream of jobs that minimizes the expected sojourn time. The algorithm learns as the system operates and provides information at each decision and departure epoch. We evaluate the optimality of the partial information algorithm by comparing the performance of the algorithm with the ''ideal'' performance obtained by solving a Markov decision problem with full state information. To this end, we have conducted extensive experiments both numerically and in a simulation testbed with the full wireless protocol stack. The results show that the Bayesian algorithm has close to optimal performance over a wide range of parameter values.

[1]  Pablo Rodriguez,et al.  Parallel-access for mirror sites in the Internet , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[2]  W. Fleming Book Review: Discrete-time Markov control processes: Basic optimality criteria , 1997 .

[3]  Patchigolla Kiran Kumar,et al.  A Survey of Some Results in Stochastic Adaptive Control , 1985 .

[4]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[5]  HasegawaYohei,et al.  Deployable multipath communication scheme with sufficient performance data distribution method , 2007 .

[6]  John N. Tsitsiklis,et al.  The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..

[7]  Mostafa Ammar,et al.  On the effect of large-scale deployment of parallel downloading , 2003, Proceedings the Third IEEE Workshop on Internet Applications. WIAPP 2003.

[8]  Wenju Liu,et al.  Region-Based Approximations for Planning in Stochastic Domains , 1997, UAI.

[9]  Leslie Pack Kaelbling,et al.  Learning Policies for Partially Observable Environments: Scaling Up , 1997, ICML.

[10]  Dilip Sarkar,et al.  Concurrent multipath transport , 2007, Comput. Commun..

[11]  Apostolos Burnetas,et al.  Optimal Adaptive Policies for Markov Decision Processes , 1997, Math. Oper. Res..

[12]  Milos Hauskrecht,et al.  Planning and control in stochastic domains with imperfect information , 1997 .

[13]  George E. Monahan,et al.  A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 2007 .

[14]  Tutomu Murase,et al.  Deployable multipath communication scheme with sufficient performance data distribution method , 2007, Comput. Commun..

[15]  J. Duncanson,et al.  Inverse multiplexing , 1994, IEEE Communications Magazine.

[16]  Ronen I. Brafman,et al.  A Heuristic Variable Grid Solution Method for POMDPs , 1997, AAAI/IAAI.

[17]  Paramvir Bahl,et al.  MultiNet: connecting to multiple IEEE 802.11 networks using a single wireless card , 2004, IEEE INFOCOM 2004.

[18]  W. Lovejoy A survey of algorithmic methods for partially observed Markov decision processes , 1991 .

[19]  Donald C. Cox Fundamental limitations on increasing data rate in wireless systems , 2008, IEEE Communications Magazine.

[20]  A. Cassandra,et al.  Exact and approximate algorithms for partially observable markov decision processes , 1998 .

[21]  O. Hernández-Lerma,et al.  Discrete-time Markov control processes , 1999 .

[22]  Sally Floyd,et al.  TCP Selective Acknowledgement Options , 1996 .

[23]  Sally Floyd,et al.  TCP Selective Acknowledgment Options , 1996, RFC.

[24]  S. Christian Albright,et al.  Structural Results for Partially Observable Markov Decision Processes , 1979, Oper. Res..

[25]  Robert D. van der Mei,et al.  Effective load for flow-level performance modelling of file transfers in wireless LANs , 2010, Comput. Commun..

[26]  G. Monahan State of the Art—A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms , 1982 .

[27]  Chelsea C. White,et al.  A survey of solution techniques for the partially observed Markov decision process , 1991, Ann. Oper. Res..

[28]  Stuart J. Russell,et al.  Approximating Optimal Policies for Partially Observable Stochastic Domains , 1995, IJCAI.

[29]  K. M. vanHee,et al.  Bayesian control of Markov chains , 1978 .

[30]  Raghupathy Sivakumar,et al.  A Transport Layer Approach for Achieving Aggregate Bandwidths on Multi-Homed Mobile Hosts , 2002, MobiCom '02.

[31]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[32]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .