Online stochastic optimization under time constraints

This paper considers online stochastic combinatorial optimization problems where uncertainties, i.e., which requests come and when, are characterized by distributions that can be sampled and where time constraints severely limit the number of offline optimizations which can be performed at decision time and/or in between decisions. It proposes online stochastic algorithms that combine the frameworks of online and stochastic optimization. Online stochastic algorithms differ from traditional a priori methods such as stochastic programming and Markov Decision Processes by focusing on the instance data that is revealed over time. The paper proposes three main algorithms: expectation E, consensus C, and regret R. They all make online decisions by approximating, for each decision, the solution to a multi-stage stochastic program using an exterior sampling method and a polynomial number of samples. The algorithms were evaluated experimentally and theoretically. The experimental results were obtained on three applications of different nature: packet scheduling, multiple vehicle routing with time windows, and multiple vehicle dispatching. The theoretical results show that, under assumptions which seem to hold on these, and other, applications, algorithm E has an expected constant loss compared to the offline optimal solution. Algorithm R reduces the number of optimizations by a factor |R|, where R is the number of requests, and has an expected ρ(1+o(1)) loss when the regret gives a ρ-approximation to the offline problem.

[1]  Jan Vondrák,et al.  Approximating the stochastic knapsack problem: the benefit of adaptivity , 2004, 45th Annual IEEE Symposium on Foundations of Computer Science.

[2]  Werner Römisch,et al.  Stability of Solutions for Stochastic Programs with Complete Recourse , 1993, Math. Oper. Res..

[3]  Matthew Brand,et al.  Marginalizing Out Future Passengers in Group Elevator Control , 2003, UAI.

[4]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[5]  Robert Givan,et al.  On-line Scheduling via Sampling , 2000, AIPS.

[6]  Jay H. Lee,et al.  Dynamic programming in a heuristically confined state space: a stochastic resource-constrained project scheduling application , 2004, Comput. Chem. Eng..

[7]  Marius M. Solomon,et al.  Algorithms for the Vehicle Routing and Scheduling Problems with Time Window Constraints , 1987, Oper. Res..

[8]  Gerhard J. Woeginger,et al.  Developments from a June 1996 seminar on Online algorithms: the state of the art , 1998 .

[9]  Jay H. Lee,et al.  An algorithmic framework for improving heuristic solutions: Part I. A deterministic discount coupon traveling salesman problem , 2004, Comput. Chem. Eng..

[10]  Alexander Shapiro,et al.  On complexity of multistage stochastic programs , 2006, Oper. Res. Lett..

[11]  Claire Mathieu,et al.  On the Sum-of-Squares algorithm for bin packing , 2002, JACM.

[12]  Marius M. Solomon,et al.  Partially dynamic vehicle routing—models and algorithms , 2002, J. Oper. Res. Soc..

[13]  Alexander Shapiro,et al.  The Sample Average Approximation Method for Stochastic Discrete Optimization , 2002, SIAM J. Optim..

[14]  Benjamin Van Roy,et al.  Approximate Linear Programming for Average-Cost Dynamic Programming , 2002, NIPS.

[15]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[16]  Paul Shaw,et al.  Using Constraint Programming and Local Search Methods to Solve Vehicle Routing Problems , 1998, CP.

[17]  Charles E. Blair,et al.  The value function of an integer program , 1982, Math. Program..

[18]  Michel Gendreau,et al.  Parallel Tabu Search for Real-Time Vehicle Routing and Dispatching , 1999, Transp. Sci..

[19]  Boaz Patt-Shamir,et al.  Buffer overflow management in QoS switches , 2001, STOC '01.

[20]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[21]  Russell Bent,et al.  Online Stochastic Optimization Without Distributions , 2005, ICAPS.

[22]  Russell Bent,et al.  A Two-Stage Hybrid Local Search for the Vehicle Routing Problem with Time Windows , 2004, Transp. Sci..

[23]  Russell Bent,et al.  The Value of Consensus in Online Stochastic Scheduling , 2004, ICAPS.

[24]  Russell Bent,et al.  Scenario-Based Planning for Partially Dynamic Vehicle Routing with Stochastic Customers , 2004, Oper. Res..

[25]  Allan Borodin,et al.  Adversarial queuing theory , 2001, JACM.

[26]  S. Ross A First Course in Probability , 1977 .

[27]  Anna R. Karlin,et al.  Competitive snoopy caching , 1986, 27th Annual Symposium on Foundations of Computer Science (sfcs 1986).

[28]  Leen Stougie,et al.  Approximation in Stochastic integer programming , 2003 .

[29]  Chaitanya Swamy,et al.  Stochastic optimization is (almost) as easy as deterministic optimization , 2004, 45th Annual IEEE Symposium on Foundations of Computer Science.

[30]  Alan M. Frieze,et al.  On the random 2-stage minimum spanning tree , 2005, SODA '05.

[31]  Gerhard J. Woeginger,et al.  Radio Labeling with Preassigned Frequencies , 2004, SIAM J. Optim..

[32]  Russell Bent,et al.  Sub-optimality Approximations , 2005, CP.

[33]  A Gerodimos,et al.  Robust Discrete Optimization and its Applications , 1996, J. Oper. Res. Soc..

[34]  Rüdiger Schultz Continuity Properties of Expectation Functions in Stochastic Integer Programming , 1993, Math. Oper. Res..

[35]  Christos H. Papadimitriou,et al.  Beyond competitive analysis [on-line algorithms] , 1994, Proceedings 35th Annual Symposium on Foundations of Computer Science.

[36]  Susan Messer Regrets Only , 2007 .

[37]  Maarten Hendrikus van der Vlerk Stochastic programming with integer recourse , 1995 .

[38]  Mauro Dell'Amico,et al.  Annotated Bibliographies in Combinatorial Optimization , 1997 .

[39]  Peter Kall,et al.  Stochastic Programming , 1995 .

[40]  John R. Birge,et al.  Introduction to Stochastic Programming , 1997 .

[41]  CsirikJanos,et al.  On the Sum-of-Squares algorithm for bin packing , 2006 .

[42]  Alexander Shapiro,et al.  A simulation-based approach to two-stage stochastic programming with recourse , 1998, Math. Program..

[43]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[44]  U. Rieder,et al.  Markov Decision Processes , 2010 .

[45]  R. Schultz,et al.  Two-stage stochastic integer programming : a survey , 1996 .

[46]  Leen Stougie,et al.  Solving stochastic programs with integer recourse by enumeration: A framework using Gröbner basis , 1995, Math. Program..

[47]  Martin W. P. Savelsbergh,et al.  Decision Support for Consumer Direct Grocery Initiatives , 2005, Transp. Sci..

[48]  Russell Bent,et al.  Regrets Only! Online Stochastic Optimization under Time Constraints , 2004, AAAI.

[49]  Eric Bourreau,et al.  Towards Stochastic Constraint Programming: A Study of Online Multi-choice Knapsack with Deadlines , 2001, CP.

[50]  Rüdiger Schultz On structure and stability in stochastic programs with random technology matrix and complete integer recourse , 1995, Math. Program..

[51]  Rolf H. Möhring,et al.  Approximation in stochastic scheduling: the power of LP-based priority policies , 1999, JACM.

[52]  Russell Bent,et al.  Online Stochastic and Robust Optimization , 2004, ASIAN.

[53]  Gideon Weiss,et al.  Stochastic scheduling problems I — General strategies , 1984, Z. Oper. Research.

[54]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[55]  Boaz Patt-Shamir,et al.  Buffer Overflow Management in QoS Switches , 2004, SIAM J. Comput..

[56]  Martin Skutella,et al.  Stochastic Machine Scheduling with Precedence Constraints , 2005, SIAM J. Comput..