An adaptive dynamic programming algorithm for a stochastic multiproduct batch dispatch problem

We address the problem of dispatching a vehicle with different product classes. There is a common dispatch cost, but holding costs that vary by product class. The problem exhibits multidimensional state, outcome and action spaces, and as a result is computationally intractable using either discrete dynamic programming methods, or even as a deterministic integer program. We prove a key structural property for the decision function, and exploit this property in the development of continuous value function approximations that form the basis of an approximate dispatch rule. Comparisons on single product‐class problems, where optimal solutions are available, demonstrate solutions that are within a few percent of optimal. The algorithm is then applied to a problem with 100 product classes, and comparisons against a carefully tuned myopic heuristic demonstrate significant improvements. © 2003 Wiley Periodicals, Inc. Naval Research Logistics 50: 742–769, 2003.

[1]  Norman T. J. Bailey,et al.  On Queueing Processes with Bulk Service , 1954 .

[2]  M. Neuts A General Class of Bulk Queues with Poisson Input , 1967 .

[3]  Peter J. Kolesar,et al.  Operating Characteristics of a Simple Shuttle under Local Dispatching Rules , 1972, Oper. Res..

[4]  R. Serfozo,et al.  Optimal control of batch service queues , 1973, Advances in Applied Probability.

[5]  Anthony Barnett On operating a shuttle service , 1973, Networks.

[6]  M. J. Phillips,et al.  Stochastic theory of service systems , 1974 .

[7]  Peter J. Kolesar,et al.  Optimal Dispatching of an Infinite-Capacity Shuttle: Control at a Single Terminal , 1974, Oper. Res..

[8]  R. Deb,et al.  Optimal control of batch service queues with switching costs , 1976 .

[9]  Stanley R. Pliska,et al.  Optimal Control of Some Markov Processes with Applications to Batch Queueing and Continuous Review Inventory Systems , 1976 .

[10]  R. Deb Optimal Dispatching of a Finite Capacity Shuttle. , 1978 .

[11]  Howard J. Weiss Technical Note - Further Results on an Infinite Capacity Shuttle with Control at a Single Terminal , 1981, Oper. Res..

[12]  M. L. Chaudhry,et al.  A first course in bulk queues , 1983 .

[13]  C. D. Litton,et al.  A First Course in Bulk Queues , 1983 .

[14]  J. Medhi Recent developments in bulk queueing models , 1984 .

[15]  Warren B. Powell,et al.  Analysis of Vehicle Holding and Cancellation Strategies in Bulk Arrival, Bulk Service Queues , 1985, Transp. Sci..

[16]  Warren B. Powell,et al.  The Bulk Service Queue with a General Control Strategy: Theoretical Analysis and a New Computational Procedure , 1986, Oper. Res..

[17]  Charles P. Schmidt,et al.  Optimal average cost policies for the two-terminal shuttle , 1987 .

[18]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[19]  Walter Ukovich,et al.  Minimizing Transportation and Inventory Costs for Several Products on a Single Link , 1994, Oper. Res..

[20]  John N. Tsitsiklis,et al.  Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.

[21]  John N. Tsitsiklis,et al.  Analysis of temporal-difference learning with function approximation , 1996, NIPS 1996.

[22]  Maria Grazia Speranza,et al.  An algorithm for optimal shipments with given frequencies , 1996 .

[23]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[24]  Sridhar R. Tayur,et al.  Managing Stochastic Multiproduct Systems: Model, Measures, and Analysis , 1998, Oper. Res..

[25]  Luca Bertazzi,et al.  Minimizing logistic costs in multistage supply chains , 1999 .

[26]  Ram Akella,et al.  Single-Period Multiproduct Inventory Models with Substitution , 1999, Oper. Res..

[27]  Luca Bertazzi,et al.  Inventory control on sequences of links with given transportation frequencies , 1999 .

[28]  Warrren B Powell,et al.  An Adaptive, Distribution-Free Algorithm for the Newsvendor Problem with Censored Demands, with Applications to Inventory and Distribution , 2001 .