Measure-valued differentiation for stochastic processes: the finite horizon case

This paper addresses the problem of sensitivity analysis for finite horizon performance measures of general Markov chains. We derive closed-form expressions and associated unbiased gradient estimators for derivatives of finite products of Markov kernels by means of measure-valued differentiation (MVD). In the MVD setting, derivatives of Markov kernels, called D-derivatives, are defined with respect to a suitably chosen class of performance functions D, such that for any performance function g ∈ D the derivative of the integral of g with respect to the one-step transition probability of the Markov chain exists. The MVD approach (1) yields results that apply to any performance function in the predefined class, (2) admits a product rule of differentiation, so that differentiating the transition kernel immediately yields finite horizon results, (3) provides an operator-language approach to the differentiation of Markov chains, and (4) makes explicit the trade-off between the generality of the class of performance functions and the generality of the class of measures (Markov kernels) that can be analyzed. The D-derivative of a measure can be interpreted in terms of various (unbiased) gradient estimators, and the product rule for D-differentiation yields a corresponding product rule for these estimators.
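As a concrete illustration of the kind of unbiased estimator the D-derivative yields (this example is a sketch and is not taken from the paper), consider the exponential distribution with rate θ. A well-known weak-derivative decomposition writes its derivative with respect to θ as a scaled difference of two measures, d/dθ Exp(θ) = (1/θ)(Exp(θ) − Erlang(2, θ)), so that d/dθ E[g(X)] can be estimated by sampling one "positive" and one "negative" phantom variable per replication. The function name `mvd_gradient_estimate` below is our own choice for illustration.

```python
import random

def mvd_gradient_estimate(theta, g, n_samples, seed=0):
    """Monte Carlo MVD estimator of d/dtheta E[g(X)] for X ~ Exp(theta).

    Uses the weak-derivative triple for the exponential distribution:
        d/dtheta Exp(theta) = (1/theta) * (Exp(theta) - Erlang(2, theta)),
    i.e. the derivative is the scaled difference of a "positive" and a
    "negative" phantom measure.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        # Sample from the positive part, Exp(theta).
        x_plus = rng.expovariate(theta)
        # Sample from the negative part, Erlang(2, theta) = sum of two Exp(theta).
        x_minus = rng.expovariate(theta) + rng.expovariate(theta)
        total += (g(x_plus) - g(x_minus)) / theta
    return total / n_samples

theta = 2.0
# For g(x) = x we have E[X] = 1/theta, so the exact derivative is
# -1/theta**2 = -0.25; the estimate should be close to that value.
est = mvd_gradient_estimate(theta, lambda x: x, n_samples=200_000)
print(est)
```

Because each replication draws from both measures of the decomposition, the estimator is unbiased for the derivative, which is the property that the paper's product rule then propagates through finite products of kernels.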