论文信息 - Estimation of the derivative of a stationary measure with respect to a control parameter

Estimation of the derivative of a stationary measure with respect to a control parameter

The paper deals with a problem which arises in the Monte Carlo optimization of steady state or ergodic systems which can be modelled by Markov chains. The transition probability depends on a parameter, and one wishes to find the parameter value at which some performance function is minimum. The only available data are obtained from either simulation or actual operating information. For such a problem ore needs good statistical estimates of the derivatives. Conditions are given for the existence of the derivative of the stationary measure with respect to the parameter, in the sense that the derivative is a signed measure, and is the limit of the natural approximating sequence. Some properties and a useful characterization of the derivative are obtained. It is also shown that, under appropriate conditions, the derivative of the n-step transition function converges to the derivative of the stationary measure as n tends to oo. This latter result is of particular importance whether one is simply estimating or is actually optimizing via some sequential Monte Carlo procedure, since the basic observations are always taken over a finite time interval.

H. Kushner | F. Vázquez-Abad

[1] P. Schweitzer. Perturbation theory and finite Markov chains , 1968 .

[2] H. Kushner,et al. Averaging Methods for the Asymptotic Analysis of Learning and Adaptive Systems, with Small Adjustment Rate. Analysis of Nonlinear Stochastic Systems with Wide-Band Inputs. , 1980 .

[3] Christos G. Cassandras,et al. A new approach to the analysis of discrete event dynamic systems , 1983, Autom..

[4] Peter W. Glynn,et al. Stochastic approximation for Monte Carlo optimization , 1986, WSC '86.

[5] C. D. Meyer,et al. Using the QR factorization and group inversion to compute, differentiate ,and estimate the sensitivity of stationary probabilities for markov chains , 1986 .

[6] J. Ben Atkinson,et al. An Introduction to Queueing Networks , 1988 .

[7] Donald L. Iglehart,et al. Simulation methods for queues: An overview , 1988, Queueing Syst. Theory Appl..

[8] R. Suri,et al. Perturbation analysis gives strongly consistent sensitivity estimates for the M/G/ 1 queue , 1988 .

[9] Reuven Y. Rubinstein,et al. Sensitivity Analysis and Performance Extrapolation for Computer Simulation Models , 1989, Oper. Res..

[10] On using perturbation analysis to do sensitivity analysis: derivatives vs. differences , 1989, Proceedings of the 28th IEEE Conference on Decision and Control,.

[11] Alan Weiss,et al. Sensitivity Analysis for Simulations via Likelihood Ratios , 1989, Oper. Res..