Convergence of parameter sensitivity estimates in a stochastic experiment

To reduce the error in estimating the gradient (parameter sensitivity) of an unknown function is of great importance in stochastic optimization problems. Three kinds of parameter sensitivity estimates using the Monte Carlo method are discussed in this paper. The estimates depend on the number of replications, N, and the change in parameter, ¿v. The convergence properties as N¿¿ and ¿v¿0 for these estimates are obtained. The result explains many theoretical and practical issues in the study of discrete event dynamic systems, as well as continuous dynamic systems, by the Monte Carlo method. It is proved that an estimate based on averaging the gradients calculated along each sample path by a perturbation of the path is much better than the other estimates if the output functions are uniformly differentiable with probability one (w.p.1). It is also concluded that in computer simulations one should always choose the same seed for both v and v+¿v in estimating the parameter sensitivity. Combining the results in this paper with existing stochastic approximation algorithms may yield algorithms with faster convergence.

[1]  W. David Kelton,et al.  Simulation analysis , 1983, WSC '83.

[2]  A. Ruszczynski,et al.  Stochastic approximation method with gradient averaging for unconstrained problems , 1983 .

[3]  Rajan Suri,et al.  Infinitesimal perturbation analysis of discrete event dynamic systems: A general theory , 1983, The 22nd IEEE Conference on Decision and Control.

[4]  Y. Ho,et al.  Perturbation analysis of sojourn time in queueing networks , 1983, The 22nd IEEE Conference on Decision and Control.

[5]  R. Suri Implementation of sensitivity calculations on a monte carlo experiment , 1983 .

[6]  Y. Ho,et al.  Perturbation analysis and optimization of queueing networks , 1982, 1982 21st IEEE Conference on Decision and Control.

[7]  Christos G. Cassandras,et al.  Infinitesimal and finite perturbation analysis for queueing networks , 1982, 1982 21st IEEE Conference on Decision and Control.

[8]  Philip Heidelberger,et al.  A spectral method for confidence interval generation and run length control in simulations , 1981, CACM.

[9]  Yu-Chi Ho,et al.  Parametric sensitivity of a statistical experiment , 1979 .

[10]  C. H. Sauer,et al.  Sequential stopping rules for the regenerative method of simulation , 1977 .

[11]  Lennart Ljung,et al.  Analysis of recursive stochastic algorithms , 1977 .

[12]  Jeffrey P. Buzen,et al.  Computational algorithms for closed queueing networks with exponential servers , 1973, Commun. ACM.

[13]  A. Dvoretzky On Stochastic Approximation , 1956 .

[14]  J. Kiefer,et al.  Stochastic Estimation of the Maximum of a Regression Function , 1952 .