Potentially unlimited variance reduction in importance sampling of Markov chains

We consider the application of importance sampling in steady-state simulations of finite Markov chains. We show that, for a large class of performance measures, there is a choice of the alternative transition matrix for which the ratio of the variance of the importance sampling estimator to the variance of the naive simulation estimator converges to zero as the sample path length goes to infinity. Obtaining this ‘optimal’ transition matrix involves computing the performance measure of interest, so the optimal matrix cannot be computed in precisely those situations where simulation is required to estimate steady-state performance. However, our results show that alternative transition matrices of the form Q = P + E/T, where P is the original transition matrix and T is the sample path length, can be expected to provide good results. Moreover, we provide an iterative algorithm for obtaining alternative transition matrices of this form that converge to the optimal matrix as the number of iterations increases, and present an example that shows that spending some computer time iterating this algorithm and then conducting the simulation with the resulting alternative transition matrix may provide considerable variance reduction when compared to naive simulation.

[1]  J. Keilson,et al.  A central limit theorem for processes defined on a finite Markov chain , 1964, Mathematical Proceedings of the Cambridge Philosophical Society.

[2]  Sigrún Andradóttir,et al.  On the Choice of Alternative Measures in Importance Sampling with Markov Chains , 1995, Oper. Res..

[3]  John G. Kemeny,et al.  Finite Markov Chains. , 1960 .

[4]  Sigrún Andradóttir,et al.  Variance reduction through smoothing and control variates for Markov chain simulations , 1993, TOMC.

[5]  Paul Bratley,et al.  A guide to simulation , 1983 .

[6]  Jean Walrand,et al.  A quick simulation method for excessive backlogs in networks of queues , 1989 .

[7]  Michael R. Frater,et al.  Optimally efficient estimation of the statistics of rare events in queueing networks , 1991 .

[8]  P. Schweitzer Perturbation theory and finite Markov chains , 1968 .

[9]  Wei Wu,et al.  Numerical Experiments with Iteration and Aggregation for Markov Chains , 1992, INFORMS J. Comput..

[10]  J. Sadowsky Large deviations theory and efficient simulation of excessive backlogs in a GI/GI/m queue , 1991 .

[11]  P. Ney,et al.  Monte Carlo simulation and large deviations theory for uniformly recurrent Markov chains , 1990, Journal of Applied Probability.

[12]  John G. Kemeny,et al.  Finite Markov chains , 1960 .

[13]  P. Glynn,et al.  Varaince reduction in mean time to failure simulations , 1988, 1988 Winter Simulation Conference Proceedings.

[14]  Philip Heidelberger,et al.  Varaince reduction in mean time to failure simulations (1988) , 2007, WSC '07.

[15]  Donald L. Iglehart,et al.  Importance sampling for stochastic simulations , 1989 .

[16]  Paul Bratley,et al.  A guide to simulation (2nd ed.) , 1986 .