Fast Simulation of Markov Chains with Small Transition Probabilities

Consider a finite-state Markov chain where the transition probabilities differ by orders of magnitude. This Markov chain has an "attractor state," i.e., from any state of the Markov chain there exists a sample path ofsignificant probability to the attractor state. There also exists a "rare set," which is accessible from the attractor state only by sample paths of very small probability. The problem is to estimate the probability that starting from the attractor state, the Markov chain hits the rare set before returning to the attractor state. Examples of this setting arise in the case of reliability models with highly reliable components as well as in the case of queueing networks with low traffic. Importance-sampling is a commonly used simulation technique for the fast estimation of rare-event probabilities. It involves simulating the Markov chain under a new probability measure that emphasizes the most likely paths to the rare set. Previous research focused on developing importance-sampling schemes for a special case of Markov chains that did not include "high-probability cycles." We show through examples that the Markov chains used to model many commonly encountered systems do have high-probability cycles, and existing importance-sampling schemes can lead to infinite variance in simulating such systems. We then develop the insight that in the presence of high-probability cycles care should be taken in allocating the new transition probabilities so that the variance accumulated over these cycles does not increase without bounds. Based on this observation we develop two importance-sampling techniques that have the bounded relative error property, i.e., the simulation run-length required to estimate the rare-event probability to a fixed degree of accuracy remains bounded as the event of interest becomes more rare.

[1]  Michael A. Crane,et al.  Simulating Stable Stochastic Systems: III. Regenerative Processes and Discrete-Event Simulations , 1975, Oper. Res..

[2]  D. Siegmund Importance Sampling in the Monte Carlo Study of Sequential Tests , 1976 .

[3]  N. D. Durie,et al.  Digest of papers , 1976 .

[4]  Elmer E Lewis,et al.  Monte Carlo simulation of Markov unreliability models , 1984 .

[5]  Stephen S. Lavenberg,et al.  Modeling and Analysis of Computer System Availability , 1987, Computer Performance and Reliability.

[6]  李幼升,et al.  Ph , 1989 .

[7]  Donald L. Iglehart,et al.  Importance sampling for stochastic simulations , 1989 .

[8]  Jean Walrand,et al.  A quick simulation method for excessive backlogs in networks of queues , 1989 .

[9]  J. Sadowsky Large deviations theory and efficient simulation of excessive backlogs in a GI/GI/m queue , 1991 .

[10]  Juan A. Carrasco Failure distance-based simulation of repairable fault-tolerant systems , 1992 .

[11]  Sandeep Juneja,et al.  Fast simulation of Markovian reliability/availability models with general repair policies , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[12]  Philip Heidelberger,et al.  A Unified Framework for Simulating Markovian Models of Highly Dependable Systems , 1992, IEEE Trans. Computers.

[13]  Perwez Shahabuddin,et al.  Importance sampling for the simulation of highly reliable Markovian systems , 1994 .

[14]  William H. Sanders,et al.  Importance Sampling Simulation in UltraSAN , 1994, Simul..

[15]  Marvin K. Nakayama A characterization of the simple failure-biasing method for simulations of highly reliable Markovian Systems , 1994, TOMC.

[16]  Philip Heidelberger,et al.  Modeling and analysis of system dependability using the System Availability Estimator , 1994, Proceedings of IEEE 24th International Symposium on Fault- Tolerant Computing.

[17]  Philip Heidelberger,et al.  Effective Bandwidth and Fast Simulation of ATM Intree Networks , 1994, Perform. Evaluation.

[18]  Philip Heidelberger,et al.  Bounded relative error in estimating transient measures of highly dependable non-Markovian systems , 1994, TOMC.

[19]  Philip Heidelberger,et al.  Fast simulation of rare events in queueing and reliability models , 1993, TOMC.

[20]  Marvin K. Nakayama,et al.  General conditions for bounded relative error in simulations of highly reliable Markovian systems , 1996, Advances in Applied Probability.

[21]  Kishor S. Trivedi,et al.  THE SYSTEM AVAILABILITY ESTIMATOR , 1996 .

[22]  Ambuj,et al.  Monte Carlo Simulation of Computer System Availability / Reliability Models , 2001 .

[23]  Sandeep Juneja Importance Sampling and the Cyclic Approach , 2001, Oper. Res..