Distributed dynamic event tree generation for reliability and risk assessment

Level 2 probabilistic risk assessments of nuclear plants (analysis of radionuclide release from containment) may require hundreds of runs of severe accident analysis codes such as MELCOR or RELAP/SCDAP to analyze possible sequences of events (scenarios) that may follow given initiating events. With the advances in computer architectures and ubiquitous networking, it is now possible to utilize multiple computing and storage resources for such computational experiments. This paper presents a system software infrastructure that supports execution and analysis of multiple dynamic event-tree simulations on distributed environments. The infrastructure allow for 1) the testing of event tree completeness, and, 2) the assessment and propagation of uncertainty on the plant state in the quantification of event trees

[1]  Joel H. Saltz,et al.  A distributed data management middleware for data-driven application systems , 2004, 2004 IEEE International Conference on Cluster Computing (IEEE Cat. No.04EX935).

[2]  Thomas G. Robertazzi,et al.  Ten Reasons to Use Divisible Load Theory , 2003, Computer.

[3]  Gary J. Powers,et al.  Computer-aided Synthesis of Fault-trees , 1977, IEEE Transactions on Reliability.

[4]  Tunc Aldemir,et al.  A data base oriented dynamic methodology for the failure analysis of closed loop control systems in process plant , 1990 .

[5]  Cevdet Aykanat,et al.  Iterative-Improvement-Based Heuristics for Adaptive Scheduling of Tasks Sharing Files on Heterogeneous Master-Slave Environments , 2006, IEEE Transactions on Parallel and Distributed Systems.

[6]  Nathan O. Siu Dynamic Approaches — Issues and Methods: An Overview , 1994 .

[7]  Yves Robert,et al.  Scheduling Tasks Sharing Files on Heterogeneous Master-Slave Platforms , 2004, PDP.

[8]  Dror G. Feitelson,et al.  Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling , 2001, IEEE Trans. Parallel Distributed Syst..

[9]  Yves Robert,et al.  Scheduling Tasks Sharing Files from Distributed Repositories , 2004, Euro-Par.

[10]  Andrew A. Chien,et al.  Resource Management for Rapid Application Turnaround on Enterprise Desktop Grids , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[11]  Tunc Aldemir,et al.  Computer-Assisted Markov Failure Modeling of Process Control Systems , 1987, IEEE Transactions on Reliability.

[12]  Jeffrey K. Hollingsworth,et al.  An API for Runtime Code Patching , 2000, Int. J. High Perform. Comput. Appl..

[13]  Karsten Schwan,et al.  Falcon: On‐line monitoring for steering parallel programs , 1998 .

[14]  Yolanda Gil,et al.  Pegasus: Mapping Scientific Workflows onto the Grid , 2004, European Across Grids Conference.

[15]  Sathish S. Vadhiyar,et al.  SRS: A Framework for Developing Malleable and Migratable Parallel Applications for Distributed Systems , 2003, Parallel Process. Lett..

[16]  Gareth W. Parry Critique of Current Practice in the Treatment of Human Interactions in Probabilistic Safety Assessments , 1994 .

[17]  R. M. Summers,et al.  MELCOR 1. 8. 0: A computer code for nuclear reactor severe accident source term and risk assessment analyses , 1991 .

[18]  Willy Zwaenepoel,et al.  The performance of consistent checkpointing , 1992, [1992] Proceedings 11th Symposium on Reliable Distributed Systems.

[19]  Kai Li,et al.  Memory Exclusion: Optimizing the Performance of Checkpointing Systems , 1999, Softw. Pract. Exp..

[20]  Hiromitsu Kumamoto,et al.  Safety and reliability synthesis of systems with control loops , 1979 .

[21]  G. Reina,et al.  DYLAM-1 : a software package for event sequence and consequence spectrum methodology , 1984 .

[22]  Joel H. Saltz,et al.  A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O , 2005, CCGRID.

[23]  J.R. Taylor,et al.  An Algorithm For Fault-Tree Construction , 1982, IEEE Transactions on Reliability.

[24]  Carol Smidts,et al.  The Event Sequence Diagram framework for dynamic Probabilistic Risk Assessment , 1999 .

[25]  Norman J. McCormick,et al.  Reliability and Risk Analysis , 1981, IEEE Transactions on Reliability.

[26]  Ken Kennedy,et al.  TaskScheduling Strategies forWorkflow-based Applications inGrids , 2005 .

[27]  Rajkumar Buyya,et al.  A novel architecture for realizing grid workflow using tuple spaces , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[28]  J. Devooght,et al.  Probabilistic Reactor Dynamics —I: The Theory of Continuous Event Trees , 1992 .

[29]  Oscar H. Ibarra,et al.  Heuristic Algorithms for Scheduling Independent Tasks on Nonidentical Processors , 1977, JACM.

[30]  Xi Zhang,et al.  Applying database support for large scale data driven science in distributed environments , 2003, Proceedings. First Latin American Web Congress.

[31]  P. K. Andow,et al.  Failures in control systems , 1984 .

[32]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[33]  Fred Douglis,et al.  Transparent process migration: Design alternatives and the sprite implementation , 1991, Softw. Pract. Exp..

[34]  Francine Berman,et al.  Heuristics for scheduling parameter sweep applications in grid environments , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[35]  Henri Casanova,et al.  Practical divisible load scheduling on grid platforms with APST-DV , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[36]  Francine Berman,et al.  The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[37]  Hiromitsu Kumamoto,et al.  Signal-Flow-Based Graphs for Failure-Mode Analysis of Systems with Control Loops , 1981, IEEE Transactions on Reliability.

[38]  P. C. Cacciabue,et al.  Dynamic logical analytical methodology versus fault tree: the case study of the auxiliary feedwater system of a nuclear power plant , 1986 .

[39]  Debasish Ghose,et al.  Divisible Load Theory: A New Paradigm for Load Scheduling in Distributed Systems , 2004, Cluster Computing.

[40]  Joel H. Saltz,et al.  Scheduling of tasks with batch-shared I/O on heterogeneous systems , 2006 .

[41]  P. Andow Difficulties in Fault-Tree Synthesis for Process Plant , 1980, IEEE Transactions on Reliability.

[42]  Francine Berman,et al.  New Grid Scheduling and Rescheduling Methods in the GrADS Project , 2004, IPDPS Next Generation Software Program - NSFNGS - PI Workshop.

[43]  Joel H. Saltz,et al.  Database Support for Data-Driven Scientific Applications in the Grid , 2003, Parallel Process. Lett..

[44]  Michael Litzkow,et al.  Supporting checkpointing and process migration outside the UNIX kernel , 1999 .