Sim-X: parallel system software for interactive multi-experiment computational studies

Advances in high-performance computing have led to the broad use of computational studies in everyday engineering and scientific applications. A single study may require thousands of computational experiments, each corresponding to individual runs of simulation software with different parameter settings; in complex studies, the pattern of parameter changes is complex and may have to be adjusted by the user based on partial simulation results. Unfortunately, existing tools have limited high-level support for managing large ensembles of simultaneous computational experiments. In this paper, we present a system architecture for interactive computational studies targeting two goals. The first is to provide a framework for high-level user interaction with computational studies, rather than individual experiments; the second is to maximize the size of the studies that can be performed at close to interactive rates. We describe a prototype implementation of the system and demonstrate performance improvements obtained using our approach for a simple model problem

[1]  Karsten Schwan,et al.  Falcon: On-line monitoring for steering parallel programs , 1998, Concurr. Pract. Exp..

[2]  Gary B. Lamont,et al.  Evolutionary Algorithms for Solving Multi-Objective Problems , 2002, Genetic Algorithms and Evolutionary Computation.

[3]  John E. Dennis,et al.  Normal-Boundary Intersection: A New Method for Generating the Pareto Surface in Nonlinear Multicriteria Optimization Problems , 1998, SIAM J. Optim..

[4]  Yolanda Gil,et al.  The Role of Planning in Grid Computing , 2003, ICAPS.

[5]  Andrew A. Chien,et al.  Architectural Support and Mechanisms for Object Caching in Dynamic Multithreaded Computations , 1999, J. Parallel Distributed Comput..

[6]  Robert A. Ellis,et al.  Eliminating Distance in Scientific Computing: an Experiment in Televisualization , 1990, Int. J. High Perform. Comput. Appl..

[7]  Karsten Schwan,et al.  Falcon: On‐line monitoring for steering parallel programs , 1998 .

[8]  Steven G. Parker,et al.  Biomedical computing and visualization software environments , 2004, CACM.

[9]  Vijay Mann,et al.  DISCOVER: An environment for Web‐based interaction and steering of high‐performance scientific applications , 2001, Concurr. Comput. Pract. Exp..

[10]  David Abramson,et al.  Nimrod: a tool for performing parametrised simulations using distributed workstations , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[11]  Steven G. Parker,et al.  Uintah: a massively parallel problem solving environment , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[12]  Henri Casanova,et al.  Netsolve: a Network-Enabled Server for Solving Computational Science Problems , 1997, Int. J. High Perform. Comput. Appl..

[13]  Erwin Schwab,et al.  State of the art and future trends , 2003 .

[14]  Jarek Nabrzyski,et al.  Grid resource management: state of the art and future trends , 2004 .

[15]  D. Abramson,et al.  An Automatic Design Optimization Tool and its Application to Computational Fluid Dynamics , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[16]  Michelle Miller,et al.  Simulation steering with SCIRun in a distributed environment , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[17]  Achille Messac,et al.  Physical programming - Effective optimization for computational design , 1996 .

[18]  C.R. Johnson,et al.  SCIRun: A Scientific Programming Environment for Computational Steering , 1995, Proceedings of the IEEE/ACM SC95 Conference.

[19]  Francine Berman,et al.  Resource Allocation for Steerable Parallel Parameter Searches , 2002, GRID.

[20]  Keming Zhang,et al.  SCIRun2: a CCA framework for high performance computing , 2004, Ninth International Workshop on High-Level Parallel Programming Models and Supportive Environments, 2004. Proceedings..

[21]  John Brooke,et al.  Computational steering in realitygrid , 2003 .

[22]  Karsten Schwan,et al.  From interactive applications to distributed laboratories , 1998, IEEE Concurr..

[23]  Francine Berman,et al.  The AppLeS Parameter Sweep Template: User-Level Middleware for the Grid , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[24]  Douglas Thain,et al.  Distributed computing in practice: the Condor experience , 2005, Concurr. Pract. Exp..

[25]  James Arthur Kohl,et al.  Cumulvs: Providing Fault Toler. Ance, Visualization, and Steer Ing of Parallel Applications , 1996, Int. J. High Perform. Comput. Appl..