Reducing Complexity in Management of eScience Computations

In this paper we address reduction of complexity in management of scientific computations in distributed computing environments. We explore an approach based on separation of computation design (application development) and distributed execution of computations, and investigate best practices for construction of virtual infrastructures for computational science - software systems that abstract and virtualize the processes of managing scientific computations on heterogeneous distributed resource systems. As a result we present StratUm, a toolkit for management of eScience computations. To illustrate use of the toolkit, we present it in the context of a case study where we extend the capabilities of an existing kinetic Monte Carlo software framework to utilize distributed computational resources. The case study illustrates a viable design pattern for construction of virtual infrastructures for distributed scientific computing. The resulting infrastructure is evaluated using a computational experiment from molecular systems biology.

[1]  Andreas Hellander,et al.  Simulation of Stochastic Reaction-Diffusion Processes on Unstructured Meshes , 2008, SIAM J. Sci. Comput..

[2]  N. Wingreen,et al.  Dynamic structures in Escherichia coli: Spontaneous formation of MinE rings and MinD polar zones , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Erik Elmroth,et al.  Abstractions for Scaling eScience Applications to Distributed Computing Environments - A StratUm Integration Case Study in Molecular Systems Biology , 2012, BIOINFORMATICS.

[4]  J. Lindemann,et al.  Advanced Resource Connector middleware for lightweight computational Grids , 2007, Future Gener. Comput. Syst..

[5]  Francine Berman,et al.  The Virtual Instrument: Support for Grid-Enabled Mcell Simulations , 2004, Int. J. High Perform. Comput. Appl..

[6]  Ian T. Foster,et al.  Globus Toolkit Version 4: Software for Service-Oriented Systems , 2005, Journal of Computer Science and Technology.

[7]  Yong Zhao,et al.  Many-task computing for grids and supercomputers , 2008, 2008 Workshop on Many-Task Computing on Grids and Supercomputers.

[8]  A. Stephen McGough,et al.  Performance Evaluation of the GridSAM Job Submission and Monitoring System , 2005 .

[9]  Chandra Krintz,et al.  Neptune: a domain specific language for deploying hpc software on cloud platforms , 2011, ScienceCloud '11.

[10]  Erik Elmroth,et al.  GJMF - a composable service-oriented grid job management framework , 2013, Future Gener. Comput. Syst..

[11]  David E. Culler,et al.  SEDA: an architecture for well-conditioned, scalable internet services , 2001, SOSP.

[12]  Peter S Swain,et al.  Efficient attenuation of stochasticity in gene expression through post-transcriptional control. , 2004, Journal of molecular biology.

[13]  Erik Elmroth,et al.  Mediation of Service Overhead in Service-Oriented Grid Architectures , 2011, 2011 IEEE/ACM 12th International Conference on Grid Computing.

[14]  Yong Zhao,et al.  Falkon: a Fast and Light-weight tasK executiON framework , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).

[15]  Gabrielle Allen,et al.  The SAGA C++ reference implementation: a milestone toward new high-level grid applications , 2006, SC.

[16]  David Fange,et al.  Noise-Induced Min Phenotypes in E. coli , 2006, PLoS Comput. Biol..