Biomolecular Path Sampling Enabled by Processing in Network Storage

Computationally complex and data intensive atomic scale biomolecular simulation is enabled via processing in network storage (PINS): a novel distributed system framework to overcome bandwidth, compute, storage, and security challenges inherent to the wide area computation and storage grid. High throughput data generation requirements for our scientific target are overcome through novel aggregate bandwidth capabilities. Biomolecular simulation methods are correlated with the client tools, hybrid database/file server (GEMS), computation engine (Condor), virtual file system adapter (Parrot), and local file servers (Chirp). PINS performance is reported for the path sampling of a solvated protein domain requiring over 1000 simulations with total output data generation on the order of 1TB.

[1]  Michael R. Shirts,et al.  Atomistic protein folding simulations on the submillisecond time scale using worldwide distributed computing. , 2003, Biopolymers.

[2]  Osamu Tatebe,et al.  Gfarm v2: A Grid file system that supports high-performance distributed and parallel data computing , 2005 .

[3]  David Chandler,et al.  Transition path sampling: throwing ropes over rough mountain passes, in the dark. , 2002, Annual review of physical chemistry.

[4]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[5]  Stuart Murdock,et al.  BioSimGrid: towards a worldwide repository for biomolecular simulations. , 2004, Organic & biomolecular chemistry.

[6]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[7]  Perla B. Balbuena,et al.  Molecular dynamics : from classical to quantum methods , 1999 .

[8]  Ron Elber,et al.  Long-timescale simulation methods. , 2005, Current opinion in structural biology.

[9]  Douglas Thain,et al.  Access control for a replica management database , 2006, StorageSS '06.

[10]  Andrew E. Torda,et al.  Biomolecular modelling: Overview of types of methods to search and sample conformational space , 2008 .

[11]  C. Kesselman,et al.  A Metadata Catalog Service for Data Intensive Applications , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[12]  W. G. Hoover molecular dynamics , 1986, Catalysis from A to Z.

[13]  Berend Smit,et al.  Understanding Molecular Simulation , 2001 .

[14]  J. Valverde Molecular Modelling: Principles and Applications , 2001 .

[15]  Douglas Thain,et al.  Separating Abstractions from Resources in a Tactical Storage System , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[16]  L. Buée,et al.  1H NMR Study on the Binding of Pin1 Trp-Trp Domain with Phosphothreonine Peptides* , 2001, The Journal of Biological Chemistry.

[17]  Douglas Thain,et al.  Generosity and gluttony in GEMS: grid enabled molecular simulations , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[18]  Krzysztof Sliwa,et al.  Functions of WW domains in the nucleus , 2001, FEBS letters.

[19]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[20]  Data Grids , 2009, Encyclopedia of Database Systems.

[21]  K. Lu Pinning down cell signaling, cancer and Alzheimer's disease. , 2004, Trends in biochemical sciences.

[22]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[23]  Reagan Moore,et al.  Data grids, collections, and grid bricks , 2003, 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies, 2003. (MSST 2003). Proceedings..

[24]  Ian T. Foster,et al.  GASS: a data movement and access service for wide area computing systems , 1999, IOPADS '99.

[25]  Douglas Thain,et al.  Parrot: Transparent User-Level Middleware for Data-Intensive Computing , 2005, Scalable Comput. Pract. Exp..

[26]  G. Forbes Molecular Dynamics , 1885, Nature.

[27]  D. Thain,et al.  Applying feedback control to a replica management system , 2006, 2006 Proceeding of the Thirty-Eighth Southeastern Symposium on System Theory.

[28]  R Elber,et al.  Novel methods for molecular dynamics simulations. , 1996, Current opinion in structural biology.

[29]  B. Berne,et al.  Novel methods of sampling phase space in the simulation of biological systems. , 1997, Current opinion in structural biology.

[30]  Rebecca C. Wade,et al.  Chapter 21 - Ways and Means to Enhance the Configurational Sampling of Small Peptides in Aqueous Solution in Molecular Dynamics Simulations , 1999 .

[31]  Carl Kesselman,et al.  Performance and scalability of a replica location service , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..