Extracting information ASAP!

Designing I/O systems capable of scaling up to deal with the next generation of extreme scale scientific environments is a significant challenge. Scientific applications already strain the capabilities of current filesystems and storage systems. This work presents a middleware-based approach that generalizes from previous work on staging areas to focus more generally on staging resources. Exploiting the steady increase of the ratio of compute capability to I/O bandwidth, the EnStage middleware system allows for metadata characterization and I/O processing to occur when as where appropriate. This includes in reserved staging areas, buffered memory, and even in the writing processes' execution context. Using the EnStage extension to previous work, we find a 1.4% increase in runtime due to additional functionality resulted in I/O time in the staging area dropping to only 16% of the non-reduced output.

[1]  Karsten Schwan,et al.  Adaptable, metadata rich IO methods for portable high performance IO , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[2]  Karsten Schwan,et al.  DataStager: scalable data staging services for petascale applications , 2009, HPDC.

[3]  Mark F. Horstemeyer,et al.  Numerical tools for atomistic simulations. , 2004 .

[4]  Karsten Schwan,et al.  PreDatA – preparatory data analytics on peta-scale machines , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[5]  Kwan-Liu Ma,et al.  Visual interrogation of gyrokinetic particle simulations , 2007 .

[6]  Karsten Schwan,et al.  Managing Variability in the IO Performance of Petascale Storage Systems , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[7]  Vanish Talwar,et al.  GViM: GPU-accelerated virtual machines , 2009, HPCVirt '09.

[8]  David E. Shaw,et al.  Accelerating Parallel Analysis of Scientific Simulation Data via Zazen , 2010, FAST.

[9]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[10]  Karsten Schwan,et al.  DataStager: scalable data staging services for petascale applications , 2009, HPDC '09.

[11]  Greg Eisenhauer Dynamic Code Generation with the E-Code Language , 2005 .

[12]  Karsten Schwan,et al.  Exploiting Latent I/O Asynchrony in Petascale Science Applications , 2008, 2009 International Conference on Parallel Processing Workshops.

[13]  Karsten Schwan,et al.  Exploiting Latent I/O Asynchrony in Petascale Science Applications , 2011, Int. J. High Perform. Comput. Appl..

[14]  Karsten Schwan,et al.  LIVE data workspace: A flexible, dynamic and extensible platform for petascale applications , 2007, 2007 IEEE International Conference on Cluster Computing.

[15]  Karsten Schwan,et al.  Extending I/O through high performance data services , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.