GRAPLEr: A distributed collaborative environment for lake ecosystem modeling that integrates overlay networks, high‐throughput computing, and WEB services

The GLEON Research And PRAGMA Lake Expedition—GRAPLE—is a collaborative effort between computer science and lake ecology researchers. It aims to improve our understanding and predictive capacity of the threats to the water quality of our freshwater resources, including climate change. This paper presents GRAPLEr, a distributed computing system used to address the modeling needs of GRAPLE researchers. GRAPLEr integrates and applies overlay virtual network, high‐throughput computing, and WEB service technologies in a novel way. First, its user‐level IP‐over‐P2P overlay network allows compute and storage resources distributed across independently administered institutions (including private and public clouds) to be aggregated into a common virtual network, despite the presence of firewalls and network address translators. Second, resources aggregated by the IP‐over‐P2P virtual network run unmodified high‐throughput‐computing middleware to enable large numbers of model simulations to be executed concurrently across the distributed computing resources. Third, a WEB service interface allows end users to submit job requests to the system using client libraries that integrate with the R statistical computing environment. The paper presents the GRAPLEr architecture, describes its implementation and reports on its performance for batches of general lake model simulations across 3 cloud infrastructures (University of Florida, CloudLab, and Microsoft Azure).

[1]  Shreyas Cholia,et al.  NEWT: A RESTful service for building High Performance Computing web applications , 2010, 2010 Gateway Computing Environments Workshop (GCE).

[2]  David P. Hamilton,et al.  THE GLOBAL LAKE ECOLOGICAL OBSERVATORY NETWORK (GLEON): THE EVOLUTION OF GRASSROOTS NETWORK SCIENCE , 2013 .

[3]  Justin D. Brookes,et al.  The interaction between climate warming and eutrophication to promote cyanobacteria is dependent on trophic state and varies among taxa , 2014 .

[4]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[5]  Douglas Thain,et al.  Distributed computing in practice: the Condor experience , 2005, Concurr. Pract. Exp..

[6]  P. Oscar Boykin,et al.  IP over P2P: enabling self-configuring virtual IP networks for grid computing , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[7]  Hans W. Paerl,et al.  Harmful Freshwater Algal Blooms, With an Emphasis on Cyanobacteria , 2001, TheScientificWorldJournal.

[8]  Pierre St. Juste,et al.  TinCan: User-Defined P2P Virtual Network Overlays for Ad-hoc Collaboration , 2014, EAI Endorsed Trans. Collab. Comput..

[9]  David P. Hamilton,et al.  Predicting the resilience and recovery of aquatic systems: A framework for model evolution within environmental observatories , 2015 .

[10]  Pierre St. Juste,et al.  Integrating Overlay and Social Networks for Seamless P2P Networking , 2008, 2008 IEEE 17th Workshop on Enabling Technologies: Infrastructure for Collaborative Enterprises.

[11]  Kevin J. Flynn,et al.  Castles built on sand : dysfunctionality in plankton models and the inadequacy of dialogue between biologists and modellers , 2005 .

[12]  Elena Litchman,et al.  Trait-Based Community Ecology of Phytoplankton , 2008 .

[13]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[14]  Cayelan C. Carey,et al.  Using wavelet analyses to examine variability in phytoplankton seasonal succession and annual periodicity , 2016 .

[15]  Jorge Luis Rodriguez,et al.  The Open Science Grid , 2005 .

[16]  Stephen R. Carpenter,et al.  State of the world's freshwater ecosystems: physical, chemical, and biological changes. , 2011 .

[17]  V. Smith,et al.  Eutrophication: impacts of excess nutrient inputs on freshwater, marine, and terrestrial ecosystems. , 1999, Environmental pollution.

[18]  Cayelan C. Carey,et al.  Fish and Phytoplankton Exhibit Contrasting Temporal Species Abundance Patterns in a Dynamic North Temperate Lake , 2015, PloS one.

[19]  P. Kacsuk,et al.  WS-PGRADE: Supporting parameter sweep applications in workflows , 2008, 2008 Third Workshop on Workflows in Support of Large-Scale Science.

[20]  H. Paerl,et al.  Blooms Like It Hot , 2008, Science.

[21]  Cayelan C Carey,et al.  Resilience to Blooms , 2011, Science.