The Agave Platform: An Open, Science-as-a-Service Platform for Digital Science

The Agave Platform first appeared in 2011 as a pilot project for the iPlant Collaborative [11]. In its first two years, Foundation saw over 40% growth per month, supporting 1000+ clients, 600+ applications, 4 HPC systems at 3 centers across the US. It also gained users outside of plant biology. To better serve the needs of the general open science community, we rewrote Foundation as a scalable, cloud native application and named it the Agave Platform. In this paper we present the Agave Platform, a Science-as-a-Service (ScaaS) platform for reproducible science. We provide a brief history and technical overview of the project, and highlight three case studies leveraging the platform to create synergistic value for their users.

[1]  Chuck Palahniuk Fight Club : a novel , 1997 .

[2]  Maria Petrova-El Sayed,et al.  UNICORE 7 — Middleware services for distributed and federated computing , 2016, 2016 International Conference on High Performance Computing & Simulation (HPCS).

[3]  Srinath Perera,et al.  Apache airavata: a framework for distributed applications and computational workflows , 2011, GCE '11.

[4]  Marlon E. Pierce,et al.  The Apache Airavata Application Programming Interface: Overview and Evaluation with the UltraScan Science Gateway , 2014, 2014 9th Gateway Computing Environments Workshop.

[5]  Michael McLennan,et al.  HUBzero: A Platform for Dissemination and Collaboration in Computational Science and Engineering , 2010, Computing in Science & Engineering.

[6]  Ludek Matyska,et al.  Introduction to the CHAIN-REDS Project (objectives and achievements) , 2013 .

[7]  Daniel C. Stanzione,et al.  iPlant atmosphere: a gateway to cloud infrastructure for the plant sciences , 2011, GCE '11.

[8]  Matthew W. Vaughn,et al.  Containers-as-a-service via the Actor Model , 2017 .

[9]  Nancy Wilkins-Diehr,et al.  Roadmaps, not blueprints: paving the way to science gateway success , 2012, XSEDE '12.

[10]  Shaowen Wang,et al.  CyberGIS Gateway for enabling data‐rich geospatial research and education , 2015, Concurr. Comput. Pract. Exp..

[11]  B. S. Manjunath,et al.  The iPlant Collaborative: Cyberinfrastructure for Plant Biology , 2011, Front. Plant Sci..

[12]  Nancy Wilkins-Diehr,et al.  Science gateways today and tomorrow: positive perspectives of nearly 5000 members of the research community , 2015, Concurr. Comput. Pract. Exp..

[13]  John Shalf,et al.  SAGA: A Simple API for Grid Applications. High-level application programming on the Grid , 2006 .

[14]  Daniel C. Stanzione,et al.  Building an environment to facilitate discoveries for plant sciences , 2011, GCE '11.

[15]  Matthew R. Hanlon,et al.  DesignSafe: New Cyberinfrastructure for Natural Hazards Engineering , 2017 .

[16]  Jarek Nabrzyski,et al.  The Vine Toolkit: A Java Framework for Developing Grid Applications , 2007, PPAM.

[17]  Kurt Mueller,et al.  The GridPort toolkit: a system for building Grid portals , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[18]  Tanya E. Clement Methodologies in the digital humanities for analyzing aural patterns in texts , 2012, iConference '12.

[19]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[20]  Josue Balandrano Coronel,et al.  DesignSafe: Using Elasticsearch to Share and Search Data on a Science Web Portal , 2017, PEARC.

[21]  Rion Dooley,et al.  Software-as-a-Service: The iPlant Foundation API , 2012 .

[22]  E. Mullaart,et al.  The 1000 bull genomes project - Toward genomic selection from whole genome sequence data in dairy and beef cattle , 2013 .

[23]  Mark A. Miller,et al.  Creating the CIPRES Science Gateway for inference of large phylogenetic trees , 2010, 2010 Gateway Computing Environments Workshop (GCE).

[24]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[25]  Ian T. Foster,et al.  Globus platform‐as‐a‐service for collaborative science applications , 2015, Concurr. Comput. Pract. Exp..

[26]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[27]  Erwin Laure,et al.  Middleware for the next generation Grid infrastructure , 2004 .

[28]  K. D. Borne,et al.  The Zooniverse: A Framework for Knowledge Discovery from Citizen Science Data , 2011 .