HUBzero®: Novel Concepts Applied to Established Computing Infrastructures to Address Communities' Needs

The science gateway framework HUBzero® has been enhanced and further developed since its initial vision in 1996 -- always driven by requirements of the diverse communities applying HUBzero® for their research. HUBzero® is part of a computational landscape that has never evolved as fast as in the last decade. Novel frameworks and concepts on the user interface side such as Javascript libraries and Jupyter notebooks support communities in their working environment with easy-to-use user interfaces while novel technologies and concepts in the backend allow for effective and efficient modeling, simulations and processing research tools and data. HUBzero®'s enhancements include extensions for BOINC and XSEDE infrastructures on the backend while offering interactive computations and analytical tools via Jupyter Notebooks, RStudio, and other web applications as publishing environments. The paper goes into detail for novel developments for the three use cases Purdue University Research Repository (PURR), nanoHUB and MyGeoHub. First, PURR has been extended to utilize the enhanced data storage service Data Depot and high-speed networks of the Purdue local campus infrastructure. Second, nanoHUB offers over 500 simulation tools and it has been enhanced with a novel caching system to explore the input parameter space for already computed results via BOINC. The third extension is concerned with builtin features for geospatial data and modeling in MyGeoHub that offers to execute compute-intensive tasks on XSEDE. The diverse extensions can be reused in various hubs developed with HUBzero® requiring such diverse features and accessing different distributed computing infrastructures.

[1]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[2]  Srinath Perera,et al.  Apache airavata: a framework for distributed applications and computational workflows , 2011, GCE '11.

[3]  José A. B. Fortes,et al.  WWW-based computation in microelectronics education and research , 1999, Proceedings of the Thirteenth Biennial University/Government/Industry Microelectronics Symposium (Cat. No.99CH36301).

[4]  John M. Fonner,et al.  The Agave Platform: An Open, Science-as-a-Service Platform for Digital Science , 2018, PEARC.

[5]  Rajesh Kalyanam,et al.  MyGeoHub Geospatial Gateway , 2017 .

[6]  Gerhard Klimeck,et al.  nanoHUB.org: cloud-based services for nanoscale modeling, simulation, and education , 2013 .

[7]  Lan Zhao,et al.  SWATShare - A web platform for collaborative research and education through online sharing, simulation and visualization of SWAT models , 2016, Environ. Model. Softw..

[8]  Eli Dart,et al.  The Modern Research Data Portal: a design pattern for networked, data-intensive science , 2018, PeerJ Comput. Sci..

[9]  Brian E. Granger,et al.  IPython: A System for Interactive Scientific Computing , 2007, Computing in Science & Engineering.

[10]  Lan Zhao,et al.  Rapid aggregation of global gridded crop model outputs to facilitate cross-disciplinary analysis of climate change impacts in agriculture , 2016, Environ. Model. Softw..

[11]  Rajesh Kalyanam,et al.  GABBs - Reusable Geospatial Data Analysis Building Blocks for Science Gateways , 2017, IWSG.

[12]  Nancy Wilkins-Diehr,et al.  Science Gateways: The Long Road to the Birth of an Institute , 2017, HICSS.

[13]  Rajesh Kalyanam,et al.  Social Media Modeling of Human Behavior in Natural Emergencies , 2018, PEARC.

[14]  Reagan Moore,et al.  The integrated Rule-Oriented Data System (iRODS 4.0) Microservice Workbook , 2015 .

[15]  Carole A. Goble,et al.  The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud , 2013, Nucleic Acids Res..

[16]  Lan Zhao,et al.  Cyberinfrastructure for the collaborative development of U2U decision support tools , 2017 .

[17]  Michael McLennan,et al.  HUBzero: A Platform for Dissemination and Collaboration in Computational Science and Engineering , 2010, Computing in Science & Engineering.

[18]  Mark A. Miller,et al.  Creating the CIPRES Science Gateway for inference of large phylogenetic trees , 2010, 2010 Gateway Computing Environments Workshop (GCE).

[19]  Rajesh Kalyanam,et al.  MyGeoHub - A sustainable and evolving geospatial science gateway , 2018, Future Gener. Comput. Syst..