GISandbox: A Science Gateway for Geospatial Computing

Science gateways provide easy access to domain-specific tools and data. The field of Geographic Information Science and Systems (GIS) uses myriad tools and datasets, which raises challenges in designing a science gateway to meet users' diverse research and teaching needs. We describe a new science gateway called the GISandbox that is designed meet the needs of researchers and educators leveraging geospatial computing, which is situated at the nexus of GIS and computational science. The GISandbox is built on Jupyter Notebooks to create an easy, open, and flexible platform for geospatial computing. Jupyter Notebooks is a widely used interactive computing environment running in the browser that integrates live code, narrative, equations and images. We extend the Jupyter Notebook platform to enable users to run interactive notebooks on the cloud resource Jetstream or computationally-intensive notebooks on the Bridges supercomputer located at the Pittsburgh Supercomputing Center. A novel Job Management platform allows the user to easily submit a Jupyter Notebook for batch execution on Bridges (and eventually Comet), monitor the SLURM job, and retrieve output files. GISandbox Virtual Machines are created in Jetstream's Atmosphere interface and then deployed and configured using a series of Ansible scripts, which allow us to create an easily reproducible and scalable system. This paper outlines our vision for GISandbox, the current implementation, with a discussion looking toward the future and how the GISandbox could be used in other domains.

[1]  Matthias Drusch,et al.  Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services , 2012 .

[2]  E. LESTER SMITH,et al.  AND OTHERS , 2005 .

[3]  Steven M. Manson,et al.  Terra Populus' architecture for integrated big geospatial services , 2017, Trans. GIS.

[4]  Shaowen Wang,et al.  CyberGIS Gateway for enabling data‐rich geospatial research and education , 2015, Concurr. Comput. Pract. Exp..

[5]  Shaowen Wang,et al.  A CyberGIS-Jupyter Framework for Geospatial Analytics at Scale , 2017, PEARC.

[6]  Curtis W. Marean,et al.  Middle and Late Pleistocene paleoscape modeling along the southern coast of South Africa , 2010 .

[7]  Eric Shook,et al.  The socio-environmental data explorer (SEDE): a social media–enhanced decision support system to explore risk perception to hazard events , 2016 .

[8]  Helen Shen,et al.  Interactive notebooks: Sharing the code , 2014, Nature.

[9]  Jim Basney,et al.  CILogon: A federated X.509 certification authority for cyberinfrastructure logon , 2013, Concurr. Comput. Pract. Exp..

[10]  Nancy Wilkins-Diehr,et al.  Gateways to Discovery: Cyberinfrastructure for the Long Tail of Science , 2014, XSEDE '14.

[11]  Patrick Weber,et al.  OpenStreetMap: User-Generated Street Maps , 2008, IEEE Pervasive Computing.

[12]  Brian E. Granger,et al.  IPython: A System for Interactive Scientific Computing , 2007, Computing in Science & Engineering.

[13]  William B. Meyer,et al.  Global land-use/land-cover change: towards an integrated study , 1994 .

[14]  Janet Franklin,et al.  Paleoscape model of coastal South Africa during modern human origins: progress in scaling and coupling climate, vegetation, and agent-based models on XSEDE , 2015, XSEDE.

[15]  Nancy Wilkins-Diehr,et al.  Science gateways today and tomorrow: positive perspectives of nearly 5000 members of the research community , 2015, Concurr. Comput. Pract. Exp..

[16]  Michael McLennan,et al.  HUBzero: A Platform for Dissemination and Collaboration in Computational Science and Engineering , 2010, Computing in Science & Engineering.

[17]  Shaowen Wang,et al.  CyberGIS software: a synthetic review and integration roadmap , 2013, Int. J. Geogr. Inf. Sci..

[18]  Nancy Wilkins-Diehr,et al.  XSEDE: Accelerating Scientific Discovery , 2014, Computing in Science & Engineering.

[19]  et al.,et al.  Jupyter Notebooks - a publishing format for reproducible computational workflows , 2016, ELPUB.

[20]  Daniel C. Stanzione,et al.  Jetstream: performance, early experiences, and early results , 2016, XSEDE.

[21]  Uri Wilensky,et al.  NetLogo: A simple environment for modeling complexity , 2014 .

[22]  Shaowen Wang A CyberGIS Framework for the Synthesis of Cyberinfrastructure, GIS, and Spatial Analysis , 2010 .

[23]  Mark Gahegan,et al.  Geospatial Cyberinfrastructure: Past, present and future , 2010, Comput. Environ. Urban Syst..

[24]  Marc P. Armstrong,et al.  Geography and Computational Science , 2000 .

[25]  Dick Hardt,et al.  The OAuth 2.0 Authorization Framework , 2012, RFC.

[26]  Steven K. Feiner,et al.  Spatial computing , 2015, Commun. ACM.

[27]  Gordon Christie,et al.  Functional Map of the World , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[28]  Nancy Wilkins-Diehr,et al.  An Overview of the XSEDE Extended Collaborative Support Program , 2015, ICS 2015.

[29]  Rajesh Kalyanam,et al.  MyGeoHub - A sustainable and evolving geospatial science gateway , 2018, Future Gener. Comput. Syst..