Cyberinfrastructure Center of Excellence Pilot: Connecting Large Facilities Cyberinfrastructure

The National Science Foundation's Large Facilities are major, multi-user research facilities that operate and manage sophisticated and diverse research instruments and platforms (e.g., large telescopes, interferometers, distributed sensor arrays) that serve a variety of scientific disciplines, from astronomy and physics to geology and biology and beyond. Large Facilities are increasingly dependent on advanced cyberinfrastructure (i.e., computing, data, and software systems; networking; and associated human capital) to enable the broad delivery and analysis of facility-generated data. These cyberinfrastructure tools enable scientists and the public to gain new insights into fundamental questions about the structure and history of the universe, the world we live in today, and how our environment may change in the coming decades. This paper describes a pilot project that aims to develop a model for a Cyberinfrastructure Center of Excellence (CI CoE) that facilitates community building and knowledge sharing and that disseminates and applies best practices and innovative solutions for facility CI.

[1]  A. Goshaw The ATLAS Experiment at the CERN Large Hadron Collider , 2008 .

[2]  Miron Livny,et al.  Pegasus, a workflow management system for science automation , 2015, Future Gener. Comput. Syst..

[3]  Jorge Luis Rodriguez,et al.  The Open Science Grid , 2005 .

[4]  A. Schukraft,et al.  The IceCube Neutrino Observatory: Instrumentation and Online Systems , 2016, 1612.05093.

[5]  Jim Basney,et al.  CILogon: A federated X.509 certification authority for cyberinfrastructure logon , 2013, Concurr. Comput. Pract. Exp..

[6]  João Paulo Teixeira,et al.  The CMS experiment at the CERN LHC , 2008 .

[7]  Hong Linh Truong,et al.  MQTT-S — A publish/subscribe protocol for Wireless Sensor Networks , 2008, 2008 3rd International Conference on Communication Systems Software and Middleware and Workshops (COMSWARE '08).

[8]  John E. Gross,et al.  The terrestrial organism and biogeochemistry spatial sampling design for the National Ecological Observatory Network , 2019, Ecosphere.

[9]  Eduardo Serrano,et al.  LSST: From Science Drivers to Reference Design and Anticipated Data Products , 2008, The Astrophysical Journal.

[10]  Douglas Thain,et al.  Makeflow: a portable abstraction for data intensive computing on clusters, clouds, and grids , 2012, SWEET '12.

[11]  B. J. Butler,et al.  THE EXPANDED VERY LARGE ARRAY: A NEW TELESCOPE FOR NEW SCIENCE , 2011, 1106.0532.

[12]  Nancy Wilkins-Diehr,et al.  NSF’s Inaugural Software Institutes: The Science Gateways Community Institute and the Molecular Sciences Software Institute , 2018, Computing in Science & Engineering.

[13]  Robert B. Ross,et al.  PIDX: Efficient Parallel I/O for Multi-resolution Multi-dimensional Scientific Datasets , 2011, 2011 IEEE International Conference on Cluster Computing.

[14]  Herman Stehouwer,et al.  Research data alliance , 2013 .

[15]  P.-T. Bremer,et al.  The ViSUS Visualization Framework , 2012, High Performance Visualization.

[16]  Manish Parashar,et al.  Data Cyberinfrastructure for End-to-End Science , 2020, Computing in Science & Engineering.

[17]  Marianne Swanson,et al.  Contingency Planning Guide for Federal Information Systems , 2010 .

[18]  G. M. Harry,et al.  Advanced LIGO: the next generation of gravitational wave detectors , 2010 .

[19]  Manish Parashar,et al.  Architecting the cyberinfrastructure for National Science Foundation Ocean Observatories Initiative (OOI) , 2016 .

[20]  Mark H. Edwards,et al.  Science at Sea: Meeting Future Oceanographic Goals with a Robust Academic Research Fleet , 2009 .

[21]  Barton P. Miller,et al.  Trusted CI Experiences in Cybersecurity and Service to Open Science , 2019, PEARC.