The gCube system: Delivering Virtual Research Environments as-a-Service

Abstract

Research and knowledge production have changed significantly in recent decades, in step with developments in information technologies and infrastructures. The processes involved are changing through the digitalisation of science, the virtualisation of research communities and networks, and the provision of underlying systems and services by infrastructures. This paper gives an overview of gCube, a software system promoting elastic and seamless access to research assets (data, services, computing) across the boundaries of institutions, disciplines and providers, in support of collaboration-oriented research tasks. gCube’s technology is primarily conceived to enable Hybrid Data Infrastructures that facilitate the dynamic definition and operation of Virtual Research Environments. To this end, it offers a comprehensive set of data management commodities for various types of data and a rich array of “mediators” to interface with well-established infrastructures and information systems from various domains. Its effectiveness has been demonstrated by operating the D4Science.org infrastructure and serving concrete, multidisciplinary, challenging, and large-scale scenarios.
