A comparison of research data management platforms: architecture, flexible metadata and interoperability

Research data management is rapidly becoming a regular concern for researchers, and institutions need to provide them with platforms to support data organization and preparation for publication. Some institutions have adopted institutional repositories as the basis for data deposit, whereas others are experimenting with richer environments for data description, in spite of the diversity of existing workflows. This paper is a synthetic overview of current platforms that can be used for data management purposes. Adopting a pragmatic view on data management, the paper focuses on solutions that can be adopted in the long tail of science, where investments in tools and manpower are modest. First, a broad set of data management platforms is presented—some designed for institutional repositories and digital libraries—to select a short list of the more promising ones for data management. These platforms are compared considering their architecture, support for metadata, existing programming interfaces, as well as their search mechanisms and community acceptance. In this process, the stakeholders’ requirements are also taken into account. The results show that there is still plenty of room for improvement, mainly regarding the specificity of data description in different domains, as well as the potential for integration of the data management platforms with existing research management tools. Nevertheless, depending on the context, some platforms can meet all or part of the stakeholders’ requirements.

[1]  W H Waldo On "Improving Scientific Communication". , 1955, Science.

[2]  Herbert Van de Sompel,et al.  The open archives initiative: building a low-barrier interoperability framework , 2001, JCDL '01.

[3]  Carl Lagoze,et al.  The Open Archives Initiative Protocol for Metadata Harvesting Protocol , 2002 .

[4]  Clifford A. Lynch,et al.  Institutional Repositories: Essential Infrastructure For Scholarship In The Digital Age , 2003 .

[5]  Wang Jun Open Archives Initiative Protocol for Metadata Harvesting , 2005 .

[6]  Muriel Foulonneau The Open Archives Initiative Protocol for Metadata Harvesting: an Opportunity for Sharing Content in a Distributed Environment , 2006 .

[7]  L. Lyon Dealing with Data: Roles, Rights, Responsibilities and Relationships. Consultancy Report. , 2007 .

[8]  José Carlos Ramalho,et al.  RODA and Crib : A Service-Oriented Digital Repository , 2008, iPRES.

[9]  P. Bryan Heidorn,et al.  Shedding Light on the Dark Data in the Long Tail of Science , 2008, Libr. Trends.

[10]  Pamela M. Bluh,et al.  Institutional Repositories: Essential Infrastructure for Scholarship in the Digital Age , 2009 .

[11]  Ann G. Green,et al.  Policy-making for Research Data in Repositories: A Guide , 2009 .

[12]  Laurent Romary,et al.  Comparing Repository Types - Challenges and barriers for subject-based repositories, research repositories, national repository systems and institutional repositories in serving scholarly communication , 2010, Int. J. Digit. Libr. Syst..

[13]  Ed Fay Repository Software Comparison: Building Digital Library Infrastructure at LSE , 2010 .

[14]  Cristina Ribeiro,et al.  UPData - A Data Curation Experiment at U.Porto using DSpace , 2011, iPRES.

[15]  Julia Hoxha,et al.  Open Government Data on the Web: A Semantic Approach , 2011, 2011 International Conference on Emerging Intelligent Data and Web Technologies.

[16]  Ranjeet Devarakonda,et al.  Data sharing and retrieval using OAI-PMH , 2011, Earth Sci. Informatics.

[17]  J. Silva Managing multidisciplinary research data Extending DSpace to enable long-term preservation of tabular datasets , 2012 .

[18]  Rob Procter,et al.  Development of a Pilot Data Management Infrastructure for Biomedical Researchers at University of Manchester - Approach, Findings, Challenges and Outlook of the MaDAM Project , 2012, Int. J. Digit. Curation.

[19]  A. H. Ball Tools for research data management , 2012 .

[20]  Craig Willis,et al.  Analysis and synthesis of metadata goals for scientific data , 2012, J. Assoc. Inf. Sci. Technol..

[21]  Ccsds Secretariat,et al.  Reference Model for an Open Archival Information System (OAIS) , 1999 .

[22]  Christine L. Borgman,et al.  The conundrum of sharing research data , 2012, J. Assoc. Inf. Sci. Technol..

[23]  Cristina Ribeiro,et al.  UPBox and DataNotes: a collaborative data management environment for the long tail of research data , 2013, iPRES.

[24]  Heather A. Piwowar,et al.  Data reuse and the open data citation advantage , 2013, PeerJ.

[25]  John M. Budd,et al.  Institutional Repositories: Exploration of Costs and Value , 2013, D Lib Mag..

[26]  Joss Winn Open data and the academy: an evaluation of CKAN for research data management , 2013 .

[27]  Jonathan W. Essex,et al.  How to pick a winning team: approaches towards the selection of computationally derived protein structures for ensemble-based virtual screening , 2013, Journal of Cheminformatics.

[28]  Jeremy G. Frey,et al.  First steps towards semantic descriptions of electronic laboratory notebook records , 2013, Journal of Cheminformatics.

[29]  Cristina Ribeiro,et al.  Dendro: Collaborative Research Data Management Built on Linked Open Data , 2014, ESWC.

[30]  Cristina Ribeiro,et al.  LabTablet: Semantic Metadata Collection on a Multi-domain Laboratory Notebook , 2014, MTSR.

[31]  Cristina Ribeiro,et al.  Ontology-based multi-domain metadata for research data management using triple stores , 2014, IDEAS.

[32]  Cristina Ribeiro,et al.  The Dendro research data management platform: Applying ontologies to long-term preservation in a collaborative environment , 2014, iPRES.

[33]  Jean Gabriel Bankier,et al.  Institutional Repository Software Comparison , 2014 .

[34]  Veerle Van den Eynden,et al.  Managing and Sharing Research Data: A Guide to Good Practice , 2014 .

[35]  Paolo Manghi,et al.  Science 2.0 Repositories: Time for a Change in Scholarly Communication , 2015, D Lib Mag..

[36]  Karima Rafes,et al.  A platform for scientific data sharing , 2015 .

[37]  Paolo Manghi,et al.  Data journals: A survey , 2014, J. Assoc. Inf. Sci. Technol..

[38]  Andias Wira-Alam,et al.  datorium: Sharing Platform for Social Science Data - Deposit and Publish Research Data for Better Visibility , 2015, ISI.