A Methodology for Citing Linked Open Data Subsets

In this paper we discuss the problem of data citation with a specific focus on Linked Open Data. We outline the main requirements a data citation methodology must fulfill: (i) uniquely identify the cited objects; (ii) provide descriptive metadata; (iii) enable variable granularity citations; and (iv) produce both human- and machine-readable references. We propose a methodology based on named graphs and RDF quad semantics that allows us to create citation meta-graphs respecting the outlined requirements. We also present a compelling use case based on search engines experimental evaluation data and possible applications of the citation methodology.

[1]  Marjan Vernooy-Gerritsen Enhanced Publications: Linking Publications and Research Data in Digital Repositories , 2009 .

[2]  Nicola Ferro,et al.  DIRECTions: Design and Specification of an IR Evaluation Infrastructure , 2012, CLEF.

[3]  Maarten Hoogerwerf,et al.  Enhanced Publications : Linking Publications and Research Data in Digital Repositories , 2009 .

[4]  Erhard Rahm,et al.  The Scholarly Impact of CLEF (2000-2009) , 2013, CLEF.

[5]  Umberto Straccia,et al.  A Minimal Deductive System for General Fuzzy RDF , 2009, RR.

[6]  Paul T. Groth,et al.  The anatomy of a nanopublication , 2010, Inf. Serv. Use.

[7]  Jeremy J. Carroll,et al.  Named graphs, provenance and trust , 2005, WWW '05.

[8]  Donna Harman,et al.  Information Retrieval Evaluation , 2011, Synthesis Lectures on Information Concepts, Retrieval, and Services.

[9]  Dimitris Sacharidis,et al.  Diachronic linked data: towards long-term preservation of structured interrelated information , 2012, WOD.

[10]  Sarah Callaghan,et al.  Citation and Peer Review of Data: Moving Towards Formal Data Publication , 2011, Int. J. Digit. Curation.

[11]  Claudio Gutiérrez,et al.  Introducing Time into RDF , 2007, IEEE Transactions on Knowledge and Data Engineering.

[12]  Peter Buneman,et al.  A Rule-Based Citation System for Structured and Evolving Datasets , 2010, IEEE Data Eng. Bull..

[13]  Antoine Zimmermann RDF 1.1: On Semantics of RDF Datasets , 2014 .

[14]  Andreas Rauber,et al.  Scalable data citation in dynamic, large databases: Model and reference implementation , 2013, 2013 IEEE International Conference on Big Data.

[15]  Jeremy J. Carroll,et al.  Named graphs , 2005, J. Web Semant..

[16]  Christine L. Borgman,et al.  The conundrum of sharing research data , 2012, J. Assoc. Inf. Sci. Technol..

[17]  Christine L Borgman,et al.  Why are the attribution and citation of scientific data important? In: Uhlir, Paul and Cohen, Daniel (eds.). Report from Developing Data Attribution and Citation Practices and Standards: An International Symposium and Workshop. , 2012 .

[18]  Peter Buneman,et al.  How to cite curated databases and how to make them citable , 2006, 18th International Conference on Scientific and Statistical Database Management (SSDBM'06).

[19]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[20]  Romain Rouvoy,et al.  Debugging with the Crowd: A Debug Recommendation System Based on StackOverflow , 2014, ERCIM News.

[21]  Carol Peters,et al.  CLEF 2007: Ad Hoc Track Overview , 2008, CLEF.