Methodological Guidelines for Publishing Library Data as Linked Data

Publishing data as Linked Data increases the interoperability and discoverability of resources over the web space. This process involves several design decisions and technologies. However, there is no one-size-fits-all formula for publishing data as Linked Data. Also, the quality of linked data published is a key issue to take into account. In the library domain, the quality of linked data is a crucial point for improving the retrieval and use of the data. In this paper, we propose a set of methodological guidelines based on five activities for publishing library data as Linked Data. The proposed guidelines consider the quality of published data as a key issue. In this line, our approach includes a preprocessing task for data cleansing and normalization. The proposed approach has been applied in a use case for publishing bibliographic data from Open Access journals in Cuba. The results obtained show the applicability of the methodological guidelines proposed in a real environment.

[1]  Stephan Bloehdorn,et al.  The SWRC Ontology - Semantic Web for Research Communities , 2005, EPIA.

[2]  Yusniel Hidalgo Delgado,et al.  Detección de comunidades a partir de redes de coautoría en grafos RDF , 2016 .

[3]  Tom Heath,et al.  Linked Data: Evolving the Web into a Global Data Space , 2011, Linked Data.

[4]  Marcos André Gonçalves,et al.  A brief survey of automatic methods for author name disambiguation , 2012, SGMD.

[5]  David Wood,et al.  The Joy of Data - A Cookbook for Publishing Linked Government Data on the Web , 2011 .

[6]  Avigdor Gal Uncertain entity resolution: re-evaluating entity resolution in the big data era: tutorial , 2014, VLDB 2014.

[7]  Ashwin Machanavajjhala,et al.  Entity Resolution: Theory, Practice & Open Challenges , 2012, Proc. VLDB Endow..

[8]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[9]  Jens Lehmann,et al.  Quality assessment for Linked Data: A Survey , 2015, Semantic Web.

[10]  Ronald Fagin,et al.  Composing schema mappings: second-order dependencies to the rescue , 2004, PODS '04.

[11]  Yusniel Hidalgo-Delgado,et al.  Using Search Paradigms and Architecture Information Components to Consume Linked Data , 2014 .

[12]  Rik Van de Walle,et al.  Querying Datasets on the Web with High Availability , 2014, SEMWEB.

[13]  Silvio Peroni,et al.  FaBiO and CiTO: Ontologies for describing bibliographic resources and citations , 2012, J. Web Semant..

[14]  Oscar Corcho,et al.  Methodological Guidelines for Publishing Government Linked Data , 2011 .

[15]  Carlo Meghini,et al.  Towards a Methodology for Publishing Library Linked Data , 2013, IRCDL.

[16]  Asunción Gómez-Pérez,et al.  Guidelines for multilingual linked data , 2013, WIMS '13.

[17]  Yusniel Hidalgo Delgado,et al.  Detección de comunidades a partir de redes de coautoría en grafos RDF , 2016 .