Is the LOD cloud at risk of becoming a museum for datasets? Looking ahead towards a fully collaborative and sustainable LOD cloud

The Linked Open Data (LOD) cloud has been around since 2007. Throughout the years, this prominent depiction served as the epitome for Linked Data and acted as a starting point for many. In this article we perform a number of experiments on the dataset metadata provided by the LOD cloud, in order to understand better whether the current visualised datasets are accessible and with an open license. Furthermore, we perform quality assessment of 17 metrics over accessible datasets that are part of the LOD cloud. These experiments were compared with previous experiments performed on older versions of the LOD cloud. The results showed that there was no improvement on previously identified problems. Based on our findings, we therefore propose a strategy and architecture for a potential collaborative and sustainable LOD cloud.

[1]  Krzysztof Janowicz,et al.  Linked Data, Big Data, and the 4th Paradigm , 2013, Semantic Web.

[2]  Christoph Lange,et al.  Representing dataset quality metadata using multi-dimensional views , 2014, SEM '14.

[3]  Anja Jentzsch Linked Open Data Cloud , 2014 .

[4]  Christoph Lange,et al.  Luzzu—A Methodology and Framework for Linked Data Quality Assessment , 2016, JDIQ.

[5]  Michael Hausenblas,et al.  Describing linked datasets with the VoID vocabulary , 2011 .

[6]  Heiko Paulheim,et al.  Adoption of the Linked Data Best Practices in Different Topical Domains , 2014, SEMWEB.

[7]  Raphaël Troncy,et al.  What's up LOD Cloud? Observing The State of Linked Open Data Cloud Metadata , 2015, LDQ@ESWC.

[8]  Christoph Lange,et al.  Linked Data Notifications: A Resource-Centric Communication Protocol , 2017, ESWC.

[9]  Dimitris Kontokostas,et al.  IDOL: Comprehensive & Complete LOD Insights , 2017, SEMANTICS.

[10]  Jürgen Umbrich,et al.  An empirical survey of Linked Data conformance , 2012, J. Web Semant..

[11]  Christoph Lange,et al.  Evaluating the quality of the LOD cloud: An empirical investigation , 2018, Semantic Web.

[12]  Peroni Silvio,et al.  Media type as Linked Open Data , 2015 .

[13]  Asunción Gómez-Pérez,et al.  A dataset of RDF licenses , 2014, JURIX.

[14]  Jean-Paul Calbimonte Linked Data Notifications for RDF Streams , 2017, WSP/WOMoCoE@ISWC.

[15]  Педагогика Open Knowledge Foundation , 2010 .

[16]  Divyakant Agrawal,et al.  Duplicate detection in click streams , 2005, WWW '05.

[17]  Sören Auer,et al.  Linked SDMX Data: Path to high fidelity Statistical Linked Data , 2015, Semantic Web.

[18]  Andreas Harth,et al.  Weaving the Pedantic Web , 2010, LDOW.