Towards Dataset Dynamics: Change Frequency of Linked Open Data Sources

Datasets in the LOD cloud are far from being static in their nature and how they are exposed. As resources are added and new links are set, applications consuming the data should be able to deal with these changes. In this paper we investigate how LOD datasets change and what sensible measures there are to accommodate dataset dynamics. We compare our findings with traditional, document-centric studies concerning the “freshness” of the document collections and propose metrics for LOD datasets.

[1]  Andreas Harth,et al.  Scalable Authoritative OWL Reasoning for the Web , 2009, Int. J. Semantic Web Inf. Syst..

[2]  M. Hausenblas,et al.  What is the Size of the Semantic Web ? , 2008 .

[3]  Jürgen Umbrich,et al.  Data summaries for on-demand queries over linked data , 2010, WWW '10.

[4]  Giovanni Tummarello,et al.  RDFSync: Efficient Remote Synchronization of RDF Models , 2007, ISWC/ASWC.

[5]  Hector Garcia-Molina,et al.  Estimating frequency of change , 2003, TOIT.

[6]  Surendra Reddy Requirements for Event Notification Protocol , 1998 .

[7]  Yuzhong Qu,et al.  Term Dependence on the Semantic Web , 2008, SEMWEB.

[8]  Jürgen Umbrich,et al.  MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data , 2006, SEMWEB.

[9]  Hector Garcia-Molina,et al.  The Evolution of the Web and Implications for an Incremental Crawler , 2000, VLDB.

[10]  Bernhard Haslhofer,et al.  DSNotify - Detecting and Fixing Broken Links in Linked Data Sets , 2009, 2009 20th International Workshop on Database and Expert Systems Application.

[11]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[12]  A. Nico Habermann,et al.  Beyond schema evolution to database reorganization , 1990, OOPSLA/ECOOP '90.

[13]  George Cybenko,et al.  How dynamic is the Web? , 2000, Comput. Networks.

[14]  Andreas Harth,et al.  Analysing Dependency Dynamics in Web Data , 2010, AAAI Spring Symposium: Linked Data Meets Artificial Intelligence.

[15]  Sandeep Pandey,et al.  Monitoring the dynamic web to respond to continuous queries , 2003, WWW '03.

[16]  Erhard Rahm,et al.  Analyzing the Evolution of Life Science Ontologies and Mappings , 2008, DILS.

[17]  Dmitri Loguinov,et al.  IRLbot: Scaling to 6 billion pages and beyond , 2009, TWEB.

[18]  Roy T. Fielding,et al.  Principled design of the modern Web architecture , 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium.

[19]  D. Fensel,et al.  Architecture of the World Wide Web , Volume One , 2004 .

[20]  Lars R. Clausen,et al.  Concerning Etags and Datestamps , 2004 .