Enhancing the Quality of Open Data

This paper examines some of the quality issues relating to open data. Assessing quality is problematic because of a paradox specific to open data: most metrics of quality are user-relative, but open data are aimed at no specific user and are simply made available online under an open licence, so there is no particular user against whose needs quality can be judged. Nevertheless, the paper argues that opening data to scrutiny can improve quality by building feedback into the data production process, although much depends on the context of publication. The paper discusses various heuristics for addressing quality and also looks at institutional approaches. Furthermore, if open data can be published in linkable or bookmarkable form using Semantic Web technologies, this will provide further mechanisms for improving quality.
