Identifying and mitigating risks to the quality of open data in the post-truth era

Big Data analysis often relies on open data, integrating it with large private data sets, using it as ground truth information, or providing it as part of the input to large simulations. Data can be released openly by governments to achieve various objectives: transparency, informing citizen engagement, or supporting private enterprise, to name a few. To the latter objective, Big Data analytics algorithms rely on high-quality, timely access to various data sources, including open data. Examples include retail analytics drawing on open demographic data and weather forecast systems drawing on open weather and climate data. In this paper, we describe the rise of post-truth in society, and the risks this poses to the quality, integrity, and authenticity of open data. We also discuss approaches to identifying, assessing, and mitigating these risks, and suggest future steps to manage this data quality concern.

[1]  Thomas Redman,et al.  The impact of poor data quality on the typical enterprise , 1998, CACM.

[2]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[3]  Michael Gross The dangers of a post-truth world , 2017, Current Biology.

[4]  Joel Gurin,et al.  Open Governments, Open Data: A New Lever for Transparency, Citizen Engagement, and Economic Growth , 2014 .

[5]  Sean D Dessureault,et al.  Understanding big data , 2016 .

[6]  Alieda Blandford,et al.  Examining the role of information in the civic engagement of youth , 2015, ASIST.

[7]  Antonino Virgillito Carlo Marchetti,et al.  The DaQuinCIS Architecture : a Platform for Exchanging and Improving Data Quality in Cooperative Information Systems ? , 2003 .

[8]  Mercè Crosas,et al.  The Dataverse Network®: An Open-Source Application for Sharing, Discovering and Preserving Data , 2011, D Lib Mag..

[9]  D. Harrison McKnight,et al.  Perceived Information Quality in Data Exchanges: Effects on Risk, Trust, and Intention to Use , 2006, Inf. Syst. Res..

[10]  Niels Bjørn-Andersen,et al.  Data-Driven Innovation through Open Government Data , 2014, J. Theor. Appl. Electron. Commer. Res..

[11]  Richard Heeks,et al.  The multiple meanings of open government data: Understanding different stakeholders and their perspectives , 2015, Gov. Inf. Q..

[12]  Judit Dobránszki,et al.  Potential Dangers with Open Access Data Files in the Expanding Open Data Movement , 2015 .

[13]  David Stuart,et al.  The Data Revolution: Big Data, Open Data, Data Infrastructures and Their Consequences , 2015, Online Inf. Rev..

[14]  I. Boyd Take the long view , 2016, Nature.

[15]  Michael X. Delli Carpini,et al.  In search of the informed citizen: What Americans know about politics and why it matters , 2000 .

[16]  Kieron O'Hara,et al.  Transparency, open data and trust in government: shaping the infosphere , 2012, WebSci '12.

[17]  Tim Davies,et al.  Open data, democracy and public sector reform. A look at open government data use from data.gov.uk , 2010 .

[18]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[19]  Clay Shirky Here Comes Everybody: The Power of Organizing Without Organizations , 2008 .

[20]  Stan Matwin,et al.  Strategies and Best Practices for Data Literacy Education: Knowledge Synthesis Report , 2015 .