Towards a political theory of data justice: a public good perspective

Purpose: This study aims to develop an interdisciplinary political theory of data justice. It connects three major political theories of the public good with empirical studies of the functions of big data, and it offers normative principles for restricting and guiding the state's data practices from a public good perspective.

Design/methodology/approach: Drawing on three major political theories of the public good (the market failure approach, the basic rights approach and the democratic approach) and on critical data studies, this study synthesizes existing research on the promises and perils of big data for public good purposes. The outcome is a conceptual paper that maps philosophical discussions about the conditions under which the state has a legitimate right to collect and use big data for public good purposes.

Findings: This study argues that, from the perspective of political theories of the public good, market failure, basic rights protection and deepening democracy can serve as normative grounds for justifying the state's right to collect and use data. The state's data practices, however, should be guided by three political principles: the principle of transparency and accountability, the principle of fairness and the principle of democratic legitimacy. The paper draws on empirical studies and practical examples to explicate these principles.

Originality/value: By bringing together normative political theory and critical data studies, this study contributes to a more philosophically rigorous understanding of how and why big data should be used for public good purposes, and it discusses the normative boundaries of such data practices.
