Ten simple rules for responsible big data research

The use of big data research methods has grown tremendously over the past five years in both academia and industry. As the size and complexity of available datasets have grown, so too have the ethical questions raised by big data research. These questions become increasingly urgent as data and research agendas move well beyond those typical of the computational and natural sciences to more directly address sensitive aspects of human behavior, interaction, and health. The tools of big data research are increasingly woven into our daily lives through applications that include mining digital medical records for scientific and economic insights, mapping relationships via social media, capturing individuals’ speech and action via sensors, tracking movement across space, shaping police and security policy via “predictive policing,” and much more.
