Privacy, Altruism, and Experience: Estimating the Perceived Value of Internet Data for Medical Uses

People increasingly turn to the Internet when they have a medical condition. The data they create during this process is a valuable source for medical research and future health services. When used for these purposes, it is imperative to balance use with user privacy. One way to understand how to harmonize these requirements is to match the perceived value that users assign to their data with the value of the services derived from them. Here we describe novel experiments where methods from Mechanism Design, Crowdsourcing and Data Science were used together to elicit truthful valuations from users for their Internet data and for services derived from these data, specifically for medical screening. In these experiments, 880 people from around the world were asked to participate in an auction to provide their data for uses differing in their contribution to the participant and to society, and in the disease they addressed. Some users were offered monetary compensation for their participation, while others were asked to pay to participate. Our findings show that 99% of people were willing to contribute their data in exchange for monetary compensation and an analysis of their data, while 53% were willing to pay to have their data analyzed. The average perceived value users assigned to their data was estimated at US$49. Their value for services which offer personal benefit to them was US$22, while the value of this service offered to the general public was US$20. Our findings show that it is possible to place a monetary value on health-related uses of highly personal data. Our methodology can be extended to other areas where sensitive data may be exchanged for services to individuals and to society.

[1]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[2]  V. Bosnes,et al.  Motivation, recruitment and retention of voluntary non‐remunerated blood donors: a survey‐based questionnaire study , 2005, Vox sanguinis.

[3]  Gianclaudio Malgieri,et al.  Pricing Privacy – The Right to Know the Value of Your Personal Data , 2017, Comput. Law Secur. Rev..

[4]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[5]  Elad Yom-Tov,et al.  Crowdsourced Health: How What You Do on the Internet Will Improve Medicine , 2016 .

[6]  William Vickrey,et al.  Counterspeculation, Auctions, And Competitive Sealed Tenders , 1961 .

[7]  Eric J. Johnson,et al.  The Construction of Preference: Do Defaults Save Lives? , 2006 .

[8]  Keith Dewar The value exchange , 2017 .

[9]  Ryen W. White,et al.  Detecting neurodegenerative disorders from web search signals , 2018, npj Digital Medicine.

[10]  David R. Cox,et al.  The Oxford Dictionary of Statistical Terms , 2006 .

[11]  E. Yom-Tov,et al.  Can internet search engine queries be used to diagnose diabetes? Analysis of archival search data , 2019, Acta Diabetologica.

[12]  Ryen W. White,et al.  Evaluation of the Feasibility of Screening Patients for Early Signs of Lung Carcinoma in Web Search Logs , 2017, JAMA oncology.

[13]  M. Massagli,et al.  Accelerated clinical discovery using self-reported patient data collected online and a patient-matching algorithm , 2011, Nature Biotechnology.

[14]  A. Acquisti,et al.  Beyond the Turk: Alternative Platforms for Crowdsourcing Behavioral Research , 2016 .

[15]  Elad Yom-Tov,et al.  Detecting Parkinson's Disease from Interactions with a Search Engine: Is Expert Knowledge Sufficient? , 2018, CIKM.

[16]  Elad Yom-Tov,et al.  Online Concerns of Parents Suspecting Autism Spectrum Disorder in Their Child: Content Analysis of Signs and Automated Prediction of Risk , 2016, Journal of medical Internet research.

[17]  Vijay Erramilli,et al.  Your browsing behavior for a big mac: economics of personal information online , 2011, WWW.

[18]  Ron Lavi,et al.  Algorithmic Mechanism Design , 2008, Encyclopedia of Algorithms.

[19]  Marc Walter,et al.  Motivation for Living-Donor Liver Transplantation from the Donor’s Perspective: An In-Depth Qualitative Research Study , 2004, Transplantation.

[20]  Ryen W. White,et al.  Screening for Pancreatic Adenocarcinoma Using Signals From Web Search Logs: Feasibility Study and Results. , 2016, Journal of oncology practice.

[21]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[22]  Elad Yom-Tov,et al.  Inferring Individual Attributes from Search Engine Queries and Auxiliary Information , 2016, WWW.

[23]  George Danezis,et al.  How Much Is Location Privacy Worth? , 2005, WEIS.

[24]  G. Debreu Topological Methods in Cardinal Utility Theory , 1959 .

[25]  Logan Kugler The war over the value of personal data , 2018, Commun. ACM.