Valuating User Data in a Human-Centric Data Economy

The idea of paying people for their data is increasingly seen as a promising direction for resolving privacy debates, improving the quality of online data, and even offering an alternative to labor-based compensation in a future dominated by automation and self-operating machines. In this paper we demonstrate how a Human-Centric Data Economy would compensate the users of an online streaming service. We borrow the notion of the Shapley value from cooperative game theory to define what a fair compensation for each user should be for movie scores offered to the recommender system of the service. Since determining the Shapley value exactly is computationally inefficient in the general case, we derive faster alternatives using clustering, dimensionality reduction, and partial information. We apply our algorithms to a movie recommendation data set and demonstrate that different users may have a vastly different value for the service. We also analyze the reasons that some movie ratings may be more valuable than others and discuss the consequences for compensating users fairly.

[1]  Jaron Lanier,et al.  Who Owns the Future , 2013 .

[2]  D. Blackwell Equivalent Comparisons of Experiments , 1953 .

[3]  Philippe van Basshuysen,et al.  Radical Markets: Uprooting Capitalism and Democracy for a Just Society , 2019, Review of Political Economy.

[4]  Pablo Rodriguez,et al.  On economic heavy hitters: shapley value analysis of 95th-percentile pricing , 2010, IMC '10.

[5]  Pablo Rodriguez,et al.  From advertising profits to bandwidth prices: A quantitative methodology for negotiating premium peering , 2014, PERV.

[6]  Curtis R. Taylor,et al.  The Economics of Privacy , 2016 .

[7]  F. Maxwell Harper,et al.  The MovieLens Datasets: History and Context , 2016, TIIS.

[8]  J. S. Mateo The Shapley Value , 2012 .

[9]  Xiaotie Deng,et al.  On the Complexity of Cooperative Solution Concepts , 1994, Math. Oper. Res..

[10]  A. Roth The Shapley value , 2005, Game Theory.

[11]  L. Shapley A Value for n-person Games , 1988 .

[12]  Kaifeng Zhao,et al.  Shapley Value Methods for Attribution Modeling in Online Advertising , 2018, 1804.05327.

[13]  Daniel L. Moody,et al.  Measuring the Value Of Information - An Asset Valuation Approach , 1999, ECIS.

[14]  Timothy M. Chan,et al.  Computing Shapley Values in the Plane , 2018, Discrete & Computational Geometry.

[15]  Vijay Erramilli,et al.  Your browsing behavior for a big mac: economics of personal information online , 2011, WWW.

[16]  Marshall W. van Alstyne,et al.  Valuing Information & Instrumental Goods , 1998, ICIS.

[17]  L. S. Shapley,et al.  17. A Value for n-Person Games , 1953 .

[18]  Kelvin King A case study in the valuation of a database , 2007 .

[19]  Jörg Rothe,et al.  The Cost of Stability in Coalitional Games , 2009, SAGT.