Time Dependency, Data Flow, and Competitive Advantage

Data is fundamental to machine learning-based products and services and is considered strategic because of its externalities for businesses, governments, non-profits, and society more generally. It is well known that the value of organizations (businesses, government agencies and programs, and even industries) scales with the volume of available data. What is often less appreciated is that the value of data for making useful organizational predictions varies widely and depends heavily on the characteristics of the data and the underlying algorithms. In this research, we study how the value of data changes over time and how this change varies across contexts and business areas (e.g., next-word prediction in the context of history, sports, or politics). We focus on data from Reddit.com and compare the time dependency of data value across Reddit topics (subreddits). We make this comparison by measuring the rate at which user-generated text data loses its relevance to the algorithmic prediction of conversations. We show that different subreddits exhibit different rates of relevance decline over time. Relating the text topics to business areas of interest, we argue that competing in a business area in which data value decays rapidly alters the strategies for acquiring competitive advantage. When data value decays rapidly, access to a continuous flow of data is more valuable than access to a fixed stock of data. In this kind of setting, improving user engagement and growing the user base help create and maintain a competitive advantage.
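As an illustration of what comparing rates of relevance decline across topics could look like, the sketch below fits a decay rate and half-life to a series of relevance scores indexed by data age. It is not the method used in this research: the exponential functional form, the function names `fit_decay_rate` and `half_life`, and the two synthetic series are all assumptions made for the example, and the numbers are generated from known rates rather than taken from Reddit.

```python
import math

def fit_decay_rate(ages, relevance):
    """Fit log(relevance) = log(r0) - lam * age by ordinary least squares.

    Assumes (hypothetically) that relevance decays exponentially with data age.
    Returns (r0, lam), where lam is the decay rate per unit of age.
    """
    logs = [math.log(r) for r in relevance]
    n = len(ages)
    mean_age = sum(ages) / n
    mean_log = sum(logs) / n
    sxx = sum((a - mean_age) ** 2 for a in ages)
    sxy = sum((a - mean_age) * (l - mean_log) for a, l in zip(ages, logs))
    slope = sxy / sxx
    intercept = mean_log - slope * mean_age
    return math.exp(intercept), -slope

def half_life(lam):
    """Age at which relevance falls to half its initial value."""
    return math.log(2) / lam

# Synthetic, illustrative series only: relevance of training data by age (in years).
ages = [0, 1, 2, 3, 4, 5]
fast_topic = [math.exp(-0.5 * a) for a in ages]    # hypothetical fast-decaying topic
slow_topic = [math.exp(-0.05 * a) for a in ages]   # hypothetical slow-decaying topic

_, lam_fast = fit_decay_rate(ages, fast_topic)
_, lam_slow = fit_decay_rate(ages, slow_topic)

print(f"fast topic: rate={lam_fast:.3f}, half-life={half_life(lam_fast):.2f} years")
print(f"slow topic: rate={lam_slow:.3f}, half-life={half_life(lam_slow):.2f} years")
```

Under these assumptions, a topic with a short half-life is one where a continuous flow of fresh data matters more than a large fixed stock, which is the distinction the abstract draws.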
