Message Distortion in Information Cascades

Information diffusion is usually modeled as a process in which immutable pieces of information propagate over a network. In reality, however, messages are not immutable but may be altered at every step, so that distortions can accumulate into large deviations from the original. This process may lead to misinformation even in the absence of malevolent actors, and understanding it is crucial for modeling and improving online information systems. Here, we perform a controlled, crowdsourced experiment in which we simulate the propagation of information from medical research papers. Starting from the original abstracts, crowd workers iteratively shorten previously produced summaries to progressively shorter lengths. We also collect control summaries in which the original abstract is compressed directly to the final target length. Comparing cascades to controls allows us to separate the effect of the length constraint from that of accumulated distortion. Via careful manual coding, we annotate lexical and semantic units in the medical abstracts and track them along cascades. We find that iterative summarization has a negative impact due to the accumulation of error, but that high-quality intermediate summaries result in less distorted messages than in the control case. Different types of information behave differently; in particular, the conclusion of a medical abstract (i.e., its key message) is distorted most. Finally, we compare extractive with abstractive summaries, finding that the latter are less prone to semantic distortion. Overall, this work is a first step in studying information cascades without the assumption that disseminated content is immutable, with implications for our understanding of the role of word-of-mouth effects in the misreporting of science.
