Distilling the Outcomes of Personal Experiences: A Propensity-scored Analysis of Social Media

Millions of people regularly report the details of their real-world experiences on social media. This provides an opportunity to observe the outcomes of common and critical situations. Identifying and quantifying these outcomes may provide better decision-support and goal-achievement for individuals, and help policy-makers and scientists better understand important societal phenomena. We address several open questions about using social media data for open-domain outcome identification: Are the words people are more likely to use after some experience relevant to this experience? How well do these words cover the breadth of outcomes likely to occur for an experience? What kinds of outcomes are discovered? Studying 3-months of Twitter data capturing people who experienced 39 distinct situations across a variety of domains, we find that these outcomes are generally found to be relevant (55-100% on average) and that causally related concepts are more likely to be discovered than conceptual or semantically related concepts.

[1]  H. E. Burtt,et al.  A Study of Conversations. , 1924 .

[2]  A. Bandura Social learning theory , 1977 .

[3]  D. Rubin,et al.  Reducing Bias in Observational Studies Using Subclassification on the Propensity Score , 1984 .

[4]  Robin I. M. Dunbar,et al.  Human conversational behavior , 1997, Human nature.

[5]  Yoav Freund,et al.  Large Margin Classification Using the Perceptron Algorithm , 1998, COLT' 98.

[6]  S. Bikhchandani,et al.  Learning from the behavior of others : conformity, fads, and informational cascades , 1998 .

[7]  Andrew McCallum,et al.  A Machine Learning Approach to Building Domain-Specific Search Engines , 1999, IJCAI.

[8]  J. Robins,et al.  On the impossibility of inferring causation from association without background knowledge , 1999 .

[9]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[10]  Kimberly A. Neuendorf,et al.  The Content Analysis Guidebook , 2001 .

[11]  Erik T. Mueller,et al.  Open Mind Common Sense: Knowledge Acquisition from the General Public , 2002, OTM.

[12]  Hugo Liu,et al.  ConceptNet — A Practical Commonsense Reasoning Tool-Kit , 2004 .

[13]  Marco Caliendo,et al.  Some Practical Guidance for the Implementation of Propensity Score Matching , 2005, SSRN Electronic Journal.

[14]  E. Diener Guidelines for National Indicators of Subjective Well-Being and Ill-Being , 2006 .

[15]  P. Gollwitzer,et al.  Implementation intentions and goal achievement: A meta-analysis of effects and processes , 2006 .

[16]  Cliff Lampe,et al.  A face(book) in the crowd: social Searching vs. social browsing , 2006, CSCW '06.

[17]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[18]  Michael J. Muller,et al.  Motivations for social networking at work , 2008, CSCW.

[19]  Matthew Richardson,et al.  Learning about the world through long-term query logs , 2008, TWEB.

[20]  D. Funder,et al.  Personality as manifest in word use: correlations with self-report, acquaintance report, and behavior. , 2008, Journal of personality and social psychology.

[21]  Oren Etzioni,et al.  Open Information Extraction from the Web , 2007, CACM.

[22]  Adam N. Joinson,et al.  Looking at, looking up or keeping up with people?: motives and use of facebook , 2008, CHI.

[23]  J. Sekhon The Neyman— Rubin Model of Causal Inference and Estimation Via Matching Methods , 2008 .

[24]  Randy Goebel,et al.  Web-Scale N-gram Models for Lexical Disambiguation , 2009, IJCAI.

[25]  Cameron Marlow,et al.  Feed me: motivating newcomer contribution in social network sites , 2009, CHI.

[26]  James Caverlee,et al.  Ranking Comments on the Social Web , 2009, 2009 International Conference on Computational Science and Engineering.

[27]  Mark S. Ackerman,et al.  Questions in, knowledge in?: a study of naver's question answering community , 2009, CHI.

[28]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[29]  Susan T. Dumais,et al.  Classification-enhanced ranking , 2010, WWW '10.

[30]  David M. Pennock,et al.  Predicting consumer behavior with Web search , 2010, Proceedings of the National Academy of Sciences.

[31]  Mor Naaman,et al.  Is it really about me?: message content in social awareness streams , 2010, CSCW '10.

[32]  Vanessa May,et al.  What is Narrative Analysis , 2010 .

[33]  Kate Ehrlich,et al.  Microblogging Inside and Outside the Workplace , 2010, ICWSM.

[34]  M. Perugini,et al.  Can implementation intentions and text messages promote brisk walking? A randomized trial. , 2010, Health psychology : official journal of the Division of Health Psychology, American Psychological Association.

[35]  Andrei Z. Broder,et al.  Anatomy of the long tail: ordinary people with extraordinary tastes , 2010, WSDM '10.

[36]  David M. Pennock,et al.  What Can Search Predict? , 2010 .

[37]  Daniel Gayo-Avello Don't turn social media into another 'Literary Digest' poll , 2011, Commun. ACM.

[38]  E. Augustson,et al.  Cancer Survivorship in the Age of YouTube and Social Media: A Narrative Analysis , 2011, Journal of medical Internet research.

[39]  Barbara Poblete,et al.  Information credibility on twitter , 2011, WWW.

[40]  Nicholas Diakopoulos,et al.  Cooooooooooooooollllllllllllll!!!!!!!!!!!!!! Using Word Lengthening to Detect Sentiment in Microblogs , 2011, EMNLP.

[41]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[42]  Mark Dredze,et al.  You Are What You Tweet: Analyzing Twitter for Public Health , 2011, ICWSM.

[43]  Lydia B. Chilton,et al.  Addressing people's information needs directly in a web search result page , 2011, WWW.

[44]  Edith Law,et al.  Towards Large-Scale Collaborative Planning: Answering High-Level Search Queries Using Human Computation , 2011, AAAI.

[45]  Jason P. Mitchell,et al.  Disclosing information about the self is intrinsically rewarding , 2012, Proceedings of the National Academy of Sciences.

[46]  Mark S. Ackerman,et al.  Collaborative help in chronic disease management: supporting individualized problems , 2012, CSCW.

[47]  Catherine Havasi,et al.  Representing General Relational Knowledge in ConceptNet 5 , 2012, LREC.

[48]  Philipp Schaer,et al.  Better than Their Reputation? On the Reliability of Relevance Assessments with Students , 2012, CLEF.

[49]  Michael S. Bernstein,et al.  Direct answers for search queries in the long tail , 2012, CHI.

[50]  Jane Yung-jen Hsu,et al.  Contextual Commonsense Knowledge Acquisition from Social Content by Crowd-Sourcing Explanations , 2012, HCOMP@AAAI.

[51]  Eugene Agichtein,et al.  When web search fails, searchers become askers: understanding the transition , 2012, SIGIR '12.

[52]  Emre Kiciman,et al.  OMG, I Have to Tweet that! A Study of Factors that Influence Tweet Rates , 2012, ICWSM.

[53]  Henry A. Kautz,et al.  Predicting Disease Transmission from Geo-Tagged Micro-Blog Data , 2012, AAAI.

[54]  Scott Counts,et al.  Tweeting is believing?: understanding microblog credibility perceptions , 2012, CSCW.

[55]  Eric Horvitz,et al.  Social media as a measurement tool of depression in populations , 2013, WebSci.

[56]  W. Chapman,et al.  Using Twitter to Examine Smoking Behavior and Perceptions of Emerging Tobacco Products , 2013, Journal of medical Internet research.

[57]  Alessandro Vespignani,et al.  The Twitter of Babel: Mapping World Languages through Microblogging Platforms , 2012, PloS one.

[58]  Steven Diamond,et al.  TaskGenies: Automatically Providing Action Plans Helps People Complete Tasks , 2012, TCHI.

[59]  Margaret L. Kern,et al.  Personality, Gender, and Age in the Language of Social Media: The Open-Vocabulary Approach , 2013, PloS one.

[60]  E. Gabrilovich,et al.  Postmarket Drug Surveillance Without Trial Costs: Discovery of Adverse Drug Reactions Through Large-Scale Analysis of Web Search Queries , 2013, Journal of medical Internet research.

[61]  José van Dijck,et al.  'You have one identity': performing the self on Facebook and LinkedIn , 2013 .

[62]  Mor Naaman,et al.  Fitter with Twitter: Understanding Personal Health and Fitness Activity in Social Media , 2013, ICWSM.

[63]  Eric Horvitz,et al.  Predicting Depression via Social Media , 2013, ICWSM.

[64]  Eric Horvitz,et al.  Major life changes and behavioral markers in social media: case of childbirth , 2013, CSCW.

[65]  Xuchen Yao,et al.  Information Extraction over Structured Data: Question Answering with Freebase , 2014, ACL.

[66]  Venkata Rama Kiran Garimella,et al.  Inferring international and internal migration patterns from Twitter data , 2014, WWW.

[67]  Venkata Rama Kiran Garimella,et al.  From "I Love You Babe" to "Leave Me Alone" - Romantic Relationship Breakups on Twitter , 2014, SocInfo.

[68]  Jure Leskovec,et al.  How Community Feedback Shapes User Behavior , 2014, ICWSM.

[69]  Claire Cardie,et al.  Sentiment analysis on evolving social streams: how self-report imbalances can help , 2014, WSDM.

[70]  Zeynep Tufekci,et al.  Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls , 2014, ICWSM.

[71]  Michael Massimi,et al.  Life transitions and online health communities: reflecting on adoption, use, and disengagement , 2014, CSCW.

[72]  Oren Etzioni,et al.  Open question answering over curated and extracted knowledge bases , 2014, KDD.

[73]  Matthew Richardson,et al.  Towards Decision Support and Goal Achievement: Identifying Action-Outcome Relationships From Social Media , 2015, KDD.

[74]  Ee-Peng Lim,et al.  Characterizing Silent Users in Social Media Communities , 2015, ICWSM.

[75]  B. Nardi,et al.  Online Media Forums as Separate Social Lives: A Qualitative Study of Disclosure Within and Beyond Reddit , 2015 .

[76]  Ricardo Baeza-Yates,et al.  Wisdom of the Crowd or Wisdom of a Few?: An Analysis of Users' Content Generation , 2015, HT.

[77]  Carlos Castillo,et al.  What to Expect When the Unexpected Happens: Social Media Communications Across Crises , 2015, CSCW.

[78]  Ryen W. White,et al.  Exploring Time-Dependent Concerns about Pregnancy and Childbirth from Search Logs , 2015, CHI.

[79]  Kat Austen,et al.  What could derail the wearables revolution? , 2015, Nature.

[80]  Derek Ruths,et al.  Organizations Are Users Too: Characterizing and Detecting the Presence of Organizations on Twitter , 2015, ICWSM.

[81]  Wanda Pratt,et al.  Self-Characterized Illness Phase and Information Needs of Participants in an Online Cancer Forum , 2015, ICWSM.

[82]  Aron Culotta,et al.  Using matched samples to estimate the effects of exercise on mental health from twitter , 2015, AAAI 2015.

[83]  Haixun Wang,et al.  Short text understanding through lexical-semantic analysis , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[84]  Sofiane Abbar,et al.  You Tweet What You Eat: Studying Food Consumption Through Twitter , 2014, CHI.

[85]  Andrew T. Perrin Social Media Usage: 2005-2015 , 2015 .

[86]  Ryen W. White,et al.  Diagnoses, Decisions, and Outcomes: Web Search as Decision Support for Cancer , 2015, WWW.

[87]  Chul Lee,et al.  Persistent Sharing of Fitness App Status on Twitter , 2015, CSCW.

[88]  Virgile Landeiro,et al.  Robust Text Classification in the Presence of Confounding Bias , 2016, AAAI.

[89]  Mark Dredze,et al.  Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media , 2016, CHI.

[90]  Sean A. Munson,et al.  PlanSourcing: Generating Behavior Change Plans with Friends and Crowds , 2016, CSCW.

[91]  Robert E. Kraut,et al.  Modeling Self-Disclosure in Social Networking Sites , 2016, CSCW.

[92]  Rana El Kaliouby,et al.  On the Future of Personal Assistants , 2016, CHI Extended Abstracts.

[93]  G. Imbens,et al.  Efficient Inference of Average Treatment Effects in High Dimensions via Approximate Residual Balancing , 2016 .

[94]  Zahra Ashktorab,et al.  Designing Cyberbullying Mitigation and Prevention Solutions through Participatory Design With Teenagers , 2016, CHI.

[95]  Michael S. Bernstein,et al.  Augur: Mining Human Behaviors from Fiction to Power Interactive Systems , 2016, CHI.

[96]  Walid Magdy,et al.  #FailedRevolutions: Using Twitter to study the antecedents of ISIS support , 2015, First Monday.

[97]  Scott Counts,et al.  The psychology of job loss: using social media data to characterize and predict unemployment , 2016, WebSci.

[98]  Richard A. Nielsen,et al.  Why Propensity Scores Should Not Be Used for Matching , 2019, Political Analysis.