Causal discovery in social media using quasi-experimental designs

Social media systems have become increasingly attractive to both users and companies providing those systems. Efficient management of these systems is essential and requires knowledge of cause-and-effect relationships within the system. Online experimentation can be used to discover causal knowledge; however, this ignores the observational data that is already being collected for operational purposes. Quasi-experimental designs (QEDs) are commonly used in social sciences to discover causal knowledge from observational data, and QEDs can be exploited to discover causal knowledge about social media systems. In this paper, we apply three different QEDs to demonstrate how one can gain a causal understanding of a social media system. The conclusions drawn from using a QED can have threats to their validity, but we show how one can carefully construct sophisticated designs to overcome some of those threats.

[1]  Rory A. Fisher,et al.  The Arrangement of Field Experiments , 1992 .

[2]  Ron Kohavi,et al.  Online Experimentation at Microsoft , 2009 .

[3]  Heli Väätäjä,et al.  Rewarding in open innovation communities – how to motivate members , 2010 .

[4]  David C. Parkes,et al.  Designing incentives for online question and answer forums , 2009, EC '09.

[5]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[6]  S Thomke,et al.  Enlightened experimentation. The new imperative for innovation. , 2001, Harvard business review.

[7]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[8]  Eugene Agichtein,et al.  Finding the right facts in the crowd: factoid question answering over social media , 2008, WWW.

[9]  Marcelo Maia,et al.  Identifying user behavior in online social networks , 2008, SocialNets '08.

[10]  Tom Burr,et al.  Causation, Prediction, and Search , 2003, Technometrics.

[11]  John W. Tukey,et al.  Statistical Methods for Research Workers , 1930, Nature.

[12]  Brian J. Taylor,et al.  Automatic identification of quasi-experimental designs for discovering causal knowledge , 2008, KDD.

[13]  Brian J. Taylor,et al.  Learning Causal Models of Relational Domains , 2010, AAAI.

[14]  Sheizaf Rafaeli,et al.  Predictors of answer quality in online Q&A sites , 2008, CHI.

[15]  Mohan S. Kankanhalli,et al.  Motivating contributors in social media networks , 2009, WSM@MM.

[16]  Lada A. Adamic,et al.  Knowledge sharing and yahoo answers: everyone knows something , 2008, WWW.

[17]  Ron Kohavi,et al.  Responsible editor: R. Bayardo. , 2022 .

[18]  W. Shadish,et al.  Experimental and Quasi-Experimental Designs for Generalized Causal Inference , 2001 .

[19]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[20]  Alex Rolfe,et al.  Ask-an-expert services analysis , 2001, J. Assoc. Inf. Sci. Technol..

[21]  Mark S. Ackerman,et al.  Questions in, knowledge in?: a study of naver's question answering community , 2009, CHI.

[22]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.