Tools for Validating Causal and Predictive Claims in Social Science Models

Abstract To better understand and describe the world around them, and ultimately produce theories that can facilitate better decision making or policy interventions, social scientists need to be able to create, analyze, and validate social, political, and economic models that include causal and predictive elements. Causality is, however, notoriously difficult to analyze. This is especially true in social science due to the complexity of the phenomena being studied. To address this complexity, we describe a suite of causal/predictive analysis techniques adapted from a variety of social, natural, and computational science applications, specifically chosen for their unique applicability to the problems of analyzing temporally offset causes and effects. In particular, we describe four methods for analyzing predictive/causal claims: (1) Granger causality is a well-established method from econometrics that can identify relationships between temporally offset causes and effects when the offsets are fixed; (2) forward-only dynamic time-warping (DTW) addresses uneven temporal offsets; (3) convergent cross mapping (CCM) can be used to analyze bi-directional causality produced by the feedback relationships present in many social systems; (4) finally, we describe a novel feature-based qualitative pattern-recognition approach to identify and explain qualitative causal/predictive relationships that don’t fit more traditional correlation-based analysis techniques. Social scientists can use this mix of analytic techniques to validate (or more-precisely, to invalidate) their causal/predictive hypotheses and produce more robust understandings of complex systems. We present prototype software implementations of these analytic techniques and demonstrate the efficacy of our proposed approaches.

[1]  C. Granger Investigating Causal Relations by Econometric Models and Cross-Spectral Methods , 1969 .

[2]  Simeon Djankov,et al.  Poverty and Civil War: Revisiting the Evidence , 2008, The Review of Economics and Statistics.

[3]  Gerardo L. Munck Canons of research design in qualitative analysis , 1998 .

[4]  Lotta Themnér,et al.  Armed conflicts, 1946–2013 , 2014 .

[5]  Amy Sliva,et al.  A Big Data Methodology for Bridging Quantitative and Qualitative Political Science Research , 2014 .

[6]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[7]  J. Goldthorpe Causation, statistics, and sociology , 1998 .

[8]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[9]  Henry E. Brady,et al.  Toward a Pluralistic Vision of Methodology , 2006, Political Analysis.

[10]  George Sugihara,et al.  Detecting Causality in Complex Ecosystems , 2012, Science.

[11]  John R. Freeman Granger Causality and the Time Series Analysis of Political Relationships , 1983 .

[12]  P. Collier,et al.  Greed and Grievance in Civil War , 1999 .

[13]  Philip Chan,et al.  Toward accurate dynamic time warping in linear time and space , 2007, Intell. Data Anal..

[14]  Avi Pfeffer,et al.  A Metamodel Description Language for HSCB Modeling , 2010 .

[15]  David Hudson,et al.  Does poverty cause conflict? Isolating the causal origins of the conflict trap , 2016 .

[16]  Frank E. Harrell,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2001 .

[17]  C. Granger Testing for causality: a personal viewpoint , 1980 .