Forecasting Political Conict in Asia and the Middle East using Latent Dirichlet Allocation Models

Latent Dirichlet allocation models are a relatively new computational classification algorithm. In its standard application to document classification, the model assumes each document to be composed of a mixture of multiple, overlapping topics each with a typical set of words, and classification is done by associating words in a document with the latent topics most likely to have generated the observed distribution of those words. I apply this technique to the problem of political forecasting by assuming that the stream of events observed between a dyad of actors is a mixture of a variety of different political strategies and standard operating procedures (for example escalation of repressive measures against a minority group while simultaneously making efforts to co-opt the elites of that group). By identifying the dominant strategies being pursued at time t, one gets information that can be used to forecast likely patterns of interaction at a later time t + k. This approach is applied to event data generated for 29 Asian countries in the Integrated Conflict Early Warning System project for 1998-2010 to forecast the ICEWS conflict measures for rebellion, insurgency, ethno-religious violence, domestic political conflict and international conflict at a six month lead time, and to the Israel-Palestine and Israel-Lebanon dyads from the KEDS Levant data set for 1979-2009. In random samples balancing the occurrence of negative and positive outcomes on the dependent variable, LDA combined with a logistic model predicts with around 60% to 70% accuracy in in-sample evaluation in both data sets, and improves very substantially on the sensitivity of the classification compared with simple logistic models in full samples. A supervised version of LDA, however, does not provide much improvement over the unsupervised version, and shows some pathological behaviors. Some structure can be found in the factors, though more work is needed on this.

[1]  Philip A. Schrodt Parallel event sequences in international crises , 1990 .

[2]  Jens Meierhenrich Analogies at War , 2006 .

[3]  Gary King,et al.  Aggregation Among Binary, Count, and Duration Models: Estimating the Same Quantities from Different Levels of Data , 2008, Political Analysis.

[4]  Philip A. Schrodt Forecasting Conflict in the Balkans using Hidden Markov Models , 2006 .

[5]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[6]  Philip A. Schrodt Automated Production of High-Volume, Real-Time Political Event Data , 2010 .

[7]  J. Jenkins,et al.  Mapping Mass Political Conflict and Civil Society , 1997 .

[8]  J. Jenkins,et al.  Conflict-Carrying Capacity, Political Crisis, and Reconstruction , 2001 .

[9]  Philip A. Schrodt Inductive Event Data Scaling using Item Response Theory , 2007 .

[10]  Philip A. Schrodt,et al.  PREDICTING INTRASTATE CONFLICT ONSET : AN EVENT DATA APPROACH USING EUCLIDEAN AND LEVENSHTEIN DISTANCE MEASURES , 2011 .

[11]  Philip A. Schrodt Pattern Recognition of International Crises using Hidden Markov Models , 2007 .

[12]  James Joll Thinking in time: the uses of history for decision-makers , 1988 .

[13]  Jon C. W. Pevehouse,et al.  Serbian Compliance or Defiance in Kosovo? , 1999 .

[14]  Terri Gullickson Analyzing Interaction: Sequential Analysis With SDIS and GSEQ. , 1996 .

[15]  Philip A. Schrodt,et al.  Discrete Sequence Rule Models as a Social Science Methodology: An Exploratory Analysis of Foreign Policy Rule Enactment within Palestinian-Israeli Event Data , 2008 .

[16]  Ted Robert Gurr,et al.  Systematic Early Warning of Humanitarian Emergencies , 1998 .

[17]  John R. Freeman Systematic Sampling, Temporal Aggregation, and the Study of Political Relationships , 1989, Political Analysis.

[18]  V. Hudson,et al.  Artificial Intelligence and International Politics Gaming , 2020 .

[19]  Stephen M. Shellman,et al.  Time Series Intervals and Statistical Inference: The Effects of Temporal Aggregation on Event Data Analysis , 2004, Political Analysis.

[20]  A. Hämmerli,et al.  Conflict and Cooperation in an Actors' Network of Chechnya Based on Event Data , 2006 .

[21]  Philip A. Schrodt PATTERNS, RULES AND LEARNING: COMPUTATIONAL MODELS OF INTERNATIONAL BEHAVIOR , 2004 .

[22]  Sean P. O'Brien,et al.  Crisis Early Warning and Decision Support: Contemporary Approaches and Thoughts on Future Research , 2010 .

[23]  C. McClelland World event/interaction survey , 1978 .

[24]  Philip A. Schrodt,et al.  Validity Assessment of a Machine-Coded Event Data Set for the Middle East, 1982-92 , 1994 .

[25]  Joe Bond,et al.  Integrated Data for Events Analysis (IDEA): An Event Typology for Automated Events Data Development , 2003 .

[26]  Patrick T. Brandt,et al.  Advances in Bayesian Time Series Modeling and the Study of Politics: Theory Testing, Forecasting, and Policy Analysis , 2006, Political Analysis.

[27]  W. Thompson,et al.  Reconsidering the Aggregate Relationship between Size, Economic Development, and Some Types of Foreign Policy Behavior , 1980 .

[28]  Kristin M. Bakke,et al.  The perils of policy by p-value: Predicting civil conflicts , 2010 .

[29]  Gary King,et al.  An Automated Information Extraction Tool for International Conflict Data with Performance as Good as Human Coders: A Rare Events Evaluation Design , 2003, International Organization.

[30]  Philip A. Schrodt,et al.  Analyzing the dynamics of international mediation processes in the Middle East and Balkans , 2001 .

[31]  Brandon M Stewart,et al.  Predicting Risk Factors Associated with Forced Migration: An Early Warning Model of Haitian Flight , 2007 .

[32]  E. Azar The Conflict and Peace Data Bank (COPDAB) Project , 1980 .

[33]  Ernest R. May,et al.  "Lessons" of the Past: The Use and Misuse of History in American Foreign Policy , 1974 .

[34]  Joshua S. Goldstein A Conflict-Cooperation Scale for WEIS Events Data , 1992 .

[35]  Philip A. Schrodt,et al.  MACHINE CODING OF EVENT DATA USING REGIONAL AND INTERNATIONAL SOURCES , 1994 .

[36]  Philip A. Schrodt,et al.  An Event Data Analysis of Third-Party Mediation in the Middle East and Balkans , 2004 .

[37]  Robert Shearer Forecasting Israeli-Palestinian Conflict with Hidden Markov Models , 2007 .

[38]  Philip A. Schrodt Automated Production of High-Volume, Near-Real-Time Political Event Data , 2011 .

[39]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[40]  S. Shellman Process Matters: Conflict and Cooperation in Sequential Government-Dissident Interactions , 2006 .