The Hard Problem of Prediction for Conflict Prevention

There is a growing interest in better conflict prevention and this provides a strong motivation for better conflict forecasting. A key problem of conflict forecasting for prevention is that predicting the start of conflict in previously peaceful countries is extremely hard. To make progress in this hard problem this project exploits both supervised and unsupervised machine learning. Specifically, the latent Dirichlet allocation (LDA) model is used for feature extraction from 3.8 million newspaper articles and these features are then used in a random forest model to predict conflict. We find that forecasting hard cases is possible and benefits from supervised learning despite the small sample size. Several topics are negatively associated with the outbreak of conflict and these gain importance when predicting hard onsets. The trees in the random forest use the topics in lower nodes where they are evaluated conditionally on conflict history, which allows the random forest to adapt to the hard problem and provides useful forecasts for prevention.

[1]  Christian Rauh,et al.  Reading Between the Lines: Prediction of Political Violence Using Newspaper Text , 2016, American Political Science Review.

[2]  C. Granger,et al.  Handbook of Economic Forecasting , 2006 .

[3]  Sang Michael Xie,et al.  Combining satellite imagery and machine learning to predict poverty , 2016, Science.

[4]  Tatevik Sekhposyan,et al.  Macroeconomic uncertainty indices based on nowcast and forecast error distributions , 2015 .

[5]  A. Ciccone,et al.  International Commodity Prices and Civil War Outbreak: New Evidence for Sub-Saharan Africa and Beyond , 2018, SSRN Electronic Journal.

[6]  Petr Sojka,et al.  Software Framework for Topic Modelling with Large Corpora , 2010 .

[7]  Erik Melander,et al.  Introducing the UCDP Georeferenced Event Dataset , 2013 .

[8]  Philip A. Schrodt,et al.  Data-based Computational Approaches to Forecasting Political Violence , 2013 .

[9]  Malcolm Chalmers,et al.  SPENDING TO SAVE? THE COST‐EFFECTIVENESS OF CONFLICT PREVENTION , 2007 .

[10]  Nicholas Bloom,et al.  The World Uncertainty Index , 2018, SSRN Electronic Journal.

[11]  Michael D. Ward,et al.  Learning from the Past and Stepping into the Future: Toward a New Generation of Conflict Prediction , 2013 .

[12]  Stelios Michalopoulos,et al.  The Long-Run E ff ects of the Scramble for Africa , 2012 .

[13]  Fernando Nogueira,et al.  Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning , 2016, J. Mach. Learn. Res..

[14]  J. Kleinberg,et al.  Prediction Policy Problems. , 2015, The American economic review.

[15]  Leonard Dudley,et al.  Innovation growth clusters: Lessons from the industrial revolution , 2018 .

[16]  Thomas Chadefaux,et al.  Conflict Forecasting and Its Limits , 2017, Data Sci..

[17]  Mathias Thoenig,et al.  This Mine is Mine! How Minerals Fuel Conflicts in Africa , 2014, SSRN Electronic Journal.

[18]  Weisi Guo,et al.  Retool AI to forecast and limit wars , 2018, Nature.

[19]  Hassan Benchekroun,et al.  Mergers in Nonrenewable Resource Oligopolies and Environmental Policies , 2018, European Economic Review.

[20]  Sean Horan,et al.  Precision may harm: The comparative statics of imprecise judgement , 2018 .

[21]  A. Timmermann Forecast Combinations , 2005 .

[22]  Sean Horan,et al.  Random consideration and choice: A case study of "default" options , 2019, Math. Soc. Sci..

[23]  Nicholas Sambanis,et al.  What Is A Civil War , 2004 .

[24]  Joshua Lewis,et al.  The impact of uncertainty in agriculture , 2018 .

[25]  Debraj Ray,et al.  Ethnicity and Conflict: An Empirical Study , 2012 .

[26]  Sven Ove Hansson,et al.  Measuring Uncertainty , 2009, Stud Logica.

[27]  J. Stock,et al.  Forecasting with Many Predictors , 2006 .

[28]  Nicholas Bloom,et al.  Policy News and Stock Market Volatility , 2019, SSRN Electronic Journal.

[29]  Christopher Blattman,et al.  The Promise and Pitfalls of Conflict Prediction: Evidence from Colombia and Indonesia , 2019, Review of Economics and Statistics.

[30]  Yves Sprumont,et al.  Belief-weighted Nash aggregation of Savage preferences , 2018, J. Econ. Theory.

[31]  J. Goldstone,et al.  A Global Model for Forecasting Political Instability , 2010 .

[32]  S. Davis,et al.  Measuring Economic Policy Uncertainty , 2013 .

[33]  Michael McMahon,et al.  Transparency and deliberation within the FOMC: a computational linguistics approach , 2014 .

[34]  Nicholas Sambanis,et al.  Understanding Civil War , 2002 .

[35]  Mathieu Marcoux Sharp test for equilibrium uniqueness in discrete games with a flexible information structure , 2018 .

[36]  Johannes Hoelzemann,et al.  Bandits in the lab , 2018, Quantitative Economics.

[37]  Marion Couldrey,et al.  States of fragility , 2013 .

[38]  Ck Cheng,et al.  The Age of Big Data , 2015 .

[39]  Sendhil Mullainathan,et al.  Machine Learning: An Applied Econometric Approach , 2017, Journal of Economic Perspectives.

[40]  Juan F. Vargas,et al.  Commodity Price Shocks and Civil Conflict: Evidence from Colombia , 2013 .

[41]  Joel David,et al.  Firm Performance and Macro Forecast Accuracy , 2018, Journal of Monetary Economics.

[42]  A. Timmermann,et al.  Economic Forecasting , 2007 .

[43]  Hannes Felix Mueller,et al.  Pathways for peace - inclusive approaches to preventing violent conflict , 2017 .

[44]  D. Donaldson,et al.  Evolving Comparative Advantage and the Impact of Climate Change in Agricultural Markets: Evidence from 1.7 Million Fields around the World , 2014, Journal of Political Economy.

[45]  Susan Athey,et al.  Machine Learning Methods That Economists Should Know About , 2019, Annual Review of Economics.

[46]  Kristin M. Bakke,et al.  The perils of policy by p-value: Predicting civil conflicts , 2010 .

[47]  Christopher Blattman,et al.  Economic Shocks and Conflict: Evidence from Commodity Prices , 2013 .

[48]  Nicholas Sambanis,et al.  Forecasting Civil Wars: Theory and Structure in an Age of “Big Data” and Machine Learning , 2020, Journal of Conflict Resolution.

[49]  Nicolas Klein,et al.  Strategic Investment and Learning with Private Information , 2018, J. Econ. Theory.

[50]  M. Burke,et al.  Climate and Conflict , 2014 .

[51]  Leif Anders Thorsrud,et al.  The value of news for economic developments , 2019, Journal of Econometrics.

[52]  Matt Taddy,et al.  Text As Data , 2017, Journal of Economic Literature.

[53]  Benson Mwangi,et al.  A Review of Feature Reduction Techniques in Neuroimaging , 2013, Neuroinformatics.

[54]  Raphael Godefroy,et al.  The Birth of the Congressional Clinic , 2011 .

[55]  Mathieu Marcoux,et al.  Games with unobservable heterogeneity and multiple equilibria: An application to mobile telecommunications , 2019 .

[56]  Stefano Giglio,et al.  Systemic Risk and the Macroeconomy: An Empirical Evaluation , 2015 .

[57]  Jingrui He,et al.  Comparing Random Forest with Logistic Regression for Predicting Class-Imbalanced Civil War Onset Data , 2016, Political Analysis.

[58]  Nils W. Metternich,et al.  Introduction: Forecasting in peace research , 2017 .

[59]  Daniel Leigh,et al.  Growth Forecast Errors and Fiscal Multipliers , 2013, SSRN Electronic Journal.

[60]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[61]  Marcus H. Böhme,et al.  Searching for a better life: Predicting international migration with online search keywords , 2020, Journal of Development Economics.

[62]  Lars E. O. Svensson,et al.  Cost-Benefit Analysis of Leaning Against the Wind , 2016 .

[63]  Hafedh Bouakez,et al.  Sectoral Heterogeneity, Production Networks, and the Effects of Government Spending , 2018 .

[64]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[65]  D. Fullerton,et al.  Environmental Economics: The 50th Anniversary of the Birth of this Field around the First Earth Day , 2017 .

[66]  Gabriel Cadamuro,et al.  Predicting poverty and wealth from mobile phone metadata , 2015, Science.

[67]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[68]  Rohan Dutta,et al.  Gradual Pairwise Comparison and Stochastic Choice , 2019, Theoretical Economics.

[69]  Christian Rauh,et al.  Measuring uncertainty at the regional level using newspaper text , 2019 .

[70]  Torsten Persson,et al.  The Logic of Political Violence , 2011 .