Explaining Rare Events in International Relations

Some of the most important phenomena in international conflict are coded as “rare events”: binary dependent variables with dozens to thousands of times fewer events, such as wars and coups, than “nonevents.” Unfortunately, rare events data are difficult to explain and predict, a problem stemming from at least two sources. First, and most important, the data-collection strategies used in international conflict studies are grossly inefficient. The fear of collecting data with too few events has led to data collections with huge numbers of observations but relatively few, and poorly measured, explanatory variables. As it turns out, more efficient sampling designs exist for making valid inferences, such as sampling all available events (wars, for example) and a tiny fraction of nonevents (peace). This enables scholars to save as much as 99 percent of their (nonfixed) data-collection costs or to collect much more meaningful explanatory variables. Second, logistic regression, and other commonly used statistical procedures, can underestimate the probability of rare events. We introduce some corrections that outperform existing methods and change the estimates of absolute and relative risks by as much as some estimated effects reported in the literature. We also provide easy-to-use methods and software that link these two results, enabling both types of corrections to work simultaneously.

[1]  James N. Rosenau,et al.  In search of global patterns , 1976 .

[2]  Steven R. Lerman,et al.  The Estimation of Choice Probabilities from Choice Based Samples , 1977 .

[3]  Richard Tucker,et al.  BTSCS: A BINARY TIME-SERIES{CROSS{SECTION DATA ANALYSIS UTILITY , 1999 .

[4]  Guido W. Imbens,et al.  An efficient method of moments estimator for discrete choice models with choice-based sampling , 1992 .

[5]  D. Rubin,et al.  Causal Inference in Retrospective Studies , 1987 .

[6]  Zeev Maoz,et al.  Normative and Structural Causes of Democratic Peace, 1946–1986 , 1993, American Political Science Review.

[7]  Curtis S. Signorino,et al.  Tau-b or Not Tau-b: Measuring the Similarity of Foreign Policy Positions , 1999 .

[8]  Bruce Bueno de Mesquita,et al.  The War Trap , 1981 .

[9]  Yoshua Bengio,et al.  Pattern Recognition and Neural Networks , 1995 .

[10]  Curtis S. Signorino Strategic Interaction and the Statistical Analysis of International Conflict , 1999, American Political Science Review.

[11]  Daniel C. Esty,et al.  State Failure Task Force Report: Phase II Findings , 1999 .

[12]  Zeev Maoz,et al.  NORMATIVE AND STRUCTURAL CAUSES OF DEMOCRATIC PEACE , 1993 .

[13]  Charles F. Manski Nonlinear statistical modeling: Nonparametric identification under response-based sampling , 2001 .

[14]  Robert O. Keohane,et al.  Designing Social Inquiry: Scientific Inference in Qualitative Research. , 1995 .

[15]  John A. Vasquez The War Puzzle: PRELIMINARIES , 1993 .

[16]  M. Plummer,et al.  International agency for research on cancer. , 2020, Archives of pathology.

[17]  Paul K. Huth Extended Deterrence and the Outbreak of War , 1988, American Political Science Review.

[18]  Gary King,et al.  Estimating risk and rate levels, ratios and differences in case‐control studies , 2002, Statistics in medicine.

[19]  J. David Singer,et al.  Nations at War: A Scientific Study of International Conflict , 1998 .

[20]  Gary King,et al.  Improving Quantitative Studies of International Conflict: A Conjecture , 2000, American Political Science Review.

[21]  R. Schaefer Bias correction in maximum likelihood logistic regression. , 1985, Statistics in medicine.

[22]  R. Pyke,et al.  Logistic disease incidence models and case-control studies , 1979 .

[23]  J. Davies,et al.  Preventive measures : building risk assessment and crisis early warning systems , 1998 .

[24]  N. Nagelkerke,et al.  Logistic regression in case-control studies: the effect of using independent as dependent variables. , 1995, Statistics in medicine.

[25]  J. Goldstone,et al.  The State Failure Project: Early Warning Research for US Foreign Policy Planning , 1998 .

[26]  Jason Wittenberg,et al.  Making the Most Of Statistical Analyses: Improving Interpretation and Presentation , 2000 .

[27]  N. Breslow,et al.  Statistics in Epidemiology : The Case-Control Study , 2008 .

[28]  Gary King,et al.  Improving Forecasts of State Failure , 2001 .

[29]  Philip E. Tetlock,et al.  Behavior, society, and nuclear war , 1993 .

[30]  Charles F. Manski,et al.  Estimation of Response Probabilities From Augmented Retrospective Observations , 1985 .

[31]  Wagner A. Kamakura,et al.  Book Review: Structural Analysis of Discrete Data with Econometric Applications , 1982 .

[32]  G. Imbens,et al.  Case-control studies with contaminated controls☆ , 1996 .

[33]  B. B. D. Mesquita,et al.  War and reason : domestic and international imperatives , 1992 .

[34]  N. E. Breslow Statistical Methods in Cancer Research , 1986 .