The Good Judgment Team led by psychologists P. Tetlock and B. Mellers of the University of Pennsylvania was the most successful of five research projects sponsored through 2015 by the Intelligence Advanced Research Projects Activity to develop improved group forecast aggregation algorithms. Each team had at least 10 algorithms under continuous development and evaluation over the 4†year project. The mean Brier score was used to rank the algorithms on approximately 130 questions concerning categorical geopolitical events each year. An algorithm would return aggregate probabilities for each question based on the probabilities provided per question by thousands of individuals, who had been recruited by the Good Judgment Team. This paper summarizes the theorized basis and implementation of one of the two most accurate algorithms at the conclusion of the Good Judgment Project. The algorithm incorporated a number of pre†and postprocessing steps, and relied upon a minimum distance robust regression method called L2E. The algorithm was just edged out by a variation of logistic regression, which has been described elsewhere. Work since the official conclusion of the project has led to an even smaller gap.
[1]
C. D. Gelatt,et al.
Optimization by Simulated Annealing
,
1983,
Science.
[2]
David W. Scott,et al.
Partial Mixture Estimation and Outlier Detection in Data and Regression
,
2004
.
[3]
F. J. Anscombe,et al.
THE TRANSFORMATION OF POISSON, BINOMIAL AND NEGATIVE-BINOMIAL DATA
,
1948
.
[4]
Lyle H. Ungar,et al.
Modeling Probability Forecasts via Information Diversity
,
2014
.
[5]
David W. Scott,et al.
Parametric Statistical Modeling by Minimum Integrated Square Error
,
2001,
Technometrics.