Reducing Overconfidence in the Interval Judgments of Experts

Elicitation of expert opinion is important for risk analysis when only limited data are available. Expert opinion is often elicited in the form of subjective confidence intervals; however, these are prone to substantial overconfidence. We investigated the influence of elicitation question format, in particular the number of steps in the elicitation procedure. In a 3-point elicitation procedure, an expert is asked for a lower limit, upper limit, and best guess, the two limits creating an interval of some assigned confidence level (e.g., 80%). In our 4-step interval elicitation procedure, experts were also asked for a realistic lower limit, upper limit, and best guess, but no confidence level was assigned; the fourth step was to rate their anticipated confidence in the interval produced. In our three studies, experts made interval predictions of rates of infectious diseases (Study 1, n = 21 and Study 2, n = 24: epidemiologists and public health experts), or marine invertebrate populations (Study 3, n = 34: ecologists and biologists). We combined the results from our studies using meta-analysis, which found average overconfidence of 11.9%, 95% CI [3.5, 20.3] (a hit rate of 68.1% for 80% intervals)-a substantial decrease in overconfidence compared with previous studies. Studies 2 and 3 suggest that the 4-step procedure is more likely to reduce overconfidence than the 3-point procedure (Cohen's d = 0.61, [0.04, 1.18]).

[1]  Roger E. Kirk,et al.  Experimental design: Procedures for the behavioral sciences (3rd ed.). , 1995 .

[2]  Baruch Fischhoff,et al.  Calibration of Probabilities: The State of the Art , 1977 .

[3]  Ilan Yaniv,et al.  Overconfidence in interval estimates: What does expertise buy you? , 2008 .

[4]  Yaacov Schul,et al.  Elimination and inclusion procedures in judgment. , 1997 .

[5]  S. Lichtenstein,et al.  Do those who know more also know more about how much they know?*1 , 1977 .

[6]  S. Oskamp OVERCONFIDENCE IN CASE-STUDY JUDGMENTS. , 1965, Journal of consulting psychology.

[7]  Dean P. Foster,et al.  Graininess of judgment under uncertainty: An accuracy-informativeness trade-off , 1995 .

[8]  S. Plous The psychology of judgment and decision making , 1994 .

[9]  Brad M. Barber,et al.  Boys Will Be Boys: Gender, Overconfidence, and Common Stock Investment , 1998 .

[10]  G. Cumming,et al.  Inference by eye: confidence intervals and how to read pictures of data. , 2005, The American psychologist.

[11]  P. Tetlock Expert Political Judgment: How Good Is It? How Can We Know? , 2005 .

[12]  B. Marx The Visual Display of Quantitative Information , 1985 .

[13]  Reid Hastie,et al.  Effects of Amount of Information on Judgment Accuracy and Confidence , 2008 .

[14]  R. Kirk Experimental Design: Procedures for the Behavioral Sciences , 1970 .

[15]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[16]  R. Dawes,et al.  House of Cards: Psychology and Psychotherapy Built on Myth. , 1995 .

[17]  M. G. Morgan,et al.  Subjective judgments by climate experts. , 1995 .

[18]  Dilek Önkal,et al.  Effects of task format on probabilistic forecasting of stock prices , 1996 .

[19]  B. Fischhoff,et al.  Calibration of probabilities: the state of the art to 1980 , 1982 .

[20]  Victoria Martínez,et al.  Introduction to Meta-Analysis , 2002 .

[21]  Magne Jørgensen,et al.  When 90% confidence intervals are 50% certain: on the credibility of credible intervals , 2005 .

[22]  W. Hays Experimental Design: Procedures for the Behavioral Sciences. 2nd ed. , 1983 .

[23]  E. Johnston,et al.  Pollution reduces native diversity and increases invader dominance in marine hard‐substrate communities , 2007 .

[24]  Joshua Klayman,et al.  Overconfidence in interval estimates. , 2004, Journal of experimental psychology. Learning, memory, and cognition.