Uncertainty quantification techniques for population density estimates derived from sparse open source data

The Population Density Tables (PDT) project at Oak Ridge National Laboratory (www.ornl.gov) is developing population density estimates for specific human activities under normal patterns of life based largely on information available in open source. Currently, activity-based density estimates are based on simple summary data statistics such as range and mean. Researchers are interested in improving activity estimation and uncertainty quantification by adopting a Bayesian framework that considers both data and sociocultural knowledge. Under a Bayesian approach, knowledge about population density may be encoded through the process of expert elicitation. Due to the scale of the PDT effort which considers over 250 countries, spans 50 human activity categories, and includes numerous contributors, an elicitation tool is required that can be operationalized within an enterprise data collection and reporting system. Such a method would ideally require that the contributor have minimal statistical knowledge, require minimal input by a statistician or facilitator, consider human difficulties in expressing qualitative knowledge in a quantitative setting, and provide methods by which the contributor can appraise whether their understanding and associated uncertainty was well captured. This paper introduces an algorithm that transforms answers to simple, non-statistical questions into a bivariate Gaussian distribution as the prior for the Beta distribution. Based on geometric properties of the Beta distribution parameter feasibility space and the bivariate Gaussian distribution, an automated method for encoding is developed that responds to these challenging enterprise requirements. Though created within the context of population density, this approach may be applicable to a wide array of problem domains requiring informative priors for the Beta distribution.

[1]  A. O'Hagan,et al.  Statistical Methods for Eliciting Probability Distributions , 2005 .

[2]  A. Tversky,et al.  Judgment under Uncertainty: Heuristics and Biases , 1974, Science.

[3]  B. Bhaduri,et al.  LandScan USA: a high-resolution geospatial and temporal modeling approach for population distribution and dynamics , 2007 .

[4]  David Johnson,et al.  The triangular distribution as a proxy for the beta distribution in risk analysis , 1997 .

[5]  Paul A. Zandbergen,et al.  Comparison of Dasymetric Mapping Techniques for Small-Area Population Estimates , 2010 .

[6]  Jeremy E. Oakley,et al.  Probability is perfect, but we can't elicit it perfectly , 2004, Reliab. Eng. Syst. Saf..

[7]  D. G. Altman,et al.  Plotting Probability Ellipses , 1978 .

[8]  P. Pollard,et al.  Intuitive judgments of proportions, means, and variances: A review , 1975 .

[9]  L. Beach,et al.  Intuitive estimation of means , 1966 .

[10]  S. T. Bucklanda,et al.  State-space models for the dynamics of wild animal populations , 2003 .

[11]  Sarah J. Brinegar,et al.  A Comparative Analysis of Small Area Population Estimation Methods , 2010 .

[12]  Paul J. Zsombor-Murray,et al.  Largest Ellipse Inscribing an Arbitrary Polygon , 2003 .

[13]  A. Tversky,et al.  On the psychology of prediction , 1973 .

[14]  W. M. Bolstad Introduction to Bayesian Statistics , 2004 .

[15]  A. Mitchell The ESRI guide to GIS analysis , 1999 .

[16]  C. Peterson,et al.  MODE, MEDIAN, AND MEAN AS OPTIMAL STRATEGIES. , 1964, Journal of experimental psychology.

[17]  David O'Sullivan,et al.  Geographic Information Analysis , 2002 .

[18]  Robin M. Hogarth,et al.  Cognitive Processes and the Assessment of Subjective Probability Distributions , 1975 .

[19]  Samuel Kotz,et al.  Beyond Beta: Other Continuous Families Of Distributions With Bounded Support And Applications , 2004 .

[20]  Lawrence D. Phillips,et al.  Group Elicitation of Probability Distributions: Are Many Heads Better Than One? , 1999 .

[21]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[22]  Joseph S. B. Mitchell,et al.  Finding large sticks and potatoes in polygons , 2006, SODA '06.