The spatial allocation of population: a review of large-scale gridded population data products and their fitness for use

Abstract. Population data are an essential component of studies focusing on human–nature interrelationships, disaster risk assessment and environmental health. Several recent efforts have produced global- and continental-extent gridded population data, which are becoming increasingly popular among various research communities. However, these data products differ considerably in their characteristics and rest on different modeling assumptions, and they have never been systematically reviewed and compared, which may impede their appropriate use. This article fills this gap by presenting, comparing and discussing a set of large-scale (global and continental) gridded datasets representing population counts or densities. It focuses on the data properties, methodological approaches and relative quality aspects that users need to understand in order to judge the data against their intended uses. Written by data producers and members of the user community through the lens of the “fitness for use” concept, this paper aims to give potential data users the knowledge base needed to critically assess the available data products and to make informed decisions about their appropriateness for a target application.
