Allocating people to pixels: A review of large-scale gridded population data products and their fitness for use

Abstract. Population data represent an essential component in studies focusing on human-nature interrelationships, disaster risk assessment and environmental health. Several recent efforts have produced global and continental-extent gridded population data which are becoming increasingly popular among various research communities. However, these data products, which are of very different characteristics and based on different modeling assumptions, have never been systematically reviewed and compared which may impede their appropriate use. This article fills this gap and presents, compares and discusses a set of large-scale (global and continental) gridded datasets representing population counts or densities. It focuses on data properties, methodological approaches and relative quality aspects that are important to fully understand the characteristics of the data with regard to the intended uses. Written by the data producers and members of the user community, through the lens of the “fitness for use” concept, the aim of this paper is to provide potential data users with the knowledge base needed to make informed decisions about the appropriateness of the data products available in relation to the target application and for critical analysis.

[1]  Catherine Linard,et al.  High-resolution gridded population datasets for Latin America and the Caribbean in 2010, 2015, and 2020 , 2015, Scientific Data.

[2]  Jeremy Mennis,et al.  Intelligent Dasymetric Mapping and Its Application to Areal Interpolation , 2006 .

[3]  A. Tatem,et al.  Commentary: Containing the Ebola Outbreak - the Potential and Challenge of Mobile Network Data , 2014, PLoS currents.

[4]  Pesaresi Martino,et al.  Development of new open and free multi-temporal global population grids at 250 m resolution , 2016 .

[5]  P. Diggle Applied Spatial Statistics for Public Health Data , 2005 .

[6]  W. Nordhaus Geography and macroeconomics: new data and new findings. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Catherine Linard,et al.  Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data , 2015, PloS one.

[8]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[9]  Robert Jeansoulin,et al.  Towards spatial data quality information analysis tools for experts assessing the fitness for use of spatial data , 2007, Int. J. Geogr. Inf. Sci..

[10]  Forrest R. Stevens,et al.  Gridded Population Maps Informed by Different Built Settlement Products , 2018, Data.

[11]  B. Anderson,et al.  The rising tide: assessing the risks of climate change and human settlements in low elevation coastal zones , 2007 .

[12]  Alex de Sherbinin,et al.  Groundswell : Preparing for Internal Climate Migration , 2018 .

[13]  J. Wickham,et al.  Accuracy assessment of NLCD 2006 land cover and impervious surface , 2013 .

[14]  Andrew J. Tatem,et al.  WorldPop, open data for spatial demography , 2017, Scientific Data.

[15]  Alessandro Sorichetta,et al.  Mapping internal connectivity through human migration in malaria endemic countries , 2016, Scientific Data.

[16]  Martin Charlton,et al.  The Geography of Parameter Space: An Investigation of Spatial Non-Stationarity , 1996, Int. J. Geogr. Inf. Sci..

[17]  Pierre Soille,et al.  A New European Settlement Map From Optical Remotely Sensed Data , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[18]  Forrest R. Stevens,et al.  Assessing the spatial sensitivity of a random forest model: Application in gridded population modeling , 2019, Comput. Environ. Urban Syst..

[19]  J. E. Dobson,et al.  LandScan: Locating People is What Matters , 2002 .

[20]  Jordan Graesser,et al.  Generation of fine-scale population layers using multi-resolution satellite imagery and geospatial data , 2013 .

[21]  Xianming Liu,et al.  Mapping the world population one building at a time , 2017, ArXiv.

[22]  Jack T. Harvey,et al.  Estimating census district populations from satellite imagery: Some approaches and limitations , 2002 .

[23]  Brian Blankespoor,et al.  Mangroves as a protection from storm surges in a changing climate , 2016, Ambio.

[24]  Stefan Leyk,et al.  Descriptor : HISDAC-US , historical settlement data compilation for the conterminous United States over 200 years , 2018 .

[25]  A. Tatem,et al.  The effects of spatial population dataset choice on estimates of population at risk of disease , 2011, Population health metrics.

[26]  Warren C. Jochem,et al.  Spatially disaggregated population estimates in the absence of national population and housing census data , 2018, Proceedings of the National Academy of Sciences.

[27]  David G Steel,et al.  Exploring a relationship between aggregate and individual levels spatial data through semivariogram models , 2006 .

[28]  T. Esch,et al.  Breaking new ground in mapping human settlements from space – The Global Urban Footprint , 2017, 1706.04862.

[29]  C. Elvidge,et al.  Mapping City Lights With Nighttime Data from the DMSP Operational Linescan System , 1997 .

[30]  Robert S. Chen,et al.  The Integration of Remote Sensing and Socioeconomic Data: Lessons from the Socioeconomic Data and Applications Center , 2014 .

[31]  David W. S. Wong,et al.  The Reliability of Using the Iterative Proportional Fitting Procedure , 1992 .

[32]  S. Piantadosi,et al.  The ecological fallacy. , 1988, American journal of epidemiology.

[33]  S. Guptill,et al.  Elements of Spatial Data Quality , 1995 .

[34]  J. Lerner,et al.  Three‐dimensional model synthesis of the global methane cycle , 1991 .

[35]  Eric M. Fèvre,et al.  Risk for Human African Trypanosomiasis, Central Africa, 2000–2009 , 2011, Emerging infectious diseases.

[36]  Steffen Fritz,et al.  Harnessing the power of volunteers, the internet and Google Earth to collect and validate global spatial information using Geo-Wiki , 2015 .

[37]  J. Mennis Generating Surface Models of Population Using Dasymetric Mapping , 2003, The Professional Geographer.

[38]  E. Stehfest,et al.  Anthropogenic land use estimates for the Holocene – HYDE 3.2 , 2016 .

[39]  D. R. Montello Scale in Geography , 2001 .

[40]  Andrea Taramelli,et al.  Modelling risk hurricane elements in potentially affected areas by a GIS system , 2010 .

[41]  Andrew J Tatem,et al.  The global distribution and population at risk of malaria: past, present, and future. , 2004, The Lancet. Infectious diseases.

[42]  Karsten Steinhaeuser,et al.  Estimating future global per capita water availability based on changes in climate and population , 2012, Comput. Geosci..

[43]  Christoph Aubrecht,et al.  Developing an adaptive global exposure model to support the generation of country : disaster risk profiles , 2015 .

[44]  Pinki Mondal,et al.  Uncertainties in Measuring Populations Potentially Impacted by Sea Level Rise and Coastal Flooding , 2012, PloS one.

[45]  Kytt MacManus,et al.  Taking Advantage of the Improved Availability of Census Data: A First Look at the Gridded Population of the World, Version 4 , 2015 .

[46]  Michael E. Bufalino,et al.  Street-Weighted Interpolation Techniques for Demographic Count Estimation in Incompatible Zone Systems , 2005 .

[47]  Budhendra L. Bhaduri,et al.  Census-independent population mapping in northern Nigeria☆ , 2018, Remote sensing of environment.

[48]  Giuseppe Arbia,et al.  Effects of MAUP on spatial econometric models , 2011 .

[49]  Catherine Linard,et al.  Examining the correlates and drivers of human population distributions across low- and middle-income countries , 2017, Journal of The Royal Society Interface.

[50]  P. Shrestha,et al.  Understanding the Impacts of Climate Change , 2014 .

[51]  Arnold Bregt,et al.  Assessing fitness for use: the expected value of spatial data sets , 2001, Int. J. Geogr. Inf. Sci..

[52]  Alfred Stein,et al.  Thirty Years of Research on Spatial Data Quality: Achievements, Failures, and Opportunities , 2010, Trans. GIS.

[53]  Catherine Linard,et al.  Spatiotemporal patterns of population in mainland China, 1990 to 2010 , 2016, Scientific Data.

[54]  Catherine Linard,et al.  Large-scale spatial population databases in infectious disease research , 2012, International Journal of Health Geographics.

[55]  Stan Openshaw,et al.  Modifiable Areal Unit Problem , 2008, Encyclopedia of GIS.

[56]  Andrew J. Tatem,et al.  Mapping the denominator: spatial demography in the measurement of progress. , 2014, International health.

[57]  Michael F. Goodchild,et al.  Areal interpolation: A variant of the traditional spatial problem , 1980 .

[58]  Sérgio Freire,et al.  The global human settlement layer from landsat imagery , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[59]  Deborah Balk,et al.  The Distribution of People and the Dimension of Place: Methodologies to Improve the Global Estimation of Urban Extents , 2004 .

[60]  Halkia Stamatia,et al.  GHSL application in Europe: Towards new population grids , 2014 .

[61]  Jane Mills,et al.  Enhanced data and methods for improving open and free global population grids: putting ‘leaving no one behind’ into practice , 2018, Int. J. Digit. Earth.

[62]  S. Dasgupta,et al.  Exposure of developing countries to sea-level rise and storm surges , 2011 .

[63]  Benjamin Semenov-Tian-Shansky Russia: Territory and Population: A Perspective on the 1926 Census , 1928 .

[64]  Barbara P. Buttenfield,et al.  Modeling residential developed land in rural areas: A size-restricted approach using parcel data , 2014 .

[65]  Catherine Linard,et al.  Exploring nationally and regionally defined models for large area population mapping , 2015, Int. J. Digit. Earth.

[66]  Waldo R. Tobler,et al.  The Global Demography Project , 1995 .

[67]  C. Elvidge,et al.  VIIRS night-time lights , 2017, Remote Sensing of Night-time Light.

[68]  Naota Hanasaki,et al.  A grid-based assessment of global water scarcity including virtual water trading , 2006 .

[69]  Brian C. O'Neill,et al.  Spatially explicit global population scenarios consistent with the Shared Socioeconomic Pathways , 2016 .

[70]  David G Steel,et al.  Rules for Random Aggregation , 1996 .

[71]  Martino Pesaresi,et al.  Principles and Applications of the Global Human Settlement Layer as Baseline for the Land Use Efficiency Indicator - SDG 11.3.1 , 2018, ISPRS Int. J. Geo Inf..

[72]  Alessandro Sorichetta,et al.  High resolution global gridded data for use in population studies , 2017, Scientific Data.

[73]  D. Civco,et al.  Mapping urban areas on a global scale: which of the eight maps now available is more accurate? , 2009 .

[74]  C. Elvidge,et al.  Spatial analysis of global urban extent from DMSP-OLS night lights , 2005 .

[75]  A. Tatem,et al.  High Resolution Population Distribution Maps for Southeast Asia in 2010 and 2015 , 2013, PloS one.

[76]  Xiaomin Qiu,et al.  Population Estimation Methods in GIS and Remote Sensing: A Review , 2005 .

[77]  Robert G. Cromley,et al.  Singly‐ and Doubly‐Constrained Methods of Areal Interpolation for Vector‐based GIS , 1999, Trans. GIS.

[78]  J E Potter,et al.  The completeness of enumeration in the 1973 census of the population of Colombia. , 1976, Population index.

[79]  Kate E. Jones,et al.  Global trends in emerging infectious diseases , 2008, Nature.

[80]  Stefan Leyk,et al.  Assessing the Accuracy of Multi-Temporal Built-Up Land Layers across Rural-Urban Trajectories in the United States. , 2018, Remote sensing of environment.

[81]  Kees Klein Goldewijk,et al.  Long-term dynamic modeling of global population and built-up area in a spatially explicit way: HYDE 3.1 , 2010 .

[82]  J. Jenness Calculating landscape surface area from digital elevation models , 2004 .

[83]  I. Ivánová,et al.  Searching for spatial data resources by fitness for use , 2011 .

[84]  Christoph Aubrecht,et al.  Consistent yet adaptive global geospatial identification of urban–rural patterns: The iURBAN model , 2016 .

[85]  Huadong Guo,et al.  A Global Human Settlement Layer From Optical HR/VHR RS Data: Concept and First Results , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[86]  Paul A. Zandbergen,et al.  Comparison of Dasymetric Mapping Techniques for Small-Area Population Estimates , 2010 .

[87]  Barbara P. Buttenfield,et al.  Dasymetric Modeling and Uncertainty , 2014, Annals of the Association of American Geographers. Association of American Geographers.

[88]  Barbara P. Buttenfield,et al.  Maximum Entropy Dasymetric Modeling for Demographic Small Area Estimation , 2013 .

[89]  Martino Pesaresi,et al.  Built-up area and population density: Two Essential Societal Variables to address climate hazard impact , 2018, Environmental science & policy.

[90]  J. E. Dobson,et al.  LandScan: A Global Population Database for Estimating Populations at Risk , 2000 .

[91]  Kees Klein Goldewijk,et al.  The HYDE 3.1 spatially explicit database of human‐induced global land‐use change over the past 12,000 years , 2011 .

[92]  J. Koch,et al.  Modeling the impacts of grazing land management on land-use change for the Jordan River region , 2008 .

[93]  Martino Pesaresi,et al.  A new map of the European settlements by automatic classification of 2.5m resolution SPOT data , 2014, 2014 IEEE Geoscience and Remote Sensing Symposium.

[94]  John K. Wright A Method of Mapping Densities of Population: With Cape Cod as an Example , 1936 .

[95]  G. Hunter,et al.  A Risk-Based Approach to Assessing the ‘ Fitness for Use ’ of Spatial Data , 1999 .

[96]  David W. S. Wong The Modifiable Areal Unit Problem (MAUP) , 2004 .

[97]  U. Deichmann,et al.  World population in a grid of spherical quadrilaterals. , 1997, International journal of population geography : IJPG.

[98]  Forrest R. Stevens,et al.  GridSample: an R package to generate household survey primary sampling units (PSUs) from gridded population data , 2017, International Journal of Health Geographics.

[99]  C. Revenga,et al.  Urban growth, climate change, and freshwater availability , 2011, Proceedings of the National Academy of Sciences.

[100]  S. Freire,et al.  Analysing spatiotemporal patterns of tourism in Europe at high-resolution with conventional and big data sources , 2018, Tourism Management.

[101]  Stefan Leyk,et al.  Understanding the Combined Impacts of Aggregation and Spatial Non‐Stationarity: The Case of Migration‐Environment Associations in Rural South Africa , 2015, Trans. GIS.