Fine-resolution population mapping using OpenStreetMap points-of-interest

Building-level population data are required for many purposes, but government population data are aggregated to protect privacy. Population estimates at finer scales can be obtained through areal interpolation, in which data are transferred from one system of spatial units (the source zones) to another (the target zones). Areal interpolation can draw on ancillary data to guide the redistribution of population. For building-level estimation, common ancillary data include three-dimensional building data, which are obtained through costly processes such as LiDAR. Meanwhile, volunteered geographic information (VGI) is emerging as a new category of data and is already used in urban management. This paper presents an alternative approach to building-level areal interpolation that uses VGI as ancillary data. The proposed method integrates two existing interpolation techniques, multi-class dasymetric mapping and interpolation by surface volume integration, and uses building footprints and points-of-interest (POIs) extracted from OpenStreetMap (OSM) to refine population estimates at the building level. A case study was conducted for the city of Hamburg, and the results obtained with different types of POIs were compared. The results suggest that VGI can be used to estimate population distribution accurately, but that further research is needed to understand how POIs reveal population distribution patterns.
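The general idea of redistributing an aggregated count to buildings with the help of ancillary weights can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `disaggregate`, the building classes, and the class weights are hypothetical, and the weighting rule (footprint area times a class weight, standing in for what POIs might tell us about a building's use) is an assumption chosen for illustration. The sketch only shows the volume-preserving property that dasymetric methods share: the building estimates in each source zone sum to that zone's known population.

```python
from collections import defaultdict


def disaggregate(zone_pop, buildings, class_weight):
    """Split each zone's population across its buildings.

    zone_pop:     {zone_id: population count} for the source zones
    buildings:    list of dicts with keys "id", "zone", "area", "cls"
    class_weight: {cls: relative residential weight}; hypothetical values,
                  e.g. derived from POIs found inside a building
    Returns {building_id: estimated population}.
    """
    # Total weight per zone, used to normalise the building shares.
    totals = defaultdict(float)
    for b in buildings:
        totals[b["zone"]] += b["area"] * class_weight.get(b["cls"], 0.0)

    estimates = {}
    for b in buildings:
        weight = b["area"] * class_weight.get(b["cls"], 0.0)
        total = totals[b["zone"]]
        # A zone whose buildings all have zero weight receives no allocation.
        share = weight / total if total > 0 else 0.0
        estimates[b["id"]] = zone_pop[b["zone"]] * share
    return estimates


# Illustrative data only: one zone of 100 people and three buildings.
example_buildings = [
    {"id": "b1", "zone": "A", "area": 100.0, "cls": "residential"},
    {"id": "b2", "zone": "A", "area": 300.0, "cls": "mixed"},
    {"id": "b3", "zone": "A", "area": 200.0, "cls": "commercial"},
]
example_weights = {"residential": 1.0, "mixed": 0.5, "commercial": 0.0}
example_result = disaggregate({"A": 100}, example_buildings, example_weights)
```

In this sketch a building classed as commercial receives no residents regardless of its size, which is the kind of refinement that footprint area alone cannot provide.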
