Mapping fine-scale population distributions at the building level by integrating multisource geospatial big data

ABSTRACT Fine-scale population distribution data at the building level play an essential role in numerous fields, for example urban planning and disaster prevention. The rapid technological development of remote sensing (RS) and geographical information system (GIS) in recent decades has benefited numerous population distribution mapping studies. However, most of these studies focused on global population and environmental changes; few considered fine-scale population mapping at the local scale, largely because of a lack of reliable data and models. As geospatial big data booms, Internet-collected volunteered geographic information (VGI) can now be used to solve this problem. This article establishes a novel framework to map urban population distributions at the building scale by integrating multisource geospatial big data, which is essential for the fine-scale mapping of population distributions. First, Baidu points-of-interest (POIs) and real-time Tencent user densities (RTUD) are analyzed by using a random forest algorithm to down-scale the street-level population distribution to the grid level. Then, we design an effective iterative building-population gravity model to map population distributions at the building level. Meanwhile, we introduce a densely inhabited index (DII), generated by the proposed gravity model, which can be used to estimate the degree of residential crowding. According to a comparison with official community-level census data and the results of previous population mapping methods, our method exhibits the best accuracy (Pearson R = .8615, RMSE = 663.3250, p < .0001). The produced fine-scale population map can offer a more thorough understanding of inner city population distributions, which can thus help policy makers optimize the allocation of resources.

[1]  Cynthia A. Brewer,et al.  Dasymetric Mapping and Areal Interpolation: Implementation and Evaluation , 2001 .

[2]  Laurence J. C. Ma,et al.  China's Urban Population Statistics: A Critical Evaluation , 2005 .

[3]  Xiaoping Liu,et al.  Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model , 2017, Int. J. Geogr. Inf. Sci..

[4]  Yuji Murayama,et al.  A GIS Approach to Estimation of Building Population for Micro‐spatial Analysis , 2009, Trans. GIS.

[5]  Mitchel Langford,et al.  Rapid facilitation of dasymetric-based population interpolation by means of raster pixel maps , 2007, Comput. Environ. Urban Syst..

[6]  C. Fan,et al.  China on the Move: Migration, the State, and the Household , 2007 .

[7]  Yu Liu,et al.  Towards Estimating Urban Population Distributions from Mobile Call Data , 2012 .

[8]  Peng Gong,et al.  Mapping Urban Land Use by Using Landsat Images and Open Social Data , 2016, Remote. Sens..

[9]  E. Grafarend Linear and nonlinear models : fixed effects, random effects, and mixed models , 2006 .

[10]  Catherine Linard,et al.  Disaggregating Census Data for Population Mapping Using Random Forests with Remotely-Sensed and Ancillary Data , 2015, PloS one.

[11]  Xiaocong Xu,et al.  Mapping the fine-scale spatial pattern of housing rent in the metropolitan area by using online rental listings and ensemble learning , 2016 .

[12]  Farshad Fotouhi,et al.  Bias and stability of single variable classifiers for feature ranking and selection , 2014, Expert Syst. Appl..

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Fulong Wu,et al.  Urban villages under China's rapid urbanization: Unregulated assets and transitional neighbourhoods , 2010 .

[15]  Xiaoping Liu,et al.  Simulating urban growth by integrating landscape expansion index (LEI) and cellular automata , 2014, Int. J. Geogr. Inf. Sci..

[16]  Aniruddha Ghosh,et al.  A comparison of selected classification algorithms for mapping bamboo patches in lower Gangetic plains using very high resolution WorldView 2 imagery , 2014, Int. J. Appl. Earth Obs. Geoinformation.

[17]  Juan Chen,et al.  Migration, environmental hazards, and health outcomes in China. , 2013, Social science & medicine.

[18]  Daniel Neagu,et al.  Interpreting random forest classification models using a feature contribution method , 2013, IRI.

[19]  Kristin Aunan,et al.  Internal migration and urbanization in China: impacts on population exposure to household air pollution (2000-2010). , 2014, The Science of the total environment.

[20]  Yu Zhu China's floating population and their settlement intention in the cities: Beyond the Hukou reform , 2007 .

[21]  Qingquan Li,et al.  Estimating the Distribution of Economy Activity: A Case Study in Jiangsu Province (China) Using Large Scale Social Network Data , 2014, ICDM Workshops.

[22]  D. Lu,et al.  Residential population estimation using a remote sensing derived impervious surface approach , 2006 .

[23]  Carlo Ratti,et al.  Mobile Landscapes: Using Location Data from Cell Phones for Urban Analysis , 2006 .

[24]  Carlo Ratti,et al.  Does Urban Mobility Have a Daily Routine? Learning from the Aggregate Data of Mobile Networks , 2010 .

[25]  David Shambaugh The Three Faces of Chinese Power: Might, Money, and Minds , David M. Lampton . Berkeley and London: University of California Press, 2008. xiii + 364 pp. $21.95 ISBN 978-0-520-25442-8. , 2008, The China Quarterly.

[26]  Qingquan Li,et al.  Mining time-dependent attractive areas and movement patterns from taxi trajectory data , 2009, 2009 17th International Conference on Geoinformatics.

[27]  Jie Shan,et al.  Building population mapping with aerial imagery and GIS data , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[28]  Jay Gao,et al.  Use of normalized difference built-up index in automatically mapping urban areas from TM imagery , 2003 .

[29]  Fahui Wang,et al.  Urban land uses and traffic 'source-sink areas': Evidence from GPS-enabled taxi data in Shanghai , 2012 .

[30]  Jan Peters-Anders,et al.  Mobile Phone Data as Source to Discover Spatial Activity and Motion Patterns , 2012 .

[31]  C. Lo,et al.  Dasymetric Estimation of Population Density and Areal Interpolation of Census Data , 2004 .

[32]  Alan T. Murray,et al.  A cokriging method for estimating population density in urban areas , 2005, Comput. Environ. Urban Syst..

[33]  Alexander Zipf,et al.  Fine-resolution population mapping using OpenStreetMap points-of-interest , 2014, Int. J. Geogr. Inf. Sci..

[34]  A. Tatem,et al.  Dynamic population mapping using mobile phone data , 2014, Proceedings of the National Academy of Sciences.

[35]  Chaogui Kang,et al.  Social Sensing: A New Approach to Understanding Our Socioeconomic Environments , 2015 .

[36]  J. Mennis Generating Surface Models of Population Using Dasymetric Mapping , 2003, The Professional Geographer.

[37]  B. Bhaduri,et al.  LandScan USA: a high-resolution geospatial and temporal modeling approach for population distribution and dynamics , 2007 .

[38]  Uwe Deichmann,et al.  A Review of Spatial Population Database Design and Modeling , 1996 .

[39]  Gérard Biau,et al.  Analysis of a Random Forests Model , 2010, J. Mach. Learn. Res..

[40]  Mitchel Langford,et al.  An Evaluation of Small Area Population Estimation Techniques Using Open Access Ancillary Data , 2013 .

[41]  A. Tatem,et al.  High Resolution Population Distribution Maps for Southeast Asia in 2010 and 2015 , 2013, PloS one.

[42]  Dudley L. Poston,et al.  The Population of Modern China , 1992 .

[43]  Xiuming Shan,et al.  Exploring spacetime structure of human mobility in urban space , 2011 .

[44]  Lu Guozhen,et al.  Estimating the Distribution of Economy Activity: A Case Study in Jiangsu Province (China) Using Large Scale Social Network Data , 2014, 2014 IEEE International Conference on Data Mining Workshop.

[45]  K. Chan,et al.  The Hukou System and Rural-Urban Migration in China: Processes and Changes , 1999, The China Quarterly.

[46]  R. Engstrom,et al.  Spatial refinement of census population distribution using remotely sensed estimates of impervious surfaces in Haiti , 2010 .