Big spatial data for urban and environmental sustainability

ABSTRACT Eighty percent of big data are associated with spatial information, and thus are Big Spatial Data (BSD). BSD provides new and great opportunities to rework problems in urban and environmental sustainability with advanced BSD analytics. To fully leverage the advantages of BSD, it is integrated with conventional data (e.g. remote sensing images) and improved methods are developed. This paper introduces four case studies: (1) Detection of polycentric urban structures; (2) Evaluation of urban vibrancy; (3) Estimation of population exposure to PM2.5; and (4) Urban land-use classification via deep learning. The results provide evidence that integrated methods can harness the advantages of both traditional data and BSD. Meanwhile, they can also improve the effectiveness of big data itself. Finally, this study makes three key recommendations for the development of BSD with regards to data fusion, data and predicting analytics, and theoretical modeling.

[1]  J. Jacobs The Death and Life of Great American Cities , 1962 .

[2]  Luisa Verdoliva,et al.  Land Use Classification in Remote Sensing Images by Convolutional Neural Networks , 2015, ArXiv.

[3]  Fuchun Sun,et al.  Aerial Scene Classification with Convolutional Neural Networks , 2015, ISNN.

[4]  Klaus-Robert Müller,et al.  Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models , 2017, ArXiv.

[5]  Jack C. P. Cheng,et al.  Spatial and temporal variations of spatial population accessibility to public hospitals: a case study of rural–urban comparison , 2018 .

[6]  Marco Madella,et al.  Land-use classification , 2016 .

[7]  Shashi Shekhar,et al.  Spatial big-data challenges intersecting mobility and cloud computing , 2012, MobiDE '12.

[8]  Jamie Peck,et al.  Variegated neoliberalization: geographies, modalities, pathways , 2010 .

[9]  Michael Batty,et al.  Spatial multi-objective land use optimization: extensions to the non-dominated sorting genetic algorithm-II , 2011, Int. J. Geogr. Inf. Sci..

[10]  Qingqing He,et al.  Satellite-based mapping of daily high-resolution ground PM 2.5 in China via space-time regression modeling , 2018 .

[11]  Yiran Peng,et al.  MODIS Collection 6.1 aerosol optical depth products over land and ocean: validation and comparison , 2019, Atmospheric Environment.

[12]  Michael Batty,et al.  Big data, smart cities and city planning , 2013, Dialogues in human geography.

[13]  Yang Liu,et al.  Satellite-Based Spatiotemporal Trends in PM2.5 Concentrations: China, 2004–2013 , 2015, Environmental health perspectives.

[14]  Mei-Po Kwan,et al.  Algorithmic Geographies: Big Data, Algorithmic Uncertainty, and the Production of Geographic Knowledge , 2016, Geographies of Mobility.

[15]  Antônio Miguel Vieira Monteiro,et al.  Parameter selection for region‐growing image segmentation algorithms using spatial autocorrelation , 2006 .

[16]  Chuanglin Fang,et al.  Spatial-temporal characteristics of PM2.5 in China: A city-level perspective analysis , 2016, Journal of Geographical Sciences.

[17]  M. Pinquart,et al.  Influences of socioeconomic status, social network, and competence on subjective well-being in later life: a meta-analysis. , 2000, Psychology and aging.

[18]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[19]  X. Bai,et al.  Society: Realizing China's urban dream , 2014, Nature.

[20]  Ying Long,et al.  SinoGrids: a practice for open urban data in China , 2016 .

[21]  Stephen P. Mills,et al.  Suomi satellite brings to light a unique frontier of nighttime environmental sensing capabilities , 2012, Proceedings of the National Academy of Sciences.

[22]  Divesh Srivastava,et al.  Big data integration , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[23]  Bo Huang,et al.  Urban land-use mapping using a deep convolutional neural network with high spatial resolution multispectral remote sensing imagery , 2018, Remote Sensing of Environment.

[24]  Hélène Laurent,et al.  Unsupervised Performance Evaluation of Image Segmentation , 2006, EURASIP J. Adv. Signal Process..

[25]  Yang Liu,et al.  Estimating Regional Spatial and Temporal Variability of PM2.5 Concentrations Using Satellite Data, Meteorology, and Land Use Information , 2009, Environmental health perspectives.

[26]  Ripon Patgiri,et al.  Taxonomy of Big Data: A Survey , 2018, ArXiv.

[27]  E. Lynn Usery,et al.  Using Geometrical, Textural, and Contextual Information of Land Parcels for Classification of Detailed Urban Land Use , 2009 .

[28]  Jiansheng Wu,et al.  Spatiotemporal patterns of remotely sensed PM2.5 concentration in China from 1999 to 2011 , 2016 .

[29]  D. Machin,et al.  Critical Discourse Analysis and the challenges and opportunities of social media , 2018, Critical Discourse Studies and/in Communication.

[30]  Veda C. Storey,et al.  Business Intelligence and Analytics: From Big Data to Big Impact , 2012, MIS Q..

[31]  Bo Du,et al.  Scene Classification via a Gradient Boosting Random Convolutional Network Framework , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[32]  Z. Irani,et al.  Critical analysis of Big Data challenges and analytical methods , 2017 .

[33]  Yuqi Bai,et al.  Daily Estimation of Ground-Level PM2.5 Concentrations over Beijing Using 3 km Resolution MODIS AOD. , 2015, Environmental science & technology.

[34]  Armistead G Russell,et al.  Daily estimation of ground-level PM2.5 concentrations at 4km resolution over Beijing-Tianjin-Hebei by fusing MODIS AOD and ground observations. , 2017, The Science of the total environment.

[35]  Shuliang Wang,et al.  Spatial Data Mining , 2019, Encyclopedia of Big Data Technologies.

[36]  Bo Huang,et al.  Satellite-based high-resolution PM2.5 estimation over the Beijing-Tianjin-Hebei region of China using an improved geographically and temporally weighted regression model. , 2018, Environmental pollution.

[37]  Klaus-Robert Müller,et al.  Explainable artificial intelligence , 2017 .

[38]  I. Kohane,et al.  Big Data and Machine Learning in Health Care. , 2018, JAMA.

[39]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[40]  Jan Gehl,et al.  Life Between Buildings: Using Public Space , 2003 .

[41]  L. Bengtsson,et al.  Improved Response to Disasters and Outbreaks by Tracking Population Movements with Mobile Phone Network Data: A Post-Earthquake Geospatial Study in Haiti , 2011, PLoS medicine.

[42]  P. Hall,et al.  Urban Future 21: A Global Agenda for Twenty-First Century Cities , 2000 .

[43]  Chris Brunsdon,et al.  Geographically Weighted Regression: The Analysis of Spatially Varying Relationships , 2002 .

[44]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[45]  Gui-Song Xia,et al.  Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery , 2015, Remote. Sens..

[46]  Emanuele Strano,et al.  Spatial sensitivity analysis for urban land use prediction with physics-constrained conditional generative adversarial networks , 2019, ArXiv.

[47]  Wei Tu,et al.  Evaluating and characterizing urban vibrancy using spatial big data: Shanghai as a case study , 2020, Environment and Planning B: Urban Analytics and City Science.

[48]  H. Long,et al.  Land use transitions and their effects on water environment in Huang-Huai-Hai Plain, China , 2015 .

[49]  Selim Aksoy,et al.  Learning bayesian classifiers for scene classification with a visual grammar , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[50]  Bo Huang,et al.  Using multi-source geospatial big data to identify the structure of polycentric cities , 2017 .

[51]  Atsushi Nara,et al.  A meta-modeling approach for spatio-temporal uncertainty and sensitivity analysis: an application for a cellular automata-based Urban growth and land-use change model , 2018, Int. J. Geogr. Inf. Sci..

[52]  Kang-Woo Lee,et al.  High-Performance Geospatial Big Data Processing System Based on MapReduce , 2018, ISPRS Int. J. Geo Inf..

[53]  Christiana Kartsonaki,et al.  Big data: Some statistical issues , 2018, Statistics & probability letters.

[54]  Victor Couture Valuing the Consumption Benefits of Urban Density , 2014 .

[55]  Bin Chen,et al.  Dynamic assessment of PM2.5 exposure and health risk using remote sensing and geo-spatial big data. , 2019, Environmental pollution.

[56]  Shawn D. Newsam,et al.  Spatial pyramid co-occurrence for image classification , 2011, 2011 International Conference on Computer Vision.

[57]  Alex Pentland,et al.  Once Upon a Crime: Towards Crime Prediction from Demographics and Mobile Data , 2014, ICMI.

[58]  Yuan Zhou,et al.  Restoration of Information Obscured by Mountainous Shadows Through Landsat TM/ETM+ Images Without the Use of DEM Data: A New Method , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[59]  Nicu Sebe,et al.  The Death and Life of Great Italian Cities: A Mobile Phone Data Perspective , 2016, WWW.

[60]  B Huang,et al.  Comprehensive Geographic Information Systems , 2017 .

[61]  J. Jason West,et al.  An Estimate of the Global Burden of Anthropogenic Ozone and Fine Particulate Matter on Premature Human Mortality Using Atmospheric Modeling , 2010, Environmental health perspectives.

[62]  Bin Chen,et al.  Multi-source remotely sensed data fusion for improving land cover classification , 2017 .

[63]  Bin Chen,et al.  Real-Time Estimation of Population Exposure to PM2.5 Using Mobile- and Station-Based Big Data , 2018, International journal of environmental research and public health.

[64]  Shougeng Hu,et al.  Automated urban land-use classification with remote sensing , 2013 .

[65]  Bin Chen,et al.  How do people in different places experience different levels of air pollution? Using worldwide Chinese as a lens. , 2018, Environmental pollution.

[66]  John Montgomery,et al.  Making a city: Urbanity, vitality and urban design , 1998 .

[67]  Bo Wu,et al.  Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices , 2010, Int. J. Geogr. Inf. Sci..

[68]  Yanfei Zhong,et al.  A spectral–structural bag-of-features scene classifier for very high spatial resolution remote sensing imagery , 2016 .

[69]  Christiane Schmullius,et al.  Object-based land cover mapping and comprehensive feature calculation for an automated derivation of urban structure types at block level , 2014 .

[70]  Erik M. Fredericks,et al.  Uncertainty in big data analytics: survey, opportunities, and challenges , 2019, Journal of Big Data.

[71]  Mingshu Wang,et al.  How polycentric is urban China and why? A case study of 318 cities , 2016 .

[72]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[73]  C. Fischer "Urbanism as a Way of Life" , 1972 .

[74]  Stefan Voigt,et al.  Satellite Image Analysis for Disaster and Crisis-Management Support , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[75]  G. Simmel The Metropolis and Mental Life , 2012 .

[76]  Jue Wang,et al.  An Innovative Context-Based Crystal-Growth Activity Space Method for Environmental Exposure Assessment: A Study Using GIS and GPS Trajectory Data Collected in Chicago , 2018, International journal of environmental research and public health.

[77]  Hassan Ghassemian,et al.  A review of remote sensing image fusion methods , 2016, Inf. Fusion.

[78]  A. Dale,et al.  Community Vitality: The Role of Community-Level Resilience Adaptation and Innovation in Sustainable Development , 2010 .

[79]  R. Britter,et al.  "Exposure Track"-The Impact of Mobile-Device-Based Mobility Patterns on Quantifying Population Exposure to Air Pollution. , 2016, Environmental science & technology.

[80]  D. McMillen Nonparametric Employment Subcenter Identification , 2001 .

[81]  S. Openshaw A million or so correlation coefficients : three experiments on the modifiable areal unit problem , 1979 .

[82]  Bei Zhao,et al.  Scene classification via latent Dirichlet allocation using a hybrid generative/discriminative strategy for high spatial resolution remote sensing imagery , 2013 .

[83]  M. Stern,et al.  Cultural Clusters: The Implications of Cultural Assets Agglomeration for Neighborhood Revitalization , 2010 .

[84]  Jun Wang,et al.  Intercomparison between satellite‐derived aerosol optical thickness and PM2.5 mass: Implications for air quality studies , 2003 .

[85]  N. Brenner Implosions/explosions. Towards a study of planetary urbanization , 2014 .

[86]  Liangpei Zhang,et al.  Multiagent Object-Based Classifier for High Spatial Resolution Imagery , 2014, IEEE Transactions on Geoscience and Remote Sensing.