Improved machine-learning mapping of local climate zones in metropolitan areas using composite Earth observation data in Google Earth Engine

Abstract Accurate, large-scale local climate zone (LCZ) maps with data consistency are crucial for urban environmental studies globally. However, current approaches using Earth observation data and machine learning (ML) algorithms with local computation power are limited by low accuracy, coverage, and efficiency. Here, we present an improved workflow for generating consistent large-scale LCZ maps based on optimal data and ML algorithm selection using the Google Earth Engine (GEE) platform. Twelve data-composition scenarios and one optimized scenario were designed to explore the effects and synergetic use of nine Earth observation datasets, based on their reported potential in pixel-based classification. Our results show that depending on the intended use of the map, the random forest (RF) classifier and support vector machine (SVM) classifier are by far the most appropriate ML algorithms for pixel-based LCZ classification. While the RF classifier achieves a significantly higher overall accuracy and shows advantages in most of the individual classes, the SVM classifier exhibits significantly less variability with regard to accuracy. In addition, the competitive accuracy of the optimized scenario shows that using “elite variables” in the RF classifier can significantly improve classification accuracy while also reducing computational burden. Furthermore, thermal-infrared variables are far more influential than other variables in LCZ classification. Our study is the first attempt to make a cross-comparison of various remote sensing datasets and ML algorithms for LCZ classification using the GEE platform. As such, our results provide valuable new insights, workflows, and future directions for large-scale LCZ classification to support urban environmental studies globally.

[1]  Bin Zhao,et al.  Long-Term Dynamic of Poyang Lake Surface Water: A Mapping Work Based on the Google Earth Engine Cloud Platform , 2019, Remote. Sens..

[2]  Gideon S. Mann,et al.  Efficient Large-Scale Distributed Training of Conditional Maximum Entropy Models , 2009, NIPS.

[3]  F. Freedman,et al.  Integration of the WUDAPT, WRF, and ENVI-met models to simulate extreme daytime temperature mitigation strategies in San Jose, California , 2020, Building and Environment.

[4]  M. Brauer,et al.  Creating National Air Pollution Models for Population Exposure Assessment in Canada , 2011, Environmental health perspectives.

[5]  Analysis of Machine Learning Classifiers for LULC Classification on Google Earth Engine , 2022 .

[6]  Gerald Forkuor,et al.  Landsat-8 vs. Sentinel-2: examining the added value of sentinel-2’s red-edge bands to land-use and land-cover mapping in Burkina Faso , 2018 .

[7]  Benjamin Bechtel,et al.  Classification of Local Climate Zones Using SAR and Multispectral Data in an Arid Environment , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  J. Fung,et al.  High-resolution calculation of the urban vegetation fraction in the Pearl River Delta from the Sentinel-2 NDVI for urban climate model parameterization , 2019, Geoscience Letters.

[9]  Shuqing Zhang,et al.  Crop classification from full-year fully-polarimetric L-band UAVSAR time-series using the Random Forest algorithm , 2020, Int. J. Appl. Earth Obs. Geoinformation.

[10]  Paul Osmond,et al.  Mapping Local Climate Zones for urban morphology classification based on airborne remote sensing data , 2017, 2017 Joint Urban Remote Sensing Event (JURSE).

[11]  J. Marshall,et al.  National Spatiotemporal Exposure Surface for NO2: Monthly Scaling of a Satellite-Derived Land-Use Regression, 2000-2010. , 2015, Environmental science & technology.

[12]  Daniel G. Aliaga,et al.  WUDAPT: An Urban Weather, Climate, and Environmental Modeling Infrastructure for the Anthropocene , 2018, Bulletin of the American Meteorological Society.

[13]  Edward Ng,et al.  GIS-based mapping of Local Climate Zone in the high-density city of Hong Kong , 2017, Urban Climate.

[14]  Saurabh Rahurkar,et al.  The relationship between built environments and physical activity: a systematic review. , 2012, American journal of public health.

[15]  Cheng-Chien Liu,et al.  Flood Prevention and Emergency Response System Powered by Google Earth Engine , 2018, Remote. Sens..

[16]  M. Pal Factors influencing the accuracy of remote sensing classifications : a comparative study , 2002 .

[17]  Paul M. Mather,et al.  Computer Processing of Remotely-Sensed Images: An Introduction , 1988 .

[18]  Michael Dixon,et al.  Google Earth Engine: Planetary-scale geospatial analysis for everyone , 2017 .

[19]  Jingzhe Wang,et al.  Combined use of Sentinel-2 and Landsat 8 to monitor water surface area dynamics using Google Earth Engine , 2020 .

[20]  Yuguo Li,et al.  The impact of building density and building height heterogeneity on average urban albedo and street surface temperature , 2015 .

[21]  Mario Chica-Olmo,et al.  An assessment of the effectiveness of a random forest classifier for land-cover classification , 2012 .

[22]  Xiaomei Zhang,et al.  30m resolution Global Annual Burned Area Mapping based on Landsat images and Google Earth Engine , 2018, Remote. Sens..

[23]  Chi Yung Jim,et al.  Urbanization effect on spatiotemporal thermal patterns and changes in Hangzhou (China) , 2018, Building and Environment.

[24]  G. Evans The built environment and mental health , 2003, Journal of Urban Health.

[25]  M. Santamouris,et al.  Synergies between Urban Heat Island and Heat Waves in Athens (Greece), during an extremely hot summer (2012) , 2017, Scientific Reports.

[26]  Steffen Fritz,et al.  Developing a community-based worldwide urban morphology and materials database (WUDAPT) using remote sensing and crowdsourcing for improved urban climate modelling , 2015, 2015 Joint Urban Remote Sensing Event (JURSE).

[27]  J. Monteith,et al.  Boundary Layer Climates. , 1979 .

[28]  Wen Liu,et al.  Characteristics of shadow and removal of its effects for remote sensing imagery , 2009, 2009 IEEE International Geoscience and Remote Sensing Symposium.

[29]  Jinwei Dong,et al.  Mapping paddy rice planting area in northeastern Asia with Landsat 8 images, phenology-based algorithm and Google Earth Engine. , 2016, Remote sensing of environment.

[30]  Matthew Montanaro,et al.  Derivation and Validation of the Stray Light Correction Algorithm for the Thermal Infrared Sensor Onboard Landsat 8 , 2017 .

[31]  Gérard Dedieu,et al.  Assessment of an Operational System for Crop Type Map Production Using High Temporal and Spatial Resolution Satellite Optical Imagery , 2015, Remote. Sens..

[32]  Iain Stewart,et al.  Mapping Local Climate Zones for a Worldwide Database of the Form and Function of Cities , 2015, ISPRS Int. J. Geo Inf..

[33]  Benjamin Bechtel,et al.  Global transferability of local climate zone models , 2019, Urban Climate.

[34]  A. G. Wacker,et al.  Minimum Distance Classification in Remote Sensing , 1972 .

[35]  Susan L Handy,et al.  How the built environment affects physical activity: views from urban planning. , 2002, American journal of preventive medicine.

[36]  Pierre Soille,et al.  Assessment of the Added-Value of Sentinel-2 for Detecting Built-up Areas , 2016, Remote. Sens..

[37]  Martin C. Roberge,et al.  IMPERVIOUS SURFACES AND THE QUALITY OF NATURAL AND BUILT ENVIRONMENTS , 2002 .

[38]  M. J. Nigam,et al.  Shadow Detection and Removal from Remote Sensing Images using NDI and Morphological Operators , 2012 .

[39]  Sundus Shareef The impact of urban morphology and building's height diversity on energy consumption at urban scale. The case study of Dubai , 2021 .

[40]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[41]  S. Hassan,et al.  AN ASSESSMENT OF SPATIAL VARIATION OF LAND SURFACE CHARACTERISTICS OF MINNA, NIGER STATE NIGERIA FOR SUSTAINABLE URBANIZATION USING GEOSPATIAL TECHNIQUES , 2018, Geosfera Indonesia.

[42]  Maria Antonia Brovelli,et al.  Monitoring Forest Change in the Amazon Using Multi-Temporal Remote Sensing Data and Machine Learning Classification on Google Earth Engine , 2020, ISPRS Int. J. Geo Inf..

[43]  Constantin F. Aliferis,et al.  A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification , 2008, BMC Bioinformatics.

[44]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[45]  Mark J Nieuwenhuijsen,et al.  Car free cities: Pathway to healthy urban living. , 2016, Environment international.

[46]  Aman,et al.  COMPARATIVE ANALYSIS OF DIFFERENT METHODOLOGIES FOR LOCAL CLIMATE ZONE CLASSIFICATION , 2018 .

[47]  W. Emery,et al.  Satellite-derived urban heat islands from three coastal cities and the utilization of such data in urban climatology , 1989 .

[48]  Cheolhee Yoo,et al.  Comparison between convolutional neural networks and random forest for local climate zone classification in mega urban areas using Landsat images , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[49]  David B. Lobell,et al.  Smallholder maize area and yield mapping at national scales with Google Earth Engine , 2019, Remote Sensing of Environment.

[50]  Fabio Del Frate,et al.  Exploiting Earth Observation data products for mapping Local Climate Zones , 2015, 2015 Joint Urban Remote Sensing Event (JURSE).

[51]  D. Leung,et al.  A review on the generation, determination and mitigation of urban heat island. , 2008, Journal of environmental sciences.

[52]  Jianping Wu,et al.  Evaluation of NPP-VIIRS night-time light composite data for extracting built-up urban areas , 2014 .

[53]  Andrew K. Skidmore,et al.  Potential of Sentinel-2 spectral configuration to assess rangeland quality , 2015 .

[54]  J. Bouyer,et al.  Using Local Climate Zone scheme for UHI assessment: Evaluation of the method using mobile measurements , 2015 .

[55]  Lindi J. Quackenbush,et al.  Impact of training and validation sample selection on classification accuracy and accuracy assessment when using reference polygons in object-based classification , 2013 .

[56]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[57]  Ran Wang,et al.  Detecting multi-temporal land cover change and land surface temperature in Pearl River Delta by adopting local climate zone , 2019, Urban Climate.

[58]  Onisimo Mutanga,et al.  Remotely sensed retrieval of Local Climate Zones and their linkages to land surface temperature in Harare metropolitan city, Zimbabwe , 2019, Urban Climate.

[59]  Linda See,et al.  Generating WUDAPT Level 0 data – Current status of production and evaluation , 2019, Urban Climate.

[60]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[61]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[62]  Patricia Gober,et al.  Per-pixel vs. object-based classification of urban land cover extraction using high spatial resolution imagery , 2011, Remote Sensing of Environment.

[63]  Yang Shao,et al.  Comparison of support vector machine, neural network, and CART algorithms for the land-cover classification using limited training data points , 2012 .

[64]  Yunfei Gao,et al.  Block-based local climate zone approach to urban climate maps using the UDC model , 2020 .

[65]  Benjamin Bechtel,et al.  WUDAPT, an efficient land use producing data tool for mesoscale models? Integration of urban LCZ in WRF over Madrid , 2016 .

[66]  Catherine Linard,et al.  Using Local Climate Zones in Sub-Saharan Africa to tackle urban health issues , 2019, Urban Climate.

[67]  Luc Int Panis,et al.  Improving health through policies that promote active travel: a review of evidence to support integrated health impact assessment. , 2011, Environment international.

[68]  Geoffrey I. Webb Naïve Bayes , 2020, Encyclopedia of Machine Learning.

[69]  Damien Sulla-Menashe,et al.  MODIS Collection 5 global land cover: Algorithm refinements and characterization of new datasets , 2010 .

[70]  Steve Hankey,et al.  Correlates of the Built Environment and Active Travel: Evidence from 20 US Metropolitan Areas , 2018, Environmental health perspectives.

[71]  Benjamin Bechtel,et al.  Mapping Europe into local climate zones , 2019, PloS one.

[72]  Shan Suthaharan,et al.  Decision Tree Learning , 2016 .

[73]  Chunhong Zhao,et al.  Application of airborne remote sensing data on mapping local climate zones: Cases of three metropolitan areas of Texas, U.S , 2019, Comput. Environ. Urban Syst..

[74]  Benjamin Bechtel,et al.  Classification of Local Climate Zones Based on Multiple Earth Observation Data , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[75]  Julian D Marshall,et al.  A national satellite-based land-use regression model for air pollution exposure assessment in Australia. , 2014, Environmental research.

[76]  Jin Chen,et al.  Global land cover mapping at 30 m resolution: A POK-based operational approach , 2015 .

[77]  R. Avissar Potential effects of vegetation on the urban thermal environment , 1996 .

[78]  T. Gan,et al.  Long-term impact of rapid urbanization on urban climate and human thermal comfort in hot-arid environment , 2018, Building and Environment.

[79]  N. Clinton,et al.  A mangrove forest map of China in 2015: Analysis of time series Landsat 7/8 and Sentinel-1A imagery in Google Earth Engine cloud computing platform , 2017 .

[80]  M. Demuzere,et al.  Combining expert and crowd-sourced training data to map urban form and functions for the continental US , 2020, Scientific Data.

[81]  Xiao Xiang Zhu,et al.  Local climate zone-based urban land cover classification from multi-seasonal Sentinel-2 images with a recurrent residual network , 2019, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[82]  Nengcheng Chen,et al.  Long-Term Surface Water Dynamics Analysis Based on Landsat Imagery and the Google Earth Engine Platform: A Case Study in the Middle Yangtze River Basin , 2018, Remote. Sens..

[83]  T. Oke,et al.  Local Climate Zones for Urban Temperature Studies , 2012 .

[84]  Yu Wang,et al.  A Debiased MDI Feature Importance Measure for Random Forests , 2019, NeurIPS.

[85]  Francesca Fourney,et al.  Additive negative effects of anthropogenic sedimentation and warming on the survival of coral recruits , 2017, Scientific Reports.

[86]  Paul M. Mather,et al.  Support vector machines for classification in remote sensing , 2005 .

[87]  R. Silva,et al.  Spatiotemporal impact of land use/land cover changes on urban heat islands: A case study of Paço do Lumiar, Brazil , 2018 .

[88]  Thomas Jagdhuber,et al.  Sensitivity of Sentinel-1 backscatter to characteristics of buildings , 2017 .

[89]  C. Ren,et al.  Assessment of Local Climate Zone Classification Maps of Cities in China and Feasible Refinements , 2019, Scientific Reports.

[90]  D. Michelson Comparison of Algorithms for Classifying Swedish Landcover Using Landsat TM and ERS-1 SAR Data , 2000 .

[91]  Xiaoping Liu,et al.  High-resolution multi-temporal mapping of global urban land using Landsat images based on the Google Earth Engine Platform , 2018 .

[92]  C. Elvidge,et al.  VIIRS night-time lights , 2017, Remote Sensing of Night-time Light.

[93]  Wei Dai,et al.  Local Climate Zone Study for Sustainable Megacities Development by Using Improved WUDAPT Methodology – A Case Study in Guangzhou , 2016 .