Automated classification of building structures for urban built environment identification using machine learning

Abstract The urban environment, especially buildings with different structure types, is difficult to identify automatically due to variation in urban function and form and to buildings’ relatively tacit structural attributes. To enable automated classification of building structures in complex urban environment, this study developed a machine learning (ML)-based method. Twenty-nine ML features were defined, and twelve popular ML algorithms were tested. The method was tested by classifying over 3700 buildings in Beijing, China into five common structures. Of the 12 ML algorithms, the Gradient Boosted Decision Tree delivered the highest overall performance, and its classification was verified as effective, i.e., achieving approximately 91.7%, 90.6%, and 91.1% of average recall, precision and F1, respectively, on testing data. This study’s classification method helps establish a nexus among urban form, building structures, and materials and resource requirements, thus contributing to the advance of sustainable urban studies, such as urban metabolism and ecological planning.

[1]  Xiao Xiang Zhu,et al.  Model Fusion for Building Type Classification from Aerial and Street View Images , 2019, Remote. Sens..

[2]  Marc Sebban,et al.  Efficient Top Rank Optimization with Gradient Boosting for Supervised Anomaly Detection , 2017, ECML/PKDD.

[3]  Li Zhuo,et al.  A Novel Building Type Classification Scheme Based on Integrated LiDAR and High-Resolution Images , 2017, Remote Sensing.

[4]  C. V. Jawahar,et al.  Are buildings only instances?: exploration in architectural style categories , 2012, ICVGIP '12.

[5]  Thomas Blaschke,et al.  Ontology-Based Classification of Building Types Detected from Airborne Laser Scanning Data , 2014, Remote. Sens..

[6]  Judith W. Dexheimer,et al.  Natural Language Processing – The Basics , 2012 .

[7]  Robert Ries,et al.  Unlocking the green opportunity for prefabricated buildings and construction in China , 2018, Resources, Conservation and Recycling.

[8]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[9]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[10]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[11]  Fabrizio Sebastiani,et al.  Machine learning in automated text categorization , 2001, CSUR.

[12]  Robert Ries,et al.  The embodied air pollutant emissions and water footprints of buildings in China: a quantification using disaggregated input–output life cycle inventory model , 2016 .

[13]  Junjie Wu,et al.  Architectural Style Classification Using Multinomial Latent Logistic Regression , 2014, ECCV.

[14]  Robert Ries,et al.  Disaggregated I-O LCA model for building product chain energy quantification: A case from China , 2014 .

[15]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[16]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[17]  Bruce W. Suter,et al.  The multilayer perceptron as an approximation to a Bayes optimal discriminant function , 1990, IEEE Trans. Neural Networks.

[18]  Nicholas C. Coops,et al.  Predicting building ages from LiDAR data with random forests for building energy modeling , 2014 .

[19]  Doreen S. Boyd,et al.  Predicting residential building age from map data , 2019, Comput. Environ. Urban Syst..

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  Gayane Shalunts,et al.  Architectural Style Classification of Building Facade Towers , 2011, ISVC.

[22]  Csaba Benedek,et al.  HierarchyNet: Hierarchical CNN-Based Urban Building Classification , 2020, Remote. Sens..

[23]  Nora El-Gohary,et al.  Domain-Specific Hierarchical Text Classification for Supporting Automated Environmental Compliance Checking , 2016, J. Comput. Civ. Eng..

[24]  Michael E. Hodgson,et al.  Building type classification using spatial and landscape attributes derived from LiDAR remote sensing data , 2014 .

[25]  Charu C. Aggarwal,et al.  A Survey of Text Classification Algorithms , 2012, Mining Text Data.

[26]  Andrew J. Tatem,et al.  Using GIS and Machine Learning to Classify Residential Status of Urban Buildings in Low and Middle Income Settings , 2020, Remote. Sens..

[27]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[28]  Ella M. Atkins,et al.  Roof Shape Classification from LiDAR and Satellite Image Data Fusion Using Supervised Learning , 2018, Sensors.