Classification of Settlement Types from Tweets Using LDA and LSTM

Land use reflects the interrelation between the physically built environment and the activity patterns of people. It is indispensable information for decision-makes, but up-to-date and accurate land use information is often absent. Unlike approaches that make use of remote sensing data, in this work, we are interested in a novel data source, tweets, and explore its potential for land use classification in urban areas. Specifically, we propose a general framework for classifying settlement land-use types by extracting location, time, quantity and text features of twitter data. To do so, we apply latent Dirichlet allocation (LDA) and long short-term memory (LSTM) and then combines those features with spatial-temporal feature using Fused SVM and a two-stream convolutional neural network (CNN) for classification. For the case of classifying individual tweets by the land-use classes relevant in this study - residential, non-residential and mixed usage -, we reach overall accuracy (OA), average accuracy (AA), and Kappa coefficient with 72.35%, 73.76%, and 58.43%, respectively. As for the case of classifying block settlement types, we reach 61.90%, 63.33%, and 42.84%, respectively.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Hannes Taubenböck,et al.  Measuring morphological polycentricity - A comparative analysis of urban mass concentrations using remote sensing data , 2017, Comput. Environ. Urban Syst..

[3]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[4]  Xiao Xiang Zhu,et al.  FusioNet: A two-stream convolutional neural network for urban scene classification using PolSAR and hyperspectral data , 2017, 2017 Joint Urban Remote Sensing Event (JURSE).

[5]  Kazutoshi Sumiya,et al.  Urban area characterization based on crowd behavioral lifelogs over Twitter , 2012, Personal and Ubiquitous Computing.

[6]  D. Lu,et al.  Use of impervious surface in urban land-use classification , 2006 .

[7]  Robert A. Schowengerdt,et al.  A detailed comparison of backpropagation neural network and maximum-likelihood classifiers for urban land use classification , 1995, IEEE Trans. Geosci. Remote. Sens..

[8]  T. Esch,et al.  Delineation of Central Business Districts in mega city regions using remotely sensed data , 2013 .

[9]  James R. Anderson,et al.  A land use and land cover classification system for use with remote sensor data , 1976 .

[10]  T. Esch,et al.  Monitoring urbanization in mega cities from space , 2012 .

[11]  Hannes Taubenböck,et al.  Digital deserts on the ground and from space , 2017, 2017 Joint Urban Remote Sensing Event (JURSE).

[12]  Vanessa Frías-Martínez,et al.  Spectral clustering for sensing urban land use using Twitter activity , 2014, Engineering applications of artificial intelligence.

[13]  William J. Emery,et al.  A neural network approach using multi-scale textural metrics from very high-resolution panchromatic imagery for urban land-use classification , 2009 .