Classifying urban land use by integrating remote sensing and social media data

ABSTRACT Urban land use information plays an important role in urban management, government policy-making, and population activity monitoring. However, the accurate classification of urban functional zones is challenging due to the complexity of urban systems. Many studies have focused on urban land use classification by considering features that are extracted from either high spatial resolution (HSR) remote sensing images or social media data, but few studies consider both features due to the lack of available models. In our study, we propose a novel scene classification framework to identify dominant urban land use type at the level of traffic analysis zone by integrating probabilistic topic models and support vector machine. A land use word dictionary inside the framework was built by fusing natural–physical features from HSR images and socioeconomic semantic features from multisource social media data. In addition to comparing with manual interpretation data, we designed several experiments to test the land use classification accuracy of our proposed model with different combinations of previously acquired semantic features. The classification results (overall accuracy = 0.865, Kappa = 0.828) demonstrate the effectiveness of our strategy that blends features extracted from multisource geospatial data as semantic features to train the classification model. This method can be applied to help urban planners analyze fine urban structures and monitor urban land use changes, and additional data from multiple sources will be blended into this proposed framework in the future.

[1]  Jie Wang,et al.  Comparison of Classification Algorithms and Training Sample Sizes in Urban Land Classification with Landsat Thematic Mapper Imagery , 2014, Remote. Sens..

[2]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[3]  Yonghong Tian,et al.  CNN vs. SIFT for Image Retrieval: Alternative or Complementary? , 2016, ACM Multimedia.

[4]  Stéphane Dupuy,et al.  Land-cover dynamics in Southeast Asia: Contribution of object-oriented techniques for change detection , 2012 .

[5]  Chenghu Zhou,et al.  A new insight into land use classification based on aggregated mobile phone data , 2013, Int. J. Geogr. Inf. Sci..

[6]  Ramesh Nallapati,et al.  Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora , 2009, EMNLP.

[7]  Hua Xu,et al.  Chinese comments sentiment classification based on word2vec and SVMperf , 2015, Expert Syst. Appl..

[8]  Eric F. Lambin,et al.  Land-Use and Land-Cover Change , 2006 .

[9]  Hui Xiong,et al.  Discovering Urban Functional Zones Using Latent Activity Trajectories , 2015, IEEE Transactions on Knowledge and Data Engineering.

[10]  Liangpei Zhang,et al.  Hybrid generative/discriminative scene classification strategy based on latent dirichlet allocation for high spatial resolution remote sensing imagery , 2013, 2013 IEEE International Geoscience and Remote Sensing Symposium - IGARSS.

[11]  Cui Chengyin,et al.  Identifying Commuting Pattern of Beijing Using Bus Smart Card Data , 2012 .

[12]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[13]  Ioannis Mourtos,et al.  Hyperarc Consistency for the Stable Admissions Problem , 2007 .

[14]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[15]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[16]  Francisco C. Pereira,et al.  Mining point-of-interest data from social networks for urban land use classification and disaggregation , 2015, Comput. Environ. Urban Syst..

[17]  Nathan S. Netanyahu,et al.  An Efficient SIFT-Based Mode-Seeking Algorithm for Sub-Pixel Registration of Remotely Sensed Images , 2015, IEEE Geoscience and Remote Sensing Letters.

[18]  Yatao Zhang,et al.  Mapping fine-scale population distributions at the building level by integrating multisource geospatial big data , 2017, Int. J. Geogr. Inf. Sci..

[19]  Yue Lu,et al.  Investigating task performance of probabilistic topic models: an empirical study of PLSA and LDA , 2011, Information Retrieval.

[20]  Xing Xie,et al.  Discovering regions of different functions in a city using human mobility and POIs , 2012, KDD.

[21]  Wenkai Li,et al.  A New Accuracy Assessment Method for One-Class Remote Sensing Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[22]  Liangpei Zhang,et al.  Scene Classification Based on the Multifeature Fusion Probabilistic Topic Model for High Spatial Resolution Remote Sensing Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[23]  M. Herold,et al.  Spatial Metrics and Image Texture for Mapping Urban Land Use , 2003 .

[24]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[25]  Gaetano Valenti,et al.  Traffic Estimation And Prediction Based On Real Time Floating Car Data , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[26]  Xiaoping Liu,et al.  Sensing spatial distribution of urban land use by integrating points-of-interest and Google Word2Vec model , 2017, Int. J. Geogr. Inf. Sci..

[27]  Xiaoping Liu,et al.  Delineating urban functional areas with building-level social media data: A dynamic time warping (DTW) distance based k-medoids method , 2017 .

[28]  U. Benz,et al.  Multi-resolution, object-oriented fuzzy analysis of remote sensing data for GIS-ready information , 2004 .

[29]  Filipe Rodrigues,et al.  Automatic Classification of Points-of-Interest for Land-use Analysis , 2012 .

[30]  Dietmar Bauer,et al.  Inferring land use from mobile phone activity , 2012, UrbComp '12.

[31]  Andrew Zisserman,et al.  Scene Classification Via pLSA , 2006, ECCV.

[32]  林 良嗣,et al.  Transport, land-use and the environment , 1996 .

[33]  Yang Yang,et al.  Learning semantic visual vocabularies using diffusion distance , 2009, CVPR.

[34]  Eduardo R. Hruschka,et al.  Document Clustering for Forensic Analysis: An Approach for Improving Computer Inspection , 2013, IEEE Transactions on Information Forensics and Security.

[35]  Peng Gong,et al.  Mapping Urban Land Use by Using Landsat Images and Open Social Data , 2016, Remote. Sens..

[36]  Jean-Claude Thill,et al.  Combining smart card data and household travel survey to analyze jobs-housing relationships in Beijing , 2013, Comput. Environ. Urban Syst..

[37]  Yun Zhu,et al.  Support vector machines and Word2vec for text classification with semantic features , 2015, 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC).

[38]  Xin Huang,et al.  A multi-index learning approach for classification of high-resolution remotely sensed images over urban areas , 2014 .

[39]  Thomas Blaschke,et al.  Geographic Object-Based Image Analysis – Towards a new paradigm , 2014, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[40]  Yu Li,et al.  Automatic Target Detection in High-Resolution Remote Sensing Images Using Spatial Sparse Coding Bag-of-Words Model , 2012, IEEE Geoscience and Remote Sensing Letters.

[41]  Xiaoping Liu,et al.  An improved artificial immune system for seeking the Pareto front of land-use allocation problem in large areas , 2013, Int. J. Geogr. Inf. Sci..

[42]  Jiao Licheng Research on Computation of GLCM of Image Texture , 2006 .

[43]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[44]  Shougeng Hu,et al.  Automated urban land-use classification with remote sensing , 2013 .

[45]  Jun Liu,et al.  Semantic classification for hyperspectral image by integrating distance measurement and relevance vector machine , 2015, Multimedia Systems.

[46]  James J. Little,et al.  Improving scene attribute recognition using web-scale object detectors , 2015, Comput. Vis. Image Underst..

[47]  Chen Hua-jie Study on traffic zone division based on spatial clustering analysis , 2009 .

[48]  Shihong Du,et al.  Semantic Classification of Heterogeneous Urban Scenes Using Intrascene Feature Similarity and Interscene Semantic Dependency , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[49]  Nathan S. Netanyahu,et al.  A sift-based mode-seeking procedure for efficient, accurate registration of remotely sensed images , 2013, 2013 IEEE International Geoscience and Remote Sensing Symposium - IGARSS.

[50]  Liangpei Zhang,et al.  An SVM Ensemble Approach Combining Spectral, Structural, and Semantic Features for the Classification of High-Resolution Remotely Sensed Imagery , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[51]  Germain Forestier,et al.  Ontology-Based Object Recognition for Remote Sensing Image Interpretation , 2007, 19th IEEE International Conference on Tools with Artificial Intelligence(ICTAI 2007).

[52]  D. Skole,et al.  Land Use and Land Cover Change , 2014 .

[53]  Gang Zhao,et al.  Robust global sensitivity analysis under deep uncertainty via scenario analysis , 2016, Environ. Model. Softw..

[54]  Alexander Zipf,et al.  Toward mapping land-use patterns from volunteered geographic information , 2013, Int. J. Geogr. Inf. Sci..

[55]  Liangpei Zhang,et al.  A scene change detection framework for multi-temporal very high resolution remote sensing images , 2016, Signal Process..

[56]  Xiaoping Liu,et al.  Simulating urban growth by integrating landscape expansion index (LEI) and cellular automata , 2014, Int. J. Geogr. Inf. Sci..

[57]  Alexander Zipf,et al.  Fine-resolution population mapping using OpenStreetMap points-of-interest , 2014, Int. J. Geogr. Inf. Sci..

[58]  Xingjian Liu,et al.  Automated identification and characterization of parcels (AICP) with OpenStreetMap and Points of Interest , 2013, ArXiv.

[59]  Piotr Tokarczyk,et al.  Features, Color Spaces, and Boosting: New Insights on Semantic Classification of Remote Sensing Images , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[60]  Shihong Du,et al.  A Linear Dirichlet Mixture Model for decomposing scenes: Application to analyzing urban functional zonings , 2015 .

[61]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[62]  Mihai Datcu,et al.  Bridging the Semantic Gap for Satellite Image Annotation and Automatic Mapping Applications , 2011, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[63]  Zhenjiang Shen,et al.  V-BUDEM: A Vector-Based Beijing Urban Development Model for Simulating Urban Growth , 2015 .

[64]  A. Unnikrishnan,et al.  GREY LEVEL CO-OCCURRENCE MATRICES : GENERALISATION AND SOME NEW FEATURES , 2012, 1205.4831.

[65]  Chaogui Kang,et al.  Social Sensing: A New Approach to Understanding Our Socioeconomic Environments , 2015 .

[66]  Anjan Gudigar,et al.  Decision support system for fatty liver disease using GIST descriptors extracted from ultrasound images , 2016, Inf. Fusion.

[67]  Yan-sui Liu,et al.  Key issues of land use in China and implications for policy making , 2014 .

[68]  P. Sathyanarayana,et al.  Image Texture Feature Extraction Using GLCM Approach , 2013 .

[69]  Hui Liu,et al.  Spatiotemporal Detection and Analysis of Urban Villages in Mega City Regions of China Using High-Resolution Remotely Sensed Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[70]  E. Lynn Usery,et al.  Using Geometrical, Textural, and Contextual Information of Land Parcels for Classification of Detailed Urban Land Use , 2009 .

[71]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[72]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[73]  Ping Jian,et al.  Semantic Annotation of High-Resolution Remote Sensing Images via Gaussian Process Multi-Instance Multilabel Learning , 2013, IEEE Geoscience and Remote Sensing Letters.

[74]  Jon Atli Benediktsson,et al.  A Novel Automatic Change Detection Method for Urban High-Resolution Remotely Sensed Imagery Based on Multiindex Scene Representation , 2016, IEEE Transactions on Geoscience and Remote Sensing.