Employing Crowdsourced Geographic Information to Classify Land Cover with Spatial Clustering and Topic Model

Land cover classification is the most important element of land cover mapping and is a key input to many societal benefits. Traditional classification methods require a large amount of remotely sensed images, which are time consuming and labour intensive. Recently, crowdsourcing geographic information (CGI), including geo-tagged photos and other sources, has been widely used with lower costs, but still requires extensive labour for data classification. Alternatively, CGI textual information is available from online sources containing land cover information, and it provides a useful source for land cover classification. However, the major challenge of utilising CGI is its uneven spatial distributions in land cover regions, leading to less reliability of regions for land cover classification with sparsely distributed CGI. Moreover, classifying various unorganised CGI texts automatically in each land cover region is another challenge. This paper investigates a faster and more automated method that does not require remotely sensed images for land cover classification. Spatial clustering is employed for CGI to reduce the effect of uneven spatial distributions by extracting land cover regions with high density of CGI. To classify unorganised various CGI texts in each extracted region, land cover topics are calculated using topic model. As a case study, we applied this method using points of interest (POIs) as CGI to classify land cover in Shandong province. The classification result using our proposed method achieved an overall accuracy of approximately 80%, providing evidence that CGI with textual information has a great potential for land cover classification.

[1]  J. Fowler,et al.  Rapid assessment of disaster damage using social media activity , 2016, Science Advances.

[2]  Peng Gong,et al.  Mapping Urban Land Use by Using Landsat Images and Open Social Data , 2016, Remote. Sens..

[3]  Krzysztof Janowicz,et al.  Things and Strings: Improving Place Name Disambiguation from Short Texts by Combining Entity Co-Occurrence with Topic Modeling , 2016, EKAW.

[4]  Shawn D. Newsam,et al.  Land cover classification using geo-referenced photos , 2014, Multimedia Tools and Applications.

[5]  Krzysztof Janowicz,et al.  A data-synthesis-driven method for detecting and extracting vague cognitive regions , 2017, Int. J. Geogr. Inf. Sci..

[6]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[7]  Asamaporn Sitthi,et al.  Exploring Land Use and Land Cover of Geotagged Social-Sensing Images Using Naive Bayes Classifier , 2016 .

[8]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Lucy Bastin,et al.  Usability of VGI for validation of land cover maps , 2015, Int. J. Geogr. Inf. Sci..

[10]  Hui Xiong,et al.  Point-of-Interest Recommendation in Location Based Social Networks with Topic and Location Awareness , 2013, SDM.

[11]  Shawn D. Newsam,et al.  Exploring Geotagged images for land-use classification , 2012, GeoMM '12.

[12]  Chaogui Kang,et al.  Social Sensing: A New Approach to Understanding Our Socioeconomic Environments , 2015 .

[13]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[14]  Jacinto Estima,et al.  Investigating the Potential of OpenStreetMap for Land Use/Land Cover Production: A Case Study for Continental Portugal , 2015, OpenStreetMap in GIScience.

[15]  Li Gong,et al.  Revealing travel patterns and city structure with taxi trip data , 2016 .

[16]  M. Goodchild,et al.  Crowdsourcing Geographic Knowledge: Volunteered Geographic Information (VGI) in Theory and Practice , 2012 .

[17]  Giles M. Foody,et al.  Status of land cover classification accuracy assessment , 2002 .

[18]  Shaowen Wang,et al.  Latent spatio-temporal activity structures: a new approach to inferring intra-urban functional regions via social media check-in data , 2016, Geo spatial Inf. Sci..

[19]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[20]  Hao Wu,et al.  Active Collection of Land Cover Sample Data from Geo-Tagged Web Texts , 2015, Remote. Sens..

[21]  J. Lee,et al.  Using Social Media for Emergency Response and Urban Sustainability: A Case Study of the 2012 Beijing Rainstorm , 2015 .

[22]  Shu Peng,et al.  A web-based system for supporting global land cover data production , 2015 .

[23]  Maribel Yasmina Santos,et al.  Concave hull: A k-nearest neighbours approach for the computation of the region occupied by a set of points , 2007, GRAPP.

[24]  Jacinto Estima,et al.  Exploratory analysis of OpenStreetMap for land use classification , 2013, GEOCROWD '13.

[25]  Francisco C. Pereira,et al.  Mining point-of-interest data from social networks for urban land use classification and disaggregation , 2015, Comput. Environ. Urban Syst..

[26]  Steffen Fritz,et al.  Building a hybrid land cover map with crowdsourcing and geographically weighted regression , 2015 .

[27]  Wei Jiang,et al.  Using Social Media to Detect Outdoor Air Pollution and Monitor Air Quality Index (AQI): A Geo-Targeted Spatiotemporal Analysis Framework with Sina Weibo (Chinese Twitter) , 2015, PloS one.

[28]  Giles M. Foody,et al.  Using Volunteered Data in Land Cover Map Validation: Mapping West African Forests , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[29]  Kotaro Iizuka,et al.  Employing crowdsourced geographic data and multi-temporal/multi-sensor satellite imagery to monitor land cover change: A case study in an urbanizing region of the Philippines , 2017, Comput. Environ. Urban Syst..

[30]  Jin Chen,et al.  Global land cover mapping at 30 m resolution: A POK-based operational approach , 2015 .

[31]  Vanessa Frías-Martínez,et al.  Spectral clustering for sensing urban land use using Twitter activity , 2014, Engineering applications of artificial intelligence.

[32]  Krzysztof Janowicz,et al.  Extracting and understanding urban areas of interest using geotagged photos , 2015, Comput. Environ. Urban Syst..