Assessing Crowdsourced POI Quality: Combining Methods Based on Reference Data, History, and Spatial Relations

With the development of location-aware devices and the success and high use of Web 2.0 techniques, citizens are able to act as sensors by contributing geographic information. In this context, data quality is an important aspect that should be taken into account when using this source of data for different purposes. The goal of the paper is to analyze the quality of crowdsourced data and to study its evolution over time. We propose two types of approaches: (1) use the intrinsic characteristics of the crowdsourced datasets; or (2) evaluate crowdsourced Points of Interest (POIs) using external datasets (i.e., authoritative reference or other crowdsourced datasets), and two different methods for each approach. The potential of the combination of these approaches is then demonstrated, to overcome the limitations associated with each individual method. In this paper, we focus on POIs and places coming from the very successful crowdsourcing project: OpenStreetMap. The results show that the proposed approaches are complementary in assessing data quality. The positive results obtained for data matching show that the analysis of data quality through automatic data matching is possible but considerable effort and attention are needed for schema matching given the heterogeneity of OSM and the representation of authoritative datasets. For the features studied, it can be noted that change over time is sometimes due to disagreements between contributors, but in most cases the change improves the quality of the data.

[1]  Alexander Zipf,et al.  Fine-resolution population mapping using OpenStreetMap points-of-interest , 2014, Int. J. Geogr. Inf. Sci..

[2]  Vyron Antoniou,et al.  User generated spatial content: an analysis of the phenomenon and its challenges for mapping agencies , 2011 .

[3]  Stéphane Roche,et al.  Quantifying the Significance of Semantic Landmarks in Familiar and Unfamiliar Environments , 2015, COSIT.

[4]  Vyron Antoniou,et al.  MEASURES AND INDICATORS OF VGI QUALITY: AN OVERVIEW , 2015 .

[5]  Francisco C. Pereira,et al.  Mining point-of-interest data from social networks for urban land use classification and disaggregation , 2015, Comput. Environ. Urban Syst..

[6]  Hansi Senaratne,et al.  A review of volunteered geographic information quality assessment methods , 2017, Int. J. Geogr. Inf. Sci..

[7]  Anthony Stefanidis,et al.  Assessing Completeness and Spatial Error of Features in Volunteered Geographic Information , 2013, ISPRS Int. J. Geo Inf..

[8]  Peter Mooney,et al.  Crowd-sourced geographic information use in government , 2014 .

[9]  Hans-Jörg Stark Quality Assessment of Volunteered Geographic Information using Open Web Map Services within OpenAddresses , 2011 .

[10]  Anil Bawa-Cavia,et al.  Sensing the Urban Using location-based social network data in urban analysis Working , 2011 .

[11]  Gloria Bordogna,et al.  "Contextualized VGI" Creation and Management to Cope with Uncertainty and Imprecision , 2016, ISPRS Int. J. Geo Inf..

[12]  Anne Ruas,et al.  Knowledge formalization for vector data matching using belief theory , 2015, J. Spatial Inf. Sci..

[13]  Carsten Keßler,et al.  Trust as a Proxy Measure for the Quality of Volunteered Geographic Information in the Case of OpenStreetMap , 2013, AGILE Conf..

[14]  Simon Scheider,et al.  Semantic Referencing of Geosensor Data and Volunteered Geographic Information , 2011, Geospatial Semantics and the Semantic Web.

[15]  Filipe Rodrigues,et al.  Estimating Disaggregated Employment Size from Points-of-Interest and Census Data: From Mining the Web to Model Implementation and Visualization , 2013 .

[16]  Alexander Zipf,et al.  Defining Fitness-for-Use for Crowdsourced Points of Interest (POI) , 2016, ISPRS Int. J. Geo Inf..

[17]  Bolei Zhou,et al.  Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[18]  Guillaume Touya,et al.  Quality Assessment of the French OpenStreetMap Dataset , 2010, Trans. GIS.

[19]  M. Goodchild Commentary: whither VGI? , 2008 .

[20]  Leysia Palen,et al.  From Crowdsourced Mapping to Community Mapping: The Post-earthquake Work of OpenStreetMap Haiti , 2014, COOP.

[21]  Guillaume Touya,et al.  Level of Details Harmonization Operations in OpenStreetMap Based Large Scale Maps , 2017 .

[22]  Georg Gartner,et al.  Social Media Data as a Source for Studying People’s Perception and Knowledge of Environments , 2013 .

[23]  Tim O'Reilly,et al.  What is Web 2.0: Design Patterns and Business Models for the Next Generation of Software , 2007 .

[24]  Mike Jackson,et al.  On Data Quality Assurance and Conflation Entanglement in Crowdsourcing for Environmental Studies , 2017, ISPRS Int. J. Geo Inf..

[25]  Pascal Neis,et al.  A Comprehensive Framework for Intrinsic OpenStreetMap Quality Analysis , 2014, Trans. GIS.

[26]  Frank O. Ostermann,et al.  Digital Earth from vision to practice: making sense of citizen-generated content , 2012, Int. J. Digit. Earth.

[27]  P. Mooney,et al.  Comparison of the accuracy of OpenStreetMap for Ireland with Google Maps and Bing Maps , 2010 .

[28]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[29]  William A. Mackaness,et al.  A functional perspective on map generalisation , 2009, Comput. Environ. Urban Syst..

[30]  Rodolphe Devillers,et al.  Improving Volunteered Geographic Information Quality Using a Tag Recommender System: The Case of OpenStreetMap , 2015, OpenStreetMap in GIScience.

[31]  M. Exel,et al.  The impact of crowdsourcing on spatial data quality indicators , 2010 .

[32]  Pascal Neis,et al.  Quality assessment for building footprints data on OpenStreetMap , 2014, Int. J. Geogr. Inf. Sci..

[33]  Ross Purves,et al.  Exploring place through user-generated content: Using Flickr tags to describe city cores , 2010, J. Spatial Inf. Sci..

[34]  Lucy Bastin,et al.  Usability of VGI for validation of land cover maps , 2015, Int. J. Geogr. Inf. Sci..

[35]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[36]  Guillaume Touya,et al.  Quality analysis of the Parisian OSM toponyms evolution , 2016 .

[37]  Giovanni Quattrone,et al.  There's No Such Thing as the Perfect Map: Quantifying Bias in Spatial Crowd-sourcing Datasets , 2015, CSCW.

[38]  Guillaume Touya,et al.  Detecting Level-of-Detail Inconsistencies in Volunteered Geographic Information Data Sets , 2013, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[39]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .