Improving Volunteered Geographic Information Quality Using a Tag Recommender System: The Case of OpenStreetMap

Studies have analyzed the quality of volunteered geographic information (VGI) datasets, assessing the positional accuracy of features and the semantic accuracy of the attributes. While it has been shown that VGI can, in some contexts, reach a high positional accuracy, these studies have also highlighted a large spatial heterogeneity in positional accuracy and completeness, but also concerning the semantics of the objects. Such high semantic heterogeneity of VGI datasets becomes a significant obstacle to a number of possible uses that could be made of the data. This paper proposes an approach for both improving the semantic quality and reducing the semantic heterogeneity of VGI datasets. The improvement of the semantic quality is achieved by using a tag recommender system, called OSMantic, which automatically suggests relevant tags to contributors during the editing process. Such an approach helps contributors find the most appropriate tags for a given object, hence reducing the overall dataset semantic heterogeneity. The approach was implemented into a plugin for the Java OpenStreetMap editor (JOSM) and different examples illustrate how this plugin can be used to improve the quality of VGI data. This plugin has been tested by OSM contributors and evaluated using an online questionnaire. Results of the evaluation suggest a high level of satisfaction from users and are discussed.

[1]  Krzysztof Janowicz,et al.  Analyzing the Spatial-Semantic Interaction of Points of Interest in Volunteered Geographic Information , 2011, COSIT.

[2]  Hartwig H. Hochmair,et al.  A Conceptual Model for Analyzing Contribution Patterns in the Context of VGI , 2013, Progress in Location-Based Services.

[3]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[4]  Angi Voß,et al.  A Comparison of the Street Networks of Navteq and OSM in Germany , 2011, AGILE Conf..

[5]  Michela Bertolotto,et al.  Geographic knowledge extraction and semantic similarity in OpenStreetMap , 2013, Knowledge and Information Systems.

[6]  M. Goodchild,et al.  Researching Volunteered Geographic Information: Spatial Data, Geographic Research, and New Social Practice , 2012 .

[7]  D. Altman,et al.  Statistics notes: Cronbach's alpha , 1997 .

[8]  Till Mossakowski,et al.  DO-ROAM: Activity-Oriented Search and Navigation with OpenStreetMap , 2011, GeoS.

[9]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[10]  Yuen Ren Chao,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology , 1950 .

[11]  Mark D. Smucker,et al.  Information Retrieval , 2017, Lecture Notes in Computer Science.

[12]  Miriam J. Metzger,et al.  The credibility of volunteered geographic information , 2008 .

[13]  Guillaume Touya,et al.  Quality Assessment of the French OpenStreetMap Dataset , 2010, Trans. GIS.

[14]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[15]  David Coleman,et al.  Volunteered Geographic Information: the nature and motivation of produsers , 2009, Int. J. Spatial Data Infrastructures Res..

[16]  Peter Mooney,et al.  Characteristics of Heavily Edited Objects in OpenStreetMap , 2012, Future Internet.

[17]  M. Haklay Citizen Science and Volunteered Geographic Information: Overview and Typology of Participation , 2013 .

[18]  Peter Mooney,et al.  The Annotation Process in OpenStreetMap , 2012, Trans. GIS.

[19]  Carsten Keßler,et al.  Trust as a Proxy Measure for the Quality of Volunteered Geographic Information in the Case of OpenStreetMap , 2013, AGILE Conf..

[20]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[21]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .

[22]  Barry Smith,et al.  Do Mountains Exist? Towards an Ontology of Landforms , 2003 .

[23]  Pascal Neis,et al.  Analyzing the Contributor Activity of a Volunteered Geographic Information Project - The Case of OpenStreetMap , 2012, ISPRS Int. J. Geo Inf..

[24]  Pascal Neis,et al.  A Comprehensive Framework for Intrinsic OpenStreetMap Quality Analysis , 2014, Trans. GIS.

[25]  D. Bégin,et al.  ASSESSING VOLUNTEERED GEOGRAPHIC INFORMATION (VGI) QUALITY BASED ON CONTRIBUTORS' MAPPING BEHAVIOURS , 2013 .

[26]  G. Gartner,et al.  Lecture Notes in Geoinformation and Cartography , 2006 .

[27]  Li Chen,et al.  A user-centric evaluation framework for recommender systems , 2011, RecSys '11.

[28]  Jens Lehmann,et al.  LinkedGeoData: Adding a Spatial Dimension to the Web of Data , 2009, SEMWEB.

[29]  Wolfgang G. Stock,et al.  "Power tags" in information retrieval , 2010, Libr. Hi Tech.

[30]  Yizhou Sun,et al.  P-Rank: a comprehensive structural similarity measure over information networks , 2009, CIKM.