Estimating Completeness of VGI Datasets by Analyzing Community Activity Over Time Periods

Due to the dynamic nature and heterogeneity of Volunteered Geographic Information (VGI) datasets a crucial question isu concerned with geographic data quality. Among others, one of the main quality categories addresses data completeness. Most of the previous work tackles this question by comparing VGI datasets to external reference datasets. Although such comparisons give valuable insights, questions about the quality of the external dataset and syntactic as well as semantic differences arise. This work proposes a novel approach for internal estimation of regional data completeness of VGI datasets by analyzing the changes in community activity over time periods. It builds on empirical evidence that completeness of selected feature classes in distinct geographical regions may only be achieved when community activity in the selected region runs through a well-defined sequence of activity stages beginning at the start stage, continuing with some years of growth and finally reaching saturation. For the retrospective calculation of activity stages, the annual shares of new features in combination with empirically founded heuristic rules for stage transitions are used. As a proof-of-concept the approach is applied to the OpenStreetMap History dataset by analyzing activity stages for 12 representative metropolitan areas. Results give empirical evidence that reaching the saturation stage is an adequate indication for a certain degree of data completeness in the selected regions. Results also show similarities and differences of community activity in the different cities, revealing that community activity stages follow similar rules but with significant temporal variances.

[1]  Pascal Neis,et al.  Comparison of Volunteered Geographic Information Data Contributions and Community Development for Selected World Regions , 2013, Future Internet.

[2]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[3]  Robert Hecht,et al.  Measuring Completeness of Building Footprints in OpenStreetMap over Space and Time , 2013, ISPRS Int. J. Geo Inf..

[4]  Vito Latora,et al.  Elementary processes governing the evolution of road networks , 2012, Scientific Reports.

[5]  Patrick Weber,et al.  OpenStreetMap: User-Generated Street Maps , 2008, IEEE Pervasive Computing.

[6]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[7]  Monika Sester,et al.  Quality Analysis of OpenStreetMap Data Based on Application Needs , 2011, Cartogr. Int. J. Geogr. Inf. Geovisualization.

[8]  Pascal Neis,et al.  A Comprehensive Framework for Intrinsic OpenStreetMap Quality Analysis , 2014, Trans. GIS.

[9]  Michela Bertolotto,et al.  Analysing the growth of OpenStreetMap networks , 2013 .

[10]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[11]  Hartwig H. Hochmair,et al.  A Conceptual Model for Analyzing Contribution Patterns in the Context of VGI , 2013, Progress in Location-Based Services.

[12]  Guillaume Touya,et al.  Quality Assessment of the French OpenStreetMap Dataset , 2010, Trans. GIS.

[13]  Adam C. Winstanley,et al.  Towards quality metrics for OpenStreetMap , 2010, GIS '10.

[14]  Pascal Neis,et al.  The Street Network Evolution of Crowdsourced Maps: OpenStreetMap in Germany 2007-2011 , 2011, Future Internet.

[15]  Jamal Jokar Arsanjani,et al.  The emergence and evolution of OpenStreetMap: a cellular automata approach , 2015, Int. J. Digit. Earth.

[16]  Ed H. Chi,et al.  The singularity is not near: slowing growth of Wikipedia , 2009, Int. Sym. Wikis.

[17]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .

[18]  Julian Hagenauer,et al.  Mining urban land-use patterns from volunteered geographic information by means of genetic algorithms and artificial neural networks , 2012, Int. J. Geogr. Inf. Sci..

[19]  Dennis Zielstra,et al.  Comparative Study of Pedestrian Accessibility to Transit Stations Using Free and Proprietary Network Data , 2011 .