Assessing the accuracy of openstreetmap data in south africa for the purpose of integrating it with authoritative data

The introduction and success of Volunteered Geographic Information (VGI) has gained the interest of National Mapping Agencies (NMAs) worldwide. VGI is geographic information that is freely generated by non-experts and shared using VGI initiatives available on the Internet. The NMA of South Africa i.e. the Chief Directorate: National GeoSpatial Information (CD: NGI) is looking to this volunteer information to maintain their topographical database; however, the main concern is the quality of the data. The purpose of this work is to assess whether it is feasible to use VGI to update the CD: NGI topographical database. The data from OpenStreetMap (OSM), which is one the most successful VGI initiatives, was compared to a reference data set provided by the CD: NGI. Corresponding features between the two data sets were compared in order to assess the various quality aspects. The investigation was split into quantitative and qualitative assessments. The aim of the quantitative assessments was to determine the internal quality of the OSM data. The internal quality elements included the positional accuracy, geometric accuracy, semantic accuracy and the completeness. The first part of the qualitative assessment was concerned with the currency of OSM data between 2006 and 2012. The second part of the assessment was focused on the uniformity of OSM data acquisition across South Africa. The quantitative results showed that both road and building features do not meet the CD: NGI positional accuracy standards. In some areas the positional accuracy of roads are close to the required accuracy. The buildings generally compare well in shape to the CD: NGI buildings. However, there were very few OSM polygon features to assess, thus the results are limited to a small sample. The semantic accuracy of roads was low. Volunteers do not generally classify roads correctly. Instead, many volunteers prefer to class roads generically. The last part of the quantitative results, the completeness, revealed that commercial areas reach high completeness percentages and sometimes exceed the total length of the CD: NGI roads. In residential areas, the percentages are lower and in low urban density areas, the lowest. Nonetheless, the OSM repository has seen significant growth since 2006. The qualitative results showed that because the OSM repository has continued to grow since 2006, the level of currency has increased. In South Africa, the most contributions were made between 2010 and 2012. The OSM data set is thus current after 2012. The amount and type of contributions are however not uniform across the country for various reasons. The number of point contributions was low. Thus, the relationship between the type of contribution and the settlement type could not be made with certainty. Because the OSM data does not meet the CD: NGI spatial accuracy requirements, the two data sets cannot be integrated at the database level. Instead, two options are proposed. The CD: NGI could use the OSM data for detecting changes to the landscape only. The other recommendation is to transform and verify the OSM data. Only those features with a high positional accuracy would then be ingested. The CD: NGI currently has a shortage of staff that is qualified to process ancillary data. Both of the options proposed thus require automated techniques because it is time consuming to perform these tasks manually.

[1]  Eric Schenk,et al.  Crowdsourcing: What can be Outsourced to the Crowd, and Why ? , 2009 .

[2]  Michael F. Goodchild,et al.  NeoGeography and the nature of geographic expertise , 2009, J. Locat. Based Serv..

[3]  David Coleman,et al.  Volunteered Geographic Information: the nature and motivation of produsers , 2009, Int. J. Spatial Data Infrastructures Res..

[4]  Ted Pedersen,et al.  Using Measures of Semantic Relatedness for Word Sense Disambiguation , 2003, CICLing.

[5]  MICHAEL F. GOODCHILD,et al.  A Simple Positional Accuracy Measure for Linear Features , 1997, Int. J. Geogr. Inf. Sci..

[6]  Harold Moellering,et al.  Extending the formal model of a spatial data infrastructure to include volunteered geographical information , 2011 .

[7]  J. A. Glennona Crowdsourcing geographic information for disaster response: a research , 2010 .

[8]  Pascal Neis,et al.  The Street Network Evolution of Crowdsourced Maps: OpenStreetMap in Germany 2007-2011 , 2011, Future Internet.

[9]  S. Elwood Volunteered geographic information: key questions, concepts and methods to guide emerging research and practice , 2008 .

[10]  Joseph Ferreira,et al.  The Future of Spatial Data Infrastructures: Capacity-building for the Emergence of Municipal SDIs , 2007, Int. J. Spatial Data Infrastructures Res..

[11]  Robert B McMaster,et al.  A Statistical Analysis of Mathematical Measures for Linear Simplification , 1986 .

[12]  H. Kelley,et al.  Communication And Persuasion , 1953 .

[13]  Kevin McDougall From silos to networks - will users drive spatial data infrastructures in the future? , 2010 .

[14]  Alex Singleton,et al.  Web mapping 2.0: The neogeography of the GeoWeb , 2008 .

[15]  Eric B. Wolf,et al.  OpenStreetMap Collaborative Prototype, Phase 1 , 2011 .

[16]  Amir Pourabdollah,et al.  Towards an Authoritative OpenStreetMap: Conflating OSM and OS OpenData National Maps' Road Network , 2013, ISPRS Int. J. Geo Inf..

[17]  Ernest J. Wilson,et al.  The Information Revolution and Developing Countries , 2004 .

[18]  Stéphane Roche,et al.  Potential of VGI as a Resource for SDIS in The North/South Context , 2010 .

[19]  M. Goodchild Commentary: whither VGI? , 2008 .

[20]  G. Sithole,et al.  Assessment of the Homogeneity of Volunteered Geographic Information in South Africa , 2012 .

[21]  David J. Coleman,et al.  Potential Contributions and Challenges of VGI for Conventional Topographic Base-Mapping Programs , 2013 .

[22]  K. Mcdougall The potential of citizen volunteered spatial information for building SDI , 2009 .

[23]  Alan M. MacEachren,et al.  Compactness of Geographic Shape: Comparison and Evaluation of Measures , 1985 .

[24]  Derrick G. Kourie,et al.  Perceptions of Virtual Globes, Volunteered Geographical Information and Spatial Data Infrastructures , 2010 .

[25]  J. F. Hangouet COMPUTATION OF THE HAUSDORFF DISTANCE BETWEEN PLANE VECTOR POLYLINES , 2008 .

[26]  Richard L. Church,et al.  UC Office of the President Recent Work Title An efficient measure of compactness for two-dimensional shapes and its application in regionalization problems Permalink , 2013 .

[27]  Claude Caron,et al.  How to Improve the Social Utility Value of Geographic Information Systems for French Local Governments? A Delphi Study , 2003 .

[28]  François Vauglin Modèles statistiques des imprécisions géométriques des objets géographiques linéaires , 1997 .

[29]  Guillaume Touya,et al.  Quality Assessment of the French OpenStreetMap Dataset , 2010, Trans. GIS.

[30]  I. Bishop,et al.  Spatial data infrastructures for cities in developing countries Lessons from the Bangkok experience , 2000 .

[31]  Christian Fuchs,et al.  Africa and the digital divide , 2008, Telematics Informatics.

[32]  D. Sui Tobler's First Law of Geography: A Big Idea for a Small World? , 2004 .

[33]  Frank O. Ostermann,et al.  A conceptual workflow for automatically assessing the quality of volunteered geographic information for crisis management , 2011 .

[34]  David Fairbairn,et al.  Using Geometric Properties to Evaluate Possible Integration of Authoritative and Volunteered Geographic Information , 2013, ISPRS Int. J. Geo Inf..

[35]  D Fairbairn,et al.  User generated content and formal data sources for integrating geospatial data , 2011 .

[36]  Miriam J. Metzger,et al.  The credibility of volunteered geographic information , 2008 .

[37]  Glen Hart,et al.  When worlds collide: combining Ordnance Survey and Open Street Map data , 2010 .

[38]  Abbas Rajabifard,et al.  The role of sub‐national government and the private sector in future spatial data infrastructures , 2006, Int. J. Geogr. Inf. Sci..

[39]  Julian Smit,et al.  A review of the status of spatial data infrastructure implementation in Africa , 2010, South Afr. Comput. J..

[40]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[41]  Ioannis Pitas,et al.  FAST SHAPE MATCHING USING THE HAUSDORFF DISTANCE , 2006 .

[42]  Pascal Neis,et al.  New Applications based on collaborative geodata – the case of Routing , 2008 .

[43]  S. S. Tickodri-Togboa,et al.  OPPORTUNITIES AND CHALLENGES FOR SDI DEVELOPMENT IN DEVELOPING COUNTRIES - A CASE STUDY OF UGANDA , 2004 .

[44]  Cristina Gouveia,et al.  New approaches to environmental monitoring: the use of ICT to explore volunteered geographic information , 2008 .

[45]  P. Norris The Worldwide Digital Divide: Information Poverty, the Internet and Development , 2000 .

[46]  David Fairbairn,et al.  Assessing the accuracy of 'crowdsourced' data and its integration with official spatial data sets , 2010 .

[47]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[48]  Linda Martindale Bridging the digital divide in South Africa , 2002 .

[49]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .

[50]  Tomi Kauppinen,et al.  Tracking Editing Processes in Volunteered Geographic Information: The Case of OpenStreetMap , 2011, COSIT 2011.

[51]  Bertram C. Bruce,et al.  Reconceptualizing the role of the user of spatial data infrastructure , 2008 .

[52]  Derrick G. Kourie,et al.  Thoughts on Exploiting Instability in Lattices for Assessing the Discrimination Adequacy of a Taxonomy , 2010, CLA.

[53]  M. Goodchild,et al.  Researching Volunteered Geographic Information: Spatial Data, Geographic Research, and New Social Practice , 2012 .

[54]  Robert Jeansoulin,et al.  Towards spatial data quality information analysis tools for experts assessing the fitness for use of spatial data , 2007, Int. J. Geogr. Inf. Sci..

[55]  Marinos Kavouras,et al.  Semantifying OpenStreetMap , 2012, Terra Cognita@ISWC.

[56]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[57]  Peter A. Johnson,et al.  Situating the Adoption of VGI by Government , 2013 .

[58]  Abbas Rajabifard,et al.  Spatially enabling governments through SDI implementation , 2008, Int. J. Geogr. Inf. Sci..

[59]  Derrick G. Kourie,et al.  Challenges for quality in volunteered geographical information , 2011 .

[60]  G. Sithole,et al.  Assessing the Quality of OpenStreetMap Data in South Africa in Reference to National Mapping Standards , 2014 .

[61]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[62]  K. Keniston,et al.  The Four Digital Divides , 2003 .

[63]  Kpalma Kidiyo,et al.  A Survey of Shape Feature Extraction Techniques , 2008 .

[64]  Vyron Antoniou,et al.  User generated spatial content: an analysis of the phenomenon and its challenges for mapping agencies , 2011 .

[65]  Max J. Egenhofer,et al.  Topological Error Correcting in GIS , 1997, SSD.

[66]  Steffen Fritz,et al.  Assessing the Accuracy of Volunteered Geographic Information arising from Multiple Contributors to an Internet Based Collaborative Project , 2013, Trans. GIS.