Assessing the impact of demographic characteristics on spatial error in volunteered geographic information features

The proliferation of volunteered geographic information (VGI), such as OpenStreetMap (OSM) enabled by technological advancements, has led to large volumes of user-generated geographical content. While this data is becoming widely used, the understanding of the quality characteristics of such data is still largely unexplored. An open research question is the relationship between demographic indicators and VGI quality. While earlier studies have suggested a potential relationship between VGI quality and population density or socio-economic characteristics of an area, such relationships have not been rigorously explored, and mainly remained qualitative in nature. This paper addresses this gap by quantifying the relationship between demographic properties of a given area and the quality of VGI contributions. We study specifically the demographic characteristics of the mapped area and its relation to two dimensions of spatial data quality, namely positional accuracy and completeness of the corresponding VGI contributions with respect to OSM using the Denver (Colorado, US) area as a case study. We use non-spatial and spatial analysis techniques to identify potential associations among demographics data and the distribution of positional and completeness errors found within VGI data. Generally, the results of our study show a lack of statistically significant support for the assumption that demographic properties affect the positional accuracy or completeness of VGI. While this research is focused on a specific area, our results showcase the complex nature of the relationship between VGI quality and demographics, and highlights the need for a better understanding of it. By doing so, we add to the debate of how demographics impact on the quality of VGI data and lays the foundation to further work.

[1]  P. Moran Notes on continuous stochastic phenomena. , 1950, Biometrika.

[2]  P. J. Clark,et al.  Distance to Nearest Neighbor as a Measure of Spatial Relationships in Populations , 1954 .

[3]  R. Reyment,et al.  Statistics and Data Analysis in Geology. , 1988 .

[4]  S. J. Press,et al.  Choosing between Logistic Regression and Discriminant Analysis , 1978 .

[5]  C. J. Huberty,et al.  Issues in the use and interpretation of discriminant analysis , 1984 .

[6]  FRANKLIN J. JAMES,et al.  A New Generalized “Exposure-Based” Segregation Index , 1986 .

[7]  J. Burt,et al.  Elementary statistics for geographers , 1995 .

[8]  Peter D. Wentzell,et al.  Comments on the relationship between principal components analysis and weighted linear regression for bivariate data sets , 1996 .

[9]  M. Goodchild,et al.  Geographic Information Systems and Science (second edition) , 2001 .

[10]  David Wheeler,et al.  Multicollinearity and correlation among local regression coefficients in geographically weighted regression , 2005, J. Geogr. Syst..

[11]  S. Graham Software-sorted geographies , 2005 .

[12]  Patrick Kosciuk Biometrics: possible safe haven or lost cause? , 2005, CSOC.

[13]  Wsd Wong,et al.  Statistical Analysis of Geographic Information with ArcView GIS And ArcGIS , 2005 .

[14]  Stacey Kuznetsov,et al.  Motivations of contributors to Wikipedia , 2006, CSOC.

[15]  Naveen Donthu,et al.  Using the technology acceptance model to explain how attitudes determine Internet usage: The role of perceived access barriers and demographics , 2006 .

[16]  R. Sieber Public Participation Geographic Information Systems: A Literature Review and Framework , 2006 .

[17]  Michael J de Smith,et al.  Geospatial Analysis: A Comprehensive Guide to Principles, Techniques and Software Tools , 2007 .

[18]  Matthew Graham,et al.  The Creative Reconstruction of the Internet: Google and the Privatization of Cyberspace and Digiplace , 2007 .

[19]  M. Goodchild Citizens as sensors: the world of volunteered geography , 2007 .

[20]  Graham Vickery,et al.  Participative Web And User-Created Content: Web 2.0 Wikis and Social Networking , 2007 .

[21]  Matthew Zook,et al.  Mapping DigiPlace: Geocoded Internet Data and the Representation of Place , 2007 .

[22]  Katy Börner,et al.  Analyzing and visualizing the semantic coverage of Wikipedia and its authors , 2005, Complex..

[23]  David L. Tulloch Is VGI participation? From vernal pools to video games , 2008 .

[24]  Oded Nov,et al.  Exploring motivations for contributing to open source initiatives: The roles of contribution context and personal values , 2008, Comput. Hum. Behav..

[25]  Daniel Z. Sui,et al.  The wikification of GIS and its consequences: Or Angelina Jolie's new tattoo and the future of GIS , 2008, Comput. Environ. Urban Syst..

[26]  Patrick Weber,et al.  OpenStreetMap: User-Generated Street Maps , 2008, IEEE Pervasive Computing.

[27]  S. Elwood Volunteered geographic information: key questions, concepts and methods to guide emerging research and practice , 2008 .

[28]  Alex Singleton,et al.  Linking Social Deprivation and Digital Exclusion in England , 2009 .

[29]  Matthew Zook,et al.  Placemarks and waterlines: Racialized cyberscapes in post-Katrina Google Earth , 2009 .

[30]  Andrew Hudson-Smith,et al.  NeoGeography and Web 2.0: concepts, tools and applications , 2009, J. Locat. Based Serv..

[31]  David Coleman,et al.  Volunteered Geographic Information: the nature and motivation of produsers , 2009, Int. J. Spatial Data Infrastructures Res..

[32]  Adam C. Winstanley,et al.  Towards quality metrics for OpenStreetMap , 2010, GIS '10.

[33]  P. Mooney,et al.  Comparison of the accuracy of OpenStreetMap for Ireland with Google Maps and Bing Maps , 2010 .

[34]  M. Haklay How Good is Volunteered Geographical Information? A Comparative Study of OpenStreetMap and Ordnance Survey Datasets , 2010 .

[35]  L. Anselin Local Indicators of Spatial Association—LISA , 2010 .

[36]  Guillaume Touya,et al.  Quality Assessment of the French OpenStreetMap Dataset , 2010, Trans. GIS.

[37]  David Fairbairn,et al.  Assessing the accuracy of 'crowdsourced' data and its integration with official spatial data sets , 2010 .

[38]  A. Zipf,et al.  A Comparative Study of Proprietary Geodata and Volunteered Geographic Information for Germany , 2010 .

[39]  S. Elwood Geographic information science: emerging research on the societal implications of the geospatial web , 2010 .

[40]  Vyron Antoniou,et al.  How Many Volunteers Does it Take to Map an Area Well? The Validity of Linus’ Law to Volunteered Geographic Information , 2010 .

[41]  Alexander Zipf,et al.  Generating web-based 3D City Models from OpenStreetMap: The current situation in Germany , 2010, Comput. Environ. Urban Syst..

[42]  Michael T Braun,et al.  Exploratory regression analysis: A tool for selecting models and determining predictor importance , 2011, Behavior research methods.

[43]  Christine Marston,et al.  Education Policy and School Segregation: A Study of the Denver Metropolitan Region , 2011 .

[44]  L. Sugarbaker,et al.  The National Map , 2011 .

[45]  Oded Nov,et al.  Technology-Mediated Citizen Science Participation: A Motivational Model , 2011, ICWSM.

[46]  Greg G. Brown,et al.  An evaluation of the use of points versus polygons in public participation geographic information systems using quasi-experimental design and Monte Carlo simulation , 2012, Int. J. Geogr. Inf. Sci..

[47]  Pascal Neis,et al.  The Street Network Evolution of Crowdsourced Maps: OpenStreetMap in Germany 2007-2011 , 2011, Future Internet.

[48]  Eric B. Wolf,et al.  Structures data collection for the national map using volunteered geographic information , 2012 .

[49]  Pascal Neis,et al.  Analyzing the Contributor Activity of a Volunteered Geographic Information Project - The Case of OpenStreetMap , 2012, ISPRS Int. J. Geo Inf..

[50]  Claire Ellul,et al.  Assessing Data Completeness of VGI through an Automated Matching Procedure for Linear Data , 2012, Trans. GIS.

[51]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[52]  Peter Mooney,et al.  Characteristics of Heavily Edited Objects in OpenStreetMap , 2012, Future Internet.

[53]  M. Goodchild,et al.  Crowdsourcing Geographic Knowledge: Volunteered Geographic Information (VGI) in Theory and Practice , 2012 .

[54]  David Fairbairn,et al.  Using Geometric Properties to Evaluate Possible Integration of Authoritative and Volunteered Geographic Information , 2013, ISPRS Int. J. Geo Inf..

[55]  M. Goodchild,et al.  Prospects for VGI Research and the Emerging Fourth Paradigm , 2013 .

[56]  M. Goodchild,et al.  Spatial, temporal, and socioeconomic patterns in the use of Twitter and Flickr , 2013 .

[57]  Dennis Zielstra,et al.  Development and Completeness of Points Of Interest in Free and Proprietary Data Sets: A Florida Case Study , 2013 .

[58]  Anthony Stefanidis,et al.  Assessing Completeness and Spatial Error of Features in Volunteered Geographic Information , 2013, ISPRS Int. J. Geo Inf..

[59]  J. Kent,et al.  Spatial patterns and demographic indicators of effective social media content during theHorsethief Canyon fire of 2012 , 2013 .

[60]  Nedjeljko Frančula,et al.  ISPRS International Journal of Geo-Information , 2016 .