On predicting and improving the quality of Volunteer Geographic Information projects

Initiatives that rely upon the contributions of volunteers to reach a specific goal are growing more and more with the success of Web 2.0–interactive applications. Also scientific projects are testing and exploiting volunteers' collaboration, but the quality of information obtained with this approach is often puzzling. This paper offers a rich overview of many scientific projects where geographic contributions are committed to volunteers, to the aim of defining strategies to improve information quality. By describing real examples of Volunteer Geographic Information (VGI), the contribution establishes a categorization based on the characteristics of the information, tasks, and scopes of the projects. After a discussion on the relationships of categories and VGI quality, the paper analyses techniques to improve the quality of volunteered information according to the moment of its assessment (i.e., ex ante, ex post, or both with respect to information creation). The paper outlines the main limitations of the different approaches and indicates some guidelines for future developments.

[1]  A. Lin Knowledge Assets: Securing Competitive Advantage in the Information Economy , 2001 .

[2]  Steve Kelling,et al.  Data-Intensive Science: A New Paradigm for Biodiversity Studies , 2009 .

[3]  Giles M. Foody,et al.  Rating crowdsourced annotations: evaluating contributions of variable quality and completeness , 2014, Int. J. Digit. Earth.

[4]  Gloria Bordogna,et al.  Alpine Glaciology: An Historical Collaboration between Volunteers and Scientists and the Challenge Presented by an Integrated Approach , 2013, ISPRS Int. J. Geo Inf..

[5]  M. Kelly,et al.  Citizen Science in the Age of Neogeography: Utilizing Volunteered Geographic Information for Environmental Monitoring , 2012 .

[6]  David Coleman,et al.  Volunteered Geographic Information: the nature and motivation of produsers , 2009, Int. J. Spatial Data Infrastructures Res..

[7]  Martin J. Eppler,et al.  Conceptualizing Information Quality: A Review of Information Quality Frameworks from the Last Ten Years , 2000, IQ.

[8]  David Fairbairn,et al.  Assessing the accuracy of 'crowdsourced' data and its integration with official spatial data sets , 2010 .

[9]  Thomas J. Stohlgren,et al.  Assessing citizen science data quality: an invasive species case study , 2011 .

[10]  Hans-Jörg Stark Quality assurance of crowdsourced geocoded address-data within OpenAddresses. Concepts and implementation , 2010 .

[11]  Michael F. Goodchild,et al.  Citizens as Voluntary Sensors: Spatial Data Infrastructure in the World of Web 2.0 , 2007, Int. J. Spatial Data Infrastructures Res..

[12]  John Riedl,et al.  Creating, destroying, and restoring value in wikipedia , 2007, GROUP.

[13]  Frank O. Ostermann,et al.  A conceptual workflow for automatically assessing the quality of volunteered geographic information for crisis management , 2011 .

[14]  Robert Jeansoulin,et al.  Fundamentals of Spatial Data Quality , 2006 .

[15]  Gloria Bordogna,et al.  A linguistic decision making approach to assess the quality of volunteer geographic information for citizen science , 2014, Inf. Sci..

[16]  Carlo Batini,et al.  The Many Faces of Information and their Impact on Information Quality , 2012, ICIQ.

[17]  Wen Hu,et al.  Are you contributing trustworthy data?: the case for a reputation system in participatory sensing , 2010, MSWIM '10.

[18]  Sean W. Smith,et al.  Quality in Internet Collective Goods : Zealots and Good Samaritans in the Case of Wikipedia , 2005 .

[19]  M. Boisot Knowledge Assets: Securing Competitive Advantage in the Information Economy , 1998 .

[20]  Jeremy Ginsberg,et al.  Detecting influenza epidemics using search engine query data , 2009, Nature.

[21]  Michael F. Goodchild,et al.  Assuring the quality of volunteered geographic information , 2012 .

[22]  Michael F. Goodchild,et al.  NeoGeography and the nature of geographic expertise , 2009, J. Locat. Based Serv..

[23]  Robert Jeansoulin,et al.  Towards spatial data quality information analysis tools for experts assessing the fitness for use of spatial data , 2007, Int. J. Geogr. Inf. Sci..

[24]  E. Hand Volunteer army catches interstellar dust grains , 2010 .

[25]  Candie C. Wilderman,et al.  Public Participation in Scientific Research: Defining the Field and Assessing Its Potential for Informal Science Education. A CAISE Inquiry Group Report. , 2009 .

[26]  Axel Bruns,et al.  Towards Produsage: Futures for User-Led Content Production , 2006 .

[27]  Kevin Crowston,et al.  Mechanisms for Data Quality and Validation in Citizen Science , 2011, 2011 IEEE Seventh International Conference on e-Science Workshops.

[28]  Frank O. Ostermann,et al.  Digital Earth from vision to practice: making sense of citizen-generated content , 2012, Int. J. Digit. Earth.

[29]  Lora Aroyo,et al.  Crowdsourcing in the cultural heritage domain: opportunities and challenges , 2011, C&T.

[30]  Peter Mooney,et al.  Characteristics of Heavily Edited Objects in OpenStreetMap , 2012, Future Internet.

[31]  E. Hand,et al.  Citizen science: People power , 2010, Nature.

[32]  Irina Shklovski,et al.  Respectfully yours in safety and service: Emergency management & social media evangelism , 2010, ISCRAM.

[33]  M. Haklay Citizen Science and Volunteered Geographic Information: Overview and Typology of Participation , 2013 .

[34]  Enrique Herrera-Viedma,et al.  Evaluating the information quality of Web sites: A methodology based on fuzzy computing with words , 2006, J. Assoc. Inf. Sci. Technol..

[35]  Antoon Bronselaer,et al.  Consistently Handling Geographical User Data - Context-Dependent Detection of Co-located POIs , 2010, IPMU.