Enhancing the Quality and Trust of Citizen Science Data

The Internet, Web 2.0 and Social Networking technologies are enabling citizens to actively participate in “citizen science” projects by contributing data to scientific programs. However, the limited expertise of contributors can lead to poor quality or misleading data being submitted. Subsequently, the scientific community often perceive citizen science data as not worthy of being used in serious scientific research. In this paper, we describe a technological framework that combines data quality improvements and trust metrics to enhance the reliability of citizen science data. We describe how trust models provide a simple and effective mechanism for measuring the reliability of community-generated data. We also describe filtering services that remove untrustworthy data, and enable confident re-use of the data. The resulting services are evaluated in the context of the Coral Watch project which uses volunteers to collect data on coral reef bleaching.

[1]  Raph Levien,et al.  Attack-Resistant Trust Metrics , 2009, Computing with Social Trust.

[2]  Rohana Rezel,et al.  SWE-FE: Extending folksonomies to the Sensor Web , 2010, 2010 International Symposium on Collaborative Technologies and Systems.

[3]  P. Elzer,et al.  project overview , 2002 .

[4]  Justin Marshall,et al.  Coral Reefs and Climate Change: The Guide for Education and Awareness , 2012 .

[5]  David G. Delaney,et al.  Marine invasive species: validation of citizen science and implications for national monitoring networks , 2007, Biological Invasions.

[6]  A. W. Galloway,et al.  The Reliability of Citizen Science: A Case Study of Oregon White Oak Stand Surveys , 2006 .

[7]  John M. Charnes,et al.  Proceedings of the 34th conference on Winter simulation: exploring new frontiers , 2002 .

[8]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[9]  Jordi Sabater-Mir,et al.  Review on Computational Trust and Reputation Models , 2005, Artificial Intelligence Review.

[10]  Anne H. H. Ngu,et al.  QoS computation and policing in dynamic web service selection , 2004, WWW Alt. '04.

[11]  C. Lintott,et al.  Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey , 2008, 0804.4483.

[12]  S. M. Evans,et al.  The value of marine ecological data collected by volunteers , 2003 .

[13]  Amit P. Sheth,et al.  Web service technologies and their synergy with simulation , 2002, Proceedings of the Winter Simulation Conference.

[14]  Paolo Massa,et al.  Trustlet, Open Research on Trust Metrics , 2001, BIS.

[15]  Richard Y. Wang,et al.  Quality information and knowledge , 1998 .

[16]  Martin Bichler,et al.  Design science in information systems research , 2006, Wirtschaftsinf..

[17]  Jennifer Golbeck,et al.  Computing and Applying Trust in Web-based Social Networks , 2005 .

[18]  Jennifer Golbeck,et al.  Trust and nuanced profile similarity in online social networks , 2009, TWEB.

[19]  Diane M. Strong,et al.  Process-Embedded Data Integrity , 2004, J. Database Manag..

[20]  Diane M. Strong,et al.  Information quality benchmarks: product and service performance , 2002, CACM.

[21]  Laurent Vercouter,et al.  A Service-Oriented Architecture to Support Agent Reputation Models Interoperability , 2008, WONTO.

[22]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[23]  Elizabeth Lyon,et al.  Open Science at Web-Scale: Optimising Participation and Predictive Potential , 2009 .

[24]  Craig E. Wills,et al.  Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters , 2004 .

[25]  Richard Y. Wang,et al.  Data quality assessment , 2002, CACM.

[26]  J. Silvertown A new dawn for citizen science. , 2009, Trends in ecology & evolution.

[27]  R. Bonney,et al.  Citizen Science as a Tool for Conservation in Residential Ecosystems , 2007 .

[28]  E. Paulos Designing for Doubt Citizen Science and the Challenge of Change , 2009 .