The Verification of Ecological Citizen Science Data: Current Approaches and Future Possibilities

Citizen science schemes enable ecological data collection over very large spatial and temporal scales, producing datasets of high value for both pure and applied research. However, the accuracy of citizen science data is often questioned, owing to issues surrounding data quality and verification, the process by which records are checked after submission for correctness. Verification is a critical process for ensuring data quality and for increasing trust in such datasets, but verification approaches vary considerably between schemes. Here, we systematically review approaches to verification across ecological citizen science schemes that feature in published research, aiming to identify the options available for verification, and to examine factors that influence the approaches used. We reviewed 259 schemes and were able to locate verification information for 142 of those. Expert verification was most widely used, especially among longer-running schemes, followed by community consensus and automated approaches. Expert verification has been the default approach for schemes in the past, but as the volume of data collected through citizen science schemes grows and the potential of automated approaches develops, many schemes might be able to implement approaches that verify data more efficiently. We present an idealised system for data verification, identifying schemes where this system could be applied and the requirements for implementation. We propose a hierarchical approach in which the bulk of records are verified by automation or community consensus, and any flagged records can then undergo additional levels of verification by experts.

[1]  Andrew Zisserman,et al.  Time-lapse imagery and volunteer classifications from the Zooniverse Penguin Watch project , 2018, Scientific Data.

[2]  Siân E Green,et al.  Innovations in Camera Trapping Technology and Approaches: The Integration of Citizen Science and Artificial Intelligence , 2020, Animals : an open access journal from MDPI.

[3]  Darryl N. Jones,et al.  Can Citizen Science Assist in Determining Koala (Phascolarctos cinereus) Presence in a Declining Population? , 2016, Animals : an open access journal from MDPI.

[4]  Sarah K. Chase,et al.  A framework for evaluating and designing citizen science programs for natural resources monitoring , 2016, Conservation biology : the journal of the Society for Conservation Biology.

[5]  F. Shilling,et al.  Large Extent Volunteer Roadkill and Wildlife Observation Systems as Sources of Reliable Data , 2017, Front. Ecol. Evol..

[6]  Helen E Roy,et al.  The diversity and evolution of ecological and environmental citizen science , 2017, PloS one.

[7]  Kevin Crowston,et al.  The future of citizen science: emerging technologies and shifting paradigms , 2012, Frontiers in Ecology and the Environment.

[8]  H. Rue,et al.  Spatial modeling of Audubon Christmas Bird Counts reveals fine‐scale patterns and drivers of relative abundance trends , 2019, Ecosphere.

[9]  David C Moyer,et al.  A comparison of deep learning and citizen science techniques for counting wildlife in aerial survey images , 2019, Methods in Ecology and Evolution.

[10]  A. Cocco,et al.  Using verified citizen science as a tool for monitoring the European hornet (Vespa crabro) in the island of Sardinia (Italy) , 2019, NeoBiota.

[11]  Carl Lagoze,et al.  eBird: Curating Citizen Science Data for Use by Diverse Communities , 2014, Int. J. Digit. Curation.

[12]  K. Lertzman,et al.  Democratizing conservation science and practice , 2018 .

[13]  B. Morgan,et al.  A generalized abundance index for seasonal invertebrates , 2016, Biometrics.

[14]  Adam J. Bates,et al.  The OPAL bugs count survey: exploring the effects of urbanisation and habitat characteristics using citizen science , 2015, Urban Ecosystems.

[15]  David B. Roy,et al.  Statistics for citizen science: extracting signals of change from noisy ecological data , 2014 .

[16]  David N. Bonter,et al.  Citizen Science as an Ecological Research Tool: Challenges and Benefits , 2010 .

[17]  Margaret Kosmala,et al.  Assessing data quality in citizen science (preprint) , 2016, bioRxiv.

[18]  Steve Kelling,et al.  Automated data verification in a large-scale citizen science project: A case study , 2012, 2012 IEEE 8th International Conference on E-Science.

[19]  M. Attrill,et al.  Spatiotemporal variability in the structure of seagrass meadows and associated macrofaunal assemblages in southwest England (UK): Using citizen science to benchmark ecological pattern , 2019, Ecology and evolution.

[20]  Helen E. Roy,et al.  Thinking like a naturalist: Enhancing computer vision of citizen science images by harnessing contextual data , 2020 .

[21]  J. Kaye,et al.  Citizen science or scientific citizenship? Disentangling the uses of public engagement rhetoric in national research initiatives , 2016, BMC Medical Ethics.

[22]  R. Whittaker,et al.  Beyond scarcity: citizen science programmes as useful tools for conservation biogeography , 2010 .

[23]  C. Bradshaw,et al.  Increased population size of fish in a lowland river following restoration of structural habitat , 2019, Ecological applications : a publication of the Ecological Society of America.

[24]  Michael J. O. Pocock,et al.  Ecological monitoring with citizen science: the design and implementation of schemes for recording plants in Britain and Ireland , 2015 .

[25]  H. Johnson,et al.  A comparison of 'traditional' and multimedia information systems development practices , 2003, Inf. Softw. Technol..

[26]  Alison Donnelly,et al.  The role of citizen science in monitoring biodiversity in Ireland , 2014, International Journal of Biometeorology.

[27]  Gene E. Likens,et al.  Communicating with the public: opportunities and rewards for individual ecologists , 2010 .

[28]  Abby J. Kinchy,et al.  Citizen Science: Probing the Virtues and Contexts of Participatory Research , 2016 .

[29]  Brigitte Huber,et al.  Fostering public trust in science: The role of social media , 2019, Public understanding of science.

[30]  Michael J. O. Pocock,et al.  The Biological Records Centre: a pioneer of citizen science , 2015 .

[31]  J. Křeček,et al.  Water-quality genesis in a mountain catchment affected by acidification and forestry practices , 2019, Freshwater Science.

[32]  Mark A. Burgman,et al.  Evaluating the accuracy and calibration of expert predictions under uncertainty: predicting the outcomes of ecological research , 2012 .

[33]  Tatsuya Amano,et al.  An agenda for the future of biological recording for ecological monitoring and citizen science , 2015 .

[34]  Rick Bonney,et al.  The history of public participation in ecological research , 2012 .

[35]  Jon Rosewell,et al.  Crowdsourcing the identification of organisms: A case-study of iSpot , 2015, ZooKeys.

[36]  Christopher Kullenberg,et al.  What Is Citizen Science? – A Scientometric Meta-Analysis , 2016, PloS one.

[37]  Quantifying the long-term decline of the West European hedgehog in England by subsampling citizen-science datasets , 2016, European Journal of Wildlife Research.

[38]  Helen E. Roy,et al.  Understanding citizen science and environmental monitoring: final report on behalf of UK Environmental Observation Framework , 2012 .

[39]  Christopher E Mason,et al.  The Power of Engaging Citizen Scientists for Scientific Progress , 2016, Journal of microbiology & biology education.

[40]  C. Lawton,et al.  The regional demise of a non-native invasive species: the decline of grey squirrels in Ireland , 2019, Biological Invasions.

[41]  S. Iyengar,et al.  Scientific communication in a post-truth society , 2018, Proceedings of the National Academy of Sciences.

[42]  Brian L. Sullivan,et al.  Brokering Trust in Citizen Science , 2018, Society & Natural Resources.

[43]  David Sewell,et al.  Optimising biodiversity assessments by volunteers: The application of occupancy modelling to large-scale amphibian surveys , 2010 .

[44]  Graham C. Smith,et al.  Economical crowdsourcing for camera trap image classification , 2018, Remote Sensing in Ecology and Conservation.

[45]  David B. Roy,et al.  Phenology of British butterflies and climate change , 2000 .

[46]  E. Gardner,et al.  Make the Adder Count: population trends from a citizen science survey of UK adders , 2019, January 2019.

[47]  Margaret Kosmala,et al.  Season Spotter: Using Citizen Science to Validate and Scale Plant Phenology from Near-Surface Remote Sensing , 2016, Remote. Sens..

[48]  Jun Yu,et al.  Emergent Filters: Automated Data Verification in a Large-Scale Citizen Science Project , 2011, 2011 IEEE Seventh International Conference on e-Science Workshops.

[49]  Caren B. Cooper,et al.  Data validation in citizen science: a case study from Project FeederWatch , 2012 .

[50]  Johannes Kamp,et al.  Unstructured citizen science data fail to detect long‐term population declines of common birds in Denmark , 2016 .

[51]  Jun Yu,et al.  Modeling Experts and Novices in Citizen Science Data for Species Distribution Modeling , 2010, 2010 IEEE International Conference on Data Mining.

[52]  Margaret Kosmala,et al.  A generalized approach for producing, quantifying, and validating citizen science data from wildlife images , 2016, Conservation biology : the journal of the Society for Conservation Biology.

[53]  Batumi Raptor Count: autumn raptor migration count data from the Batumi bottleneck, Republic of Georgia , 2019, ZooKeys.

[54]  C. Lintott,et al.  Snapshot Serengeti, high-frequency annotated camera trap images of 40 mammalian species in an African savanna , 2015, Scientific Data.

[55]  David T. Jones,et al.  Public participation in soil surveys: lessons from a pilot study in England. , 2012, Environmental science & technology.

[56]  Thomas J. Stohlgren,et al.  Assessing citizen science data quality: an invasive species case study , 2011 .

[57]  C. Mellish,et al.  The role of automated feedback in training and retaining biological recorders for citizen science , 2016, Conservation biology : the journal of the Society for Conservation Biology.

[58]  Kevin Crowston,et al.  Mechanisms for Data Quality and Validation in Citizen Science , 2011, 2011 IEEE Seventh International Conference on e-Science Workshops.

[59]  J. Silvertown A new dawn for citizen science. , 2009, Trends in ecology & evolution.

[60]  Derek L. Hansen,et al.  Motivations Affecting Initial and Long-Term Participation in Citizen Science Projects in Three Countries , 2014 .

[61]  Michael J. O. Pocock,et al.  Data-derived metrics describing the behaviour of field-based citizen scientists provide insights for project design and modelling bias , 2020, Scientific Reports.