Exploring the Use of Decision Tree Methodology in Hydrology Using Crowdsourced Data

To fill the observations gap on ungauged streams, crowdsourced distributed hydrologic measurements were considered as a potential supplement for observational data networks. However, citizen science data come with uncertainty as they are provided by the general public. In order to investigate this uncertainty, a decision tree methodology was applied to evaluate existing citizen science data of stream stage based on the CrowdHydrology (CH) network. Quality control (QC) flags were developed and applied to CH sites, dividing Level 1 dataset (raw dataset) into Level 2 (flagged dataset) and Level 3 (processed dataset). Error estimates were calculated to determine uncertainty in the citizen science data. The results indicate that the decision tree could provide reliable QC for citizen science data and demonstrate how uncertainty can be quantified in the QC datasets.

[1]  J. Ross Quinlan,et al.  Generating Production Rules from Decision Trees , 1987, IJCAI.

[2]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[3]  Wilbert O. Thomas,et al.  The stream-gaging program of the U.S. Geological Survey , 1995 .

[4]  C. Brodley,et al.  Decision tree classification of land cover from remotely sensed data , 1997 .

[5]  S. Lemon,et al.  Classification and regression tree analysis in public health: Methodological review and comparison with logistic regression , 2003, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[6]  Éric Gaussier,et al.  A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation , 2005, ECIR.

[7]  Brian L. Sullivan,et al.  eBird: A citizen-based bird observation network in the biological sciences , 2009 .

[8]  Matthew N. Anyanwu,et al.  Comparative Analysis of Serial Decision Tree Classification Algorithms , 2009 .

[9]  Alin Dobra,et al.  Decision Tree Classification , 2009, Encyclopedia of Database Systems.

[10]  G. Gauchat Politicization of Science in the Public Sphere , 2012 .

[11]  Rick Bonney,et al.  The current state of citizen science as a tool for ecological research and public engagement , 2012 .

[12]  Michael N. Fienen,et al.  Social.Water - A crowdsourcing tool for environmental data acquisition , 2012, Comput. Geosci..

[13]  S. McCormick After the cap: Risk assessment, citizen science, and disaster recovery , 2012 .

[14]  S. Koponen,et al.  Water quality analysis using an inexpensive device and a mobile phone , 2013, Environmental Systems Research.

[15]  Christopher S Lowry,et al.  CrowdHydrology: Crowdsourcing Hydrologic Data and Engaging Citizen Scientists , 2013, Ground water.

[16]  Ian Graham,et al.  Hacker science versus closed science: building environmental monitoring infrastructure , 2014 .

[17]  Chris Mungall,et al.  Global biotic interactions: An open infrastructure to share and analyze species-interaction datasets , 2014, Ecol. Informatics.

[18]  Yong Wang,et al.  Online active learning of decision trees with evidential data , 2016, Pattern Recognit..

[19]  Lucas Wexler,et al.  Glossary Of Meteorology , 2016 .

[20]  Jérôme Le Coz,et al.  Crowdsourced data for flood hydrology: Feedback from recent citizen science projects in Argentina, France and New Zealand , 2016 .

[21]  Krzysztof Z. Gajos,et al.  Crowdsourcing as a Tool for Research: Implications of Uncertainty , 2017, CSCW.

[22]  M. Clausel,et al.  Decision tree for uncertainty measures , 2018 .

[23]  Simon Etter,et al.  Testing the Waters: Mobile Apps for Crowdsourced Streamflow Data , 2018 .

[24]  B Weeser,et al.  Citizen science pioneers in Kenya - A crowdsourced approach for hydrological monitoring. , 2018, The Science of the total environment.

[25]  Nick van de Giesen,et al.  Citizen science flow – an assessment of simple streamflow measurement methods , 2019, Hydrology and Earth System Sciences.

[26]  Kristine F. Stepenuck,et al.  Growing Pains of Crowdsourced Stream Stage Monitoring Using Mobile Phones: The Development of CrowdHydrology , 2019, Front. Earth Sci..

[27]  Simon Etter,et al.  Virtual Staff Gauges for Crowd-Based Stream Level Observations , 2019, Front. Earth Sci..

[28]  M. Rufino,et al.  Citizen science in hydrological monitoring and ecosystem services management: State of the art and future prospects. , 2019, The Science of the total environment.

[29]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.