Causal modelling applied to the risk assessment of a wastewater discharge

Bayesian networks (BNs), or causal Bayesian networks, have become quite popular in ecological risk assessment and natural resource management because of their utility as a communication and decision-support tool. Since their development in the field of artificial intelligence in the 1980s, however, Bayesian networks have evolved and merged with structural equation modelling (SEM). Unlike BNs, which are constrained to encode causal knowledge in conditional probability tables, SEMs encode this knowledge in structural equations, which is thought to be a more natural language for expressing causal information. This merger has clarified the causal content of SEMs and generalised the method such that it can now be performed using standard statistical techniques. As it was with BNs, the utility of this new generation of SEM in ecological risk assessment will need to be demonstrated with examples to foster an understanding and acceptance of the method. Here, we applied SEM to the risk assessment of a wastewater discharge to a stream, with a particular focus on the process of translating a causal diagram (conceptual model) into a statistical model which might then be used in the decision-making and evaluation stages of the risk assessment. The process of building and testing a spatial causal model is demonstrated using data from a spatial sampling design, and the implications of the resulting model are discussed in terms of the risk assessment. It is argued that a spatiotemporal causal model would have greater external validity than the spatial model, enabling broader generalisations to be made regarding the impact of a discharge, and greater value as a tool for evaluating the effects of potential treatment plant upgrades. Suggestions are made on how the causal model could be augmented to include temporal as well as spatial information, including suggestions for appropriate statistical models and analyses.

[1]  J. Pearl Causal inference in statistics: An overview , 2009 .

[2]  Dan Geiger,et al.  Conditional independence and its representations , 1990, Kybernetika.

[3]  Bill Shipley,et al.  A New Inferential Test for Path Models Based on Directed Acyclic Graphs , 2000 .

[4]  Brian H. McArdle,et al.  FITTING MULTIVARIATE MODELS TO COMMUNITY DATA: A COMMENT ON DISTANCE‐BASED REDUNDANCY ANALYSIS , 2001 .

[5]  Bill Shipley,et al.  Confirmatory path analysis in a generalized multilevel context. , 2009, Ecology.

[6]  A. Underwood Beyond BACI: the detection of environmental impacts on populations in the real, but variable, world , 1992 .

[7]  Simone Fattorini,et al.  Cause and Correlation in Biology. A User's Guide to Path Analysis, Structural Equations and Causal Inference with R, Second edition, Bill Shipley. Cambridge University Press (2016), (ISBN: 978-1-107-44259-7, 314 pp., £39.99, paperback) , 2017 .

[8]  J. Bence,et al.  TEMPORAL AND SPATIAL VARIATION IN ENVIRONMENTAL IMPACT ASSESSMENT , 2001 .

[9]  Marti J. Anderson,et al.  Causal modeling with multivariate species data , 2013 .

[10]  Allan Stewart-Oaten,et al.  ENVIRONMENTAL IMPACT ASSESSMENT: "PSEUDOREPLICATION" IN TIME?' , 1986 .

[11]  Kevin B. Korb,et al.  Parameterisation and evaluation of a Bayesian network for use in an ecological risk assessment , 2007, Environ. Model. Softw..

[12]  P. Grambsch,et al.  A Package for Survival Analysis in S , 1994 .

[13]  A. J. Underwood,et al.  Beyond BACI: Experimental designs for detecting human environmental impacts on temporal variations in natural populations , 1991 .

[14]  S. Wood Generalized Additive Models: An Introduction with R , 2006 .

[15]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[16]  Sarah C. Goslee,et al.  The ecodist Package for Dissimilarity-based Analysis of Ecological Data , 2007 .

[17]  P. Legendre,et al.  vegan : Community Ecology Package. R package version 1.8-5 , 2007 .

[18]  Warren L. Paul A causal modelling approach to spatial and temporal confounding in environmental impact studies , 2011 .

[19]  Pierre Legendre,et al.  DISTANCE‐BASED REDUNDANCY ANALYSIS: TESTING MULTISPECIES RESPONSES IN MULTIFACTORIAL ECOLOGICAL EXPERIMENTS , 1999 .

[20]  Dan Geiger,et al.  Identifying independence in bayesian networks , 1990, Networks.

[21]  G. Minshall,et al.  The River Continuum Concept , 1980 .

[22]  D. Helsel Nondetects and data analysis : statistics for censored environmental data , 2005 .

[23]  Peter Spirtes,et al.  Introduction to Causal Inference , 2010, J. Mach. Learn. Res..

[24]  Raymond N. Gorley,et al.  PERMANOVA+ for PRIMER. Guide to software and statistical methods , 2008 .

[25]  Alan Y. Chiang,et al.  Generalized Additive Models: An Introduction With R , 2007, Technometrics.

[26]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[27]  B. Shipley,et al.  A Correction Note on “A New Inferential Test for Path Models Based on Directed Acyclic Graphs” , 2009 .

[28]  Donald A. Jackson STOPPING RULES IN PRINCIPAL COMPONENTS ANALYSIS: A COMPARISON OF HEURISTICAL AND STATISTICAL APPROACHES' , 1993 .

[29]  J. Pearl Graphs, Causality, and Structural Equation Models , 1998 .

[30]  James B. Grace,et al.  Guidelines for a graph-theoretic implementation of structural equation modeling , 2012, Ecosphere.

[31]  J. Pearl Causal diagrams for empirical research , 1995 .

[32]  S. Frontier Étude de la décroissance des valeurs propres dans une analyse en composantes principales: Comparaison avec le modd́le du bâton brisé , 1976 .

[33]  Bill Shipley,et al.  Cause and Correlation in Biology: A User''s Guide to Path Analysis , 2016 .

[34]  James B. Grace,et al.  Structural Equation Modeling and Natural Systems , 2006 .

[35]  B. Marcot,et al.  Bayesian belief networks: applications in ecology and natural resource management , 2006 .

[36]  E. Victoria RISK-BASED ASSESSMENT OF ECOSYSTEM PROTECTION IN AMBIENT WATERS , 2004 .