Developing a new Bayesian Risk Index for risk evaluation of soil contamination.

Industrial and agricultural activities heavily constrain soil quality. Potentially Toxic Elements (PTEs) are a threat to public health and the environment alike. In this regard, the identification of areas that require remediation is crucial. In the herein research a geochemical dataset (230 samples) comprising 14 elements (Cu, Pb, Zn, Ag, Ni, Mn, Fe, As, Cd, V, Cr, Ti, Al and S) was gathered throughout eight different zones distinguished by their main activity, namely, recreational, agriculture/livestock and heavy industry in the Avilés Estuary (North of Spain). Then a stratified systematic sampling method was used at short, medium, and long distances from each zone to obtain a representative picture of the total variability of the selected attributes. The information was then combined in four risk classes (Low, Moderate, High, Remediation) following reference values from several sediment quality guidelines (SQGs). A Bayesian analysis, inferred for each zone, allowed the characterization of PTEs correlations, the unsupervised learning network technique proving to be the best fit. Based on the Bayesian network structure obtained, Pb, As and Mn were selected as key contamination parameters. For these 3 elements, the conditional probability obtained was allocated to each observed point, and a simple, direct index (Bayesian Risk Index-BRI) was constructed as a linear rating of the pre-defined risk classes weighted by the previously obtained probability. Finally, the BRI underwent geostatistical modeling. One hundred Sequential Gaussian Simulations (SGS) were computed. The Mean Image and the Standard Deviation maps were obtained, allowing the definition of High/Low risk clusters (Local G clustering) and the computation of spatial uncertainty. High-risk clusters are mainly distributed within the area with the highest altitude (agriculture/livestock) showing an associated low spatial uncertainty, clearly indicating the need for remediation. Atmospheric emissions, mainly derived from the metallurgical industry, contribute to soil contamination by PTEs.

[1]  Peter B Woodbury,et al.  Dos and don'ts of spatially explicit ecological risk assessments , 2003, Environmental toxicology and chemistry.

[2]  J. J. Stone,et al.  Arsenic and uranium transport in sediments near abandoned uranium mines in Harding County, South Dakota , 2009 .

[3]  C. Ordóñez Galán,et al.  Functional data analysis as a tool to correlate textural and geochemical data , 2013, Appl. Math. Comput..

[4]  J. Loredo,et al.  Investigation of trace element sources from an industrialized area (Avilés, northern Spain) using multivariate statistical methods. , 2002, Environment international.

[5]  Peter Glöckner,et al.  Why Does Unsupervised Pre-training Help Deep Learning? , 2013 .

[6]  Henrique Garcia Pereira,et al.  Geostatistical Estimation of a Summary Recovery Index for Marble Quarries , 1993 .

[7]  Nir Friedman,et al.  Being Bayesian About Network Structure. A Bayesian Approach to Structure Discovery in Bayesian Networks , 2004, Machine Learning.

[8]  M. Silva,et al.  Contaminated water, stream sediments and soils close to the abandoned Pinhal do Souto uranium mine, central Portugal , 2014 .

[9]  M. Albuquerque,et al.  Using indicator kriging for the evaluation of arsenic potential contamination in an abandoned mining area (Portugal). , 2013, The Science of the total environment.

[10]  M. Albuquerque,et al.  Uranium and Arsenic Spatial Distribution in the Águeda Watershed Groundwater , 2014 .

[11]  T. LaForce,et al.  Bayesian Reservoir History Matching Considering Model and Parameter Uncertainties , 2012, Mathematical Geosciences.

[12]  Rui Zhou,et al.  Scenario analysis of mine water inrush hazard using Bayesian networks , 2016 .

[13]  Tomislav Hengl,et al.  Heavy metals in European soils: A geostatistical analysis of the FOREGS geochemical database , 2008 .

[14]  R. Olea Geostatistics for Natural Resources Evaluation By Pierre Goovaerts, Oxford University Press, Applied Geostatistics Series, 1997, 483 p., hardcover, $65 (U.S.), ISBN 0-19-511538-4 , 1999 .

[15]  Lammert Kooistra,et al.  Environmental risk mapping of pollutants: state of the art and communication aspects. , 2010, The Science of the total environment.

[16]  Judea Pearl,et al.  Fusion, Propagation, and Structuring in Belief Networks , 1986, Artif. Intell..

[17]  Andrew J Davies,et al.  Bayesian inference-based environmental decision support systems for oil spill response strategy selection. , 2015, Marine pollution bulletin.

[18]  A. Bannari,et al.  Assessment of soil contamination around an abandoned mine in a semi-arid environment using geochemistry and geostatistics: Pre-work of geochemical process modeling with numerical models , 2013 .

[19]  A. Getis The Analysis of Spatial Association by Use of Distance Statistics , 2010 .

[20]  Marek J. Druzdzel,et al.  Intercausal Reasoning with Uninstantiated Ancestor Nodes , 1993, UAI.

[21]  J. E. T. Moen,et al.  Soil Protection and Remedial Actions: Criteria for Decision Making and Standardization of Requirements , 1986 .

[22]  Kwong-Sak Leung,et al.  Using Evolutionary Programming and Minimum Description Length Principle for Data Mining of Bayesian Networks , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[24]  Feng Qi,et al.  Knowledge discovery from soil maps using inductive learning , 2003, Int. J. Geogr. Inf. Sci..

[25]  T. Mayr,et al.  On the application of Bayesian Networks in Digital Soil Mapping , 2015 .

[26]  Phaedon C. Kyriakidis,et al.  Geostatistical Space–Time Models: A Review , 1999 .

[27]  Chunxu Hao,et al.  Environmental Performance Index at the Provincial Level for China 2006–2011 , 2017 .

[28]  K. R. Hayes,et al.  How believable is your BBN , 2009 .

[29]  K S McDonald,et al.  Developing best-practice Bayesian Belief Networks in ecological risk assessments for freshwater and estuarine ecosystems: a quantitative review. , 2015, Journal of environmental management.

[30]  Ben J. M. Ale,et al.  Risk maps and communication , 1998 .

[31]  L. Salvati,et al.  Land quality, sustainable development and environmental degradation in agricultural districts: A computational approach based on entropy indexes , 2017 .

[32]  V. Ettler,et al.  Geochemical sources, forms and phases of soil contamination in an industrial city. , 2017, The Science of the total environment.

[33]  Rafael Rumí,et al.  Bayesian networks in environmental modelling , 2011, Environ. Model. Softw..

[34]  Khalil S. Hindi,et al.  Minimum-weight spanning tree algorithms A survey and empirical study , 2001, Comput. Oper. Res..

[35]  Bruce G. Marcot,et al.  Metrics for evaluating performance and uncertainty of Bayesian network models , 2012 .

[36]  M. Bidovec,et al.  Geochemical Atlas of Europe, Part 1, Background Information, Methodology and Maps , 2005 .

[37]  Yong-guan Zhu,et al.  Trace metal contamination in urban soils of China. , 2012, The Science of the total environment.

[38]  Eduardo Moreno-Jiménez,et al.  Screening risk assessment tools for assessing the environmental impact in an abandoned pyritic mine in Spain. , 2011, The Science of the total environment.

[39]  W. H. Rulkens,et al.  A GIS-based approach for the long-term prediction of human health risks at contaminated sites , 2005 .

[40]  Michael N. Fienen,et al.  A statistical learning framework for groundwater nitrate models of the Central Valley, California, USA , 2015 .

[41]  Matt Gerstenberger,et al.  Bi-directional risk assessment in carbon capture and storage with Bayesian Networks , 2015 .

[42]  João F Pinto,et al.  Construction of a quality index for granules produced by fluidized bed technology and application of the correspondence analysis as a discriminant procedure. , 2010, European journal of pharmaceutics and biopharmaceutics : official journal of Arbeitsgemeinschaft fur Pharmazeutische Verfahrenstechnik e.V.

[43]  Jin Wang,et al.  Incorporation of formal safety assessment and Bayesian network in navigational risk estimation of the Yangtze River , 2013, Reliab. Eng. Syst. Saf..

[44]  R. Bargagli,et al.  Barium and Other Trace Metals as Indicators of Vehicle Emissions , 1997 .

[45]  Andrew Kusiak,et al.  Very short-term wind speed forecasting with Bayesian structural break model , 2013 .

[46]  Chunye Lin,et al.  Contamination and health risks of soil heavy metals around a lead/zinc smelter in southwestern China. , 2015, Ecotoxicology and environmental safety.

[47]  Jacques Rivoirard Concepts and Methods of Geostatistics , 2005 .

[48]  Geoffrey I. Webb,et al.  Not So Naive Bayes: Aggregating One-Dependence Estimators , 2005, Machine Learning.

[49]  Oz Sahin,et al.  Applications of Bayesian belief networks in water resource management: A systematic review , 2016, Environ. Model. Softw..