Modelling bioaccumulation of heavy metals in soil-crop ecosystems and identifying its controlling factors using machine learning.

The prediction and identification of the factors controlling heavy metal transfer in soil-crop ecosystems are of critical importance. In this study, random forest (RF), gradient boosted machine (GBM), and generalised linear (GLM) models were compared after being used to model and identify prior factors that affect the transfer of heavy metals (HMs) in soil-crop systems in the Yangtze River Delta, China, based on 13 covariates with 1822 pairs of soil-crop samples. The mean bioaccumulation factors (BAFs) for all crops followed the order Cd > Zn > As > Cu > Ni > Hg > Cr > Pb. The RF model showed the best prediction ability for the BAFs of HMs in soil-crop ecosystems, followed by GBM and GLM. The R2 values of the RF models for the BAFs of Zn, Cu, Cr, Ni, Hg, Cd, As, and Pb were 0.84, 0.66, 0.59, 0.58, 0.58, 0.51, 0.30, and 0.17, respectively. The primary controlling factor in soil-to-crop transfer of all HMs under study was plant type, followed by soil heavy metal content and soil organic materials. The model used herein could be used to assist the prediction of heavy metal contents in crops based on heavy metal contents in soil and other covariates, and can significantly reduce the cost, labour, and time requirements involved with laboratory analysis. It can also be used to quantify the importance of variables and identify potential control factors in heavy metal bioaccumulation in soil-crop ecosystems.

[1]  Greg Ridgeway,et al.  Generalized Boosted Models: A guide to the gbm package , 2006 .

[2]  Bailin Liu,et al.  Assessment of the bioavailability, bioaccessibility and transfer of heavy metals in the soil-grain-human systems near a mining and smelting area in NW China. , 2017, The Science of the total environment.

[3]  Fangbai Li,et al.  Accumulation of heavy metals in leaf vegetables from agricultural soils and associated potential health risks in the Pearl River Delta, South China , 2013, Environmental Monitoring and Assessment.

[4]  G. Christakos,et al.  Improved heavy metal mapping and pollution source apportionment in Shanghai City soils using auxiliary information. , 2019, The Science of the total environment.

[5]  Zhou Shi,et al.  Estimating spatially downscaled rainfall by regression kriging using TRMM precipitation and elevation in Zhejiang Province, southeast China , 2014 .

[6]  Hefa Cheng,et al.  Planning for sustainability in China's urban development: status and challenges for Dongtan eco-city project. , 2010, Journal of environmental monitoring : JEM.

[7]  Mogens Humlekrog Greve,et al.  Mapping soil organic matter contents at field level with Cubist, Random Forest and kriging , 2019, Geoderma.

[8]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[9]  D. Guan,et al.  The influence of bioavailable heavy metals and microbial parameters of soil on the metal accumulation in rice grain. , 2017, Chemosphere.

[10]  J. Keith Ord,et al.  Spatial Processes Models and Applications , 1981 .

[11]  Yan Li,et al.  Composite assessment of human health risk from potentially toxic elements through multiple exposure routes: A case study in farmland in an important industrial city in East China , 2020 .

[12]  D. Fatta-Kassinos,et al.  Assessment of long-term wastewater irrigation impacts on the soil geochemical properties and the bioaccumulation of heavy metals to the agricultural products , 2014, Environmental Monitoring and Assessment.

[13]  C. Chatterjee,et al.  Physiological and Biochemical Responses of French Bean to Excess Cobalt , 2006 .

[14]  F. Ekelund,et al.  Is wood ash amendment a suitable mitigation strategy for N2O emissions from soil? , 2020, The Science of the total environment.

[15]  J. Friedman Stochastic gradient boosting , 2002 .

[16]  Meie Wang,et al.  Assessing cadmium exposure risks of vegetables with plant uptake factor and soil property. , 2018, Environmental pollution.

[17]  J M Clarke,et al.  Selection and breeding of plant cultivars to minimize cadmium accumulation. , 2008, The Science of the total environment.

[18]  Yan Li,et al.  Assessment of the potential health risks of heavy metals in soils in a coastal industrial region of the Yangtze River Delta , 2017, Environmental Science and Pollution Research.

[19]  Shu Tao,et al.  The Challenges and Solutions for Cadmium-contaminated Rice in China: A Critical Review. , 2016, Environment international.

[20]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[21]  M. Vijver,et al.  Monitoring metals in terrestrial environments within a bioavailability framework and a focus on soil extraction. , 2007, Ecotoxicology and environmental safety.

[22]  Q. Qian,et al.  Comparative proteomic analysis provides new insights into cadmium accumulation in rice grain under cadmium stress. , 2014, Journal of hazardous materials.

[23]  Zhou Shi,et al.  A methodological framework for identifying potential sources of soil heavy metal pollution based on machine learning: A case study in the Yangtze Delta, China. , 2019, Environmental pollution.

[24]  Yongzhong Qian,et al.  Concentrations of cadmium, lead, mercury and arsenic in Chinese market milled rice and associated population health risk , 2010 .

[25]  Jianming Xu,et al.  Heavy metal contaminations in a soil-rice system: identification of spatial dependence in relation to soil properties of paddy fields. , 2010, Journal of hazardous materials.

[26]  Hefa Cheng,et al.  Application of stochastic models in identification and apportionment of heavy metal pollution sources in the surface soils of a large-scale region. , 2013, Environmental science & technology.

[27]  K. Kalbitz,et al.  Mobilization of heavy metals and arsenic in polluted wetland soils and its dependence on dissolved organic matter. , 1998, The Science of the total environment.

[28]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[29]  Fan Wang,et al.  Human health risk assessment of heavy metals in soil-vegetable system: a multi-medium analysis. , 2013, The Science of the total environment.

[30]  Manfred Lenzen,et al.  Mercury Flows in China and Global Drivers. , 2017, Environmental science & technology.

[31]  Li Tian,et al.  Integrated GIS and multivariate statistical analysis for regional scale assessment of heavy metal soil contamination: A critical review. , 2017, Environmental pollution.

[32]  G. Pan,et al.  Biochar soil amendment as a solution to prevent Cd-tainted rice from China: Results from a cross-site field experiment , 2013 .

[33]  T. Arao,et al.  Heavy metal contamination of agricultural soil and countermeasures in Japan , 2010, Paddy and Water Environment.

[34]  Lei Li,et al.  Zn, Ni, Mn, Cr, Pb and Cu in soil-tea ecosystem: The concentrations, spatial relationship and potential control. , 2018, Chemosphere.

[35]  B. J. Alloway,et al.  Effects of short-term pH fluctuations on cadmium, nickel, lead, and zinc availability to ryegrass in a sewage sludge-amended field. , 2008, Chemosphere.

[36]  C. Monterroso,et al.  Bioavailability and plant accumulation of heavy metals and phosphorus in agricultural soils amended by long-term application of sewage sludge. , 2007, Chemosphere.

[37]  Yan Li,et al.  Source Identification and Apportionment of Trace Elements in Soils in the Yangtze River Delta, China , 2018, International journal of environmental research and public health.

[38]  Xianguo Lu,et al.  Cd and Pb Contents in Soil, Plants, and Grasshoppers along a Pollution Gradient in Huludao City, Northeast China , 2011, Biological Trace Element Research.

[39]  E. Meers,et al.  Trace metal behaviour in estuarine and riverine floodplain soils and sediments: a review. , 2009, The Science of the total environment.

[40]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[41]  Zhou Shi,et al.  Improvement of Spatial Modeling of Cr, Pb, Cd, As and Ni in Soil Based on Portable X-ray Fluorescence (PXRF) and Geostatistics: A Case Study in East China , 2019, International journal of environmental research and public health.

[42]  Binggan Wei,et al.  A review of heavy metal contaminations in urban soils, urban road dusts and agricultural soils from China. , 2010 .

[43]  D. Brus,et al.  Predictions of spatially averaged cadmium contents in rice grains in the Fuyang Valley, P.R. China. , 2009, Journal of environmental quality.

[44]  Zhou Shi,et al.  Application of portable XRF and VNIR sensors for rapid assessment of soil heavy metal pollution , 2017, PloS one.

[45]  Peifang Wang,et al.  Effects of Pb on the oxidative stress and antioxidant response in a Pb bioaccumulator plant Vallisneria natans. , 2012, Ecotoxicology and environmental safety.

[46]  Yan Li,et al.  Identifying heavy metal pollution hot spots in soil-rice systems: A case study in South of Yangtze River Delta, China. , 2019, The Science of the total environment.

[47]  Zhong Tang,et al.  Soil contamination in China: current status and mitigation strategies. , 2015, Environmental science & technology.

[48]  Reena Singh,et al.  Heavy metals and living systems: An overview , 2011, Indian journal of pharmacology.

[49]  Qi Wang,et al.  Cadmium accumulation in edible flowering cabbages in the Pearl River Delta, China: Critical soil factors and enrichment models. , 2018, Environmental pollution.

[50]  Yan Li,et al.  Heavy Metal Pollution Delineation Based on Uncertainty in a Coastal Industrial City in the Yangtze River Delta, China , 2018, International journal of environmental research and public health.

[51]  G. Pan,et al.  Effects of biochar on availability and plant uptake of heavy metals - A meta-analysis. , 2018, Journal of environmental management.

[52]  E. Mentasti,et al.  Accumulation of heavy metals from contaminated soil to plants and evaluation of soil remediation by vermiculite. , 2011, Chemosphere.

[53]  Ren-qing Wang,et al.  Multiple factors impact the contents of heavy metals in vegetables in high natural background area of China. , 2017, Chemosphere.

[54]  A. Bellanca,et al.  Heavy metals in urban soils: a case study from the city of Palermo (Sicily), Italy. , 2002, The Science of the total environment.

[55]  Hefa Cheng,et al.  Improving China’s water resources management for better adaptation to climate change , 2012, Climatic Change.

[56]  C. Lan,et al.  Cadmium in soil–rice system and health risk associated with the use of untreated mining wastewater for irrigation in Lechang, China , 2006 .

[57]  Dawei Han,et al.  Machine Learning Techniques for Downscaling SMOS Satellite Soil Moisture Using MODIS Land Surface Temperature for Hydrological Application , 2013, Water Resources Management.

[58]  Claire Deacon,et al.  Variation in rice cadmium related to human exposure. , 2013, Environmental science & technology.

[59]  W-X Liu,et al.  Uptake of Toxic Heavy Metals by Rice (Oryza sativa L.) Cultivated in the Agricultural Soil near Zhengzhou City, People’s Republic of China , 2007, Bulletin of environmental contamination and toxicology.

[60]  I Iribarren,et al.  Risk-based evaluation of the exposure of children to trace elements in playgrounds in Madrid (Spain). , 2007, Chemosphere.

[61]  S. Zheng,et al.  Lead contamination in tea garden soils and factors affecting its bioavailability. , 2005, Chemosphere.

[62]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[63]  Yongzhang Zhou,et al.  Bioavailability and soil-to-crop transfer of heavy metals in farmland soils: A case study in the Pearl River Delta, South China. , 2018, Environmental pollution.

[64]  Bing Li,et al.  Accumulation of total mercury and methylmercury in rice plants collected from different mining areas in China. , 2014, Environmental pollution.

[65]  Guoping Zhang,et al.  The influence of pH and organic matter content in paddy soil on heavy metal availability and their uptake by rice plants. , 2011, Environmental pollution.

[66]  Yang Wang,et al.  Multivariate and geostatistical analyses of the spatial distribution and sources of heavy metals in agricultural soil in Dehui, Northeast China. , 2013, Chemosphere.

[67]  Dominique Arrouays,et al.  Optimizing pedotransfer functions for estimating soil bulk density using boosted regression trees. , 2009 .

[68]  Patricio Crespo,et al.  Spatial prediction of soil water retention in a Páramo landscape: Methodological insight into machine learning using random forest , 2018 .

[69]  N. Rascio,et al.  Heavy metal hyperaccumulating plants: how and why do they do it? And what makes them so interesting? , 2011, Plant science : an international journal of experimental plant biology.

[70]  Yan Li,et al.  Assessment of Heavy Metal Pollution and Health Risks in the Soil-Plant-Human System in the Yangtze River Delta, China , 2017, International journal of environmental research and public health.

[71]  Qiuyun Zhang,et al.  Concentration and transportation of heavy metals in vegetables and risk assessment of human exposure to bioaccessible heavy metals in soil near a waste-incinerator site, South China. , 2015, The Science of the total environment.

[72]  Youngihn Kho,et al.  GeoDa: An Introduction to Spatial Data Analysis , 2006 .

[73]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[74]  Yepu Li,et al.  Accumulation, spatio-temporal distribution, and risk assessment of heavy metals in the soil-corn system around a polymetallic mining area from the Loess Plateau, northwest China , 2017 .

[75]  Zhou Shi,et al.  Estimating soil salinity from remote sensing and terrain data in southern Xinjiang Province, China , 2019, Geoderma.