Compositional analysis of topsoil metals and its associations with cancer mortality using spatial misaligned data

The presence of toxic metals in soil per se, and in soil impacted by mining, industry, agriculture and urbanisation in particular, is a major concern for both human health and ecotoxicology. The dual aim of this study was: to ascertain whether topsoil composition could influence the spatial distribution of mortality due to different types of cancer and to identify possible errors committed by epidemiological studies which analyse soil composition data as a closed number system. We conducted an ecological cancer mortality study, covering 861,440 cancer deaths (27 cancer sites) in 7917 Spanish mainland towns, from 1999 to 2008. Topsoil levels of Al, As, Cd, Cr, Cu, Fe, Mn, Ni, Pb and Zn were determined by ICP-MS at 13,317 sampling points. We transformed the topsoil data in two ways, i.e. log transformation and centred logratio transformation. Principal factor analysis was performed to obtain independent latent factors for the transformed variables. To estimate the effect on mortality of topsoil factor loadings, we fitted Besag, York and Mollié models embedded in geostatistical-spatial models. This model included soil sample locations and town centroids (non-aligned data), fitted using the integrated nested Laplace approximation (INLA) as a tool for Bayesian inference and stochastic partial differential equations (SPDE). All results were adjusted for socio-demographic variables. The results indicated that soil composition could have an influence on the spatial distribution and mortality patterns of cancer. The analysis adjusted for socio-demographic variables showed excess male mortality due to digestive system tumours in areas with soils containing higher Cd, Pb, Zn, Mn and Cu concentrations, bladder cancer in areas with soils containing higher Cd concentrations, and brain cancer in areas with soils containing As. In both sexes, cancer of oesophagus was associated with soils containing a higher lead content, while lung cancer was associated with soils containing a higher copper content. Stress should be laid on the importance of taking into account the compositional nature of the data in this type of analysis.

[1]  Olle Selinus,et al.  Essentials of Medical Geology: Revised Edition , 2013 .

[2]  M. Pollán,et al.  Municipal distribution of bladder cancer mortality in Spain: Possible role of mining and industry , 2006, BMC public health.

[3]  Soyoung Jeon,et al.  Measurement Error caused by Spatial Misalignment in Environmental Epidemiology , 2009 .

[4]  Catherine A Calder,et al.  Accounting for uncertainty in ecological analysis: the strengths and limitations of hierarchical statistical modeling. , 2009, Ecological applications : a publication of the Ecological Society of America.

[5]  F. Tao,et al.  Potential health risks of heavy metals in cultivated topsoil and grain, including correlations with human primary liver, lung and gastric cancer, in Anhui province, Eastern China. , 2014, The Science of the total environment.

[6]  Clemens Reimann,et al.  Statistical data analysis explained : applied environmental statics with R , 2008 .

[7]  Finn Lindgren,et al.  Bayesian Spatial Modelling with R-INLA , 2015 .

[8]  Matthew C. B. Tsilimigras,et al.  Compositional data analysis of the microbiome: fundamentals, tools, and challenges. , 2016, Annals of epidemiology.

[9]  Ignacio Aguaded,et al.  Association between residential proximity to environmental pollution sources and childhood renal tumors. , 2016, Environmental research.

[10]  Habibul Ahsan,et al.  The Broad Scope of Health Effects from Chronic Arsenic Exposure: Update on a Worldwide Public Health Problem , 2013, Environmental health perspectives.

[11]  J. Besag,et al.  Bayesian image restoration, with two applications in spatial statistics , 1991 .

[12]  Kurt Straif,et al.  A review of human carcinogens--Part C: metals, arsenic, dusts, and fibres. , 2009, The Lancet. Oncology.

[13]  J. Ranville Evaluating the Influence of Soil Geochemical Characteristics on Metal Bioavailability , 2008 .

[14]  C Montomoli,et al.  Spatial correlation in ecological analysis. , 1993, International journal of epidemiology.

[15]  M. Cameletti,et al.  Spatial and Spatio-temporal Bayesian Models with R - INLA , 2015 .

[16]  Gonzalo López-Abente,et al.  Proximity to mining industry and cancer mortality. , 2012, The Science of the total environment.

[17]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[18]  C. Gotway,et al.  Combining Incompatible Spatial Data , 2002 .

[19]  M. L. Leite Applying compositional data methodology to nutritional epidemiology , 2016, Statistical methods in medical research.

[20]  H. Rue,et al.  An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach , 2011 .

[21]  Andrew O. Finley,et al.  Bayesian Hierarchical Models for Spatially Misaligned Data in R , 2014 .

[22]  P. Fernández-Navarro,et al.  Arsenic and chromium topsoil levels and cancer mortality in Spain , 2016, Environmental Science and Pollution Research.

[23]  H. Rue,et al.  Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations , 2009 .

[24]  F. Mullick,et al.  Environmental Pathology , 2012, Essentials of Medical Geology.

[25]  Clemens Reimann,et al.  Robust factor analysis for compositional data , 2009, Comput. Geosci..

[26]  N. Malats,et al.  Pancreatic cancer risk and levels of trace elements , 2011, Gut.

[27]  P. Fernández-Navarro,et al.  Time trends in municipal distribution patterns of cancer mortality in Spain , 2014, BMC Cancer.

[28]  H. Rue,et al.  Spatio-temporal modeling of particulate matter concentration through the SPDE approach , 2012, AStA Advances in Statistical Analysis.

[29]  P. Filzmoser,et al.  The concept of compositional data analysis in practice--total major element concentrations in agricultural and grazing land soils of Europe. , 2012, The Science of the total environment.

[30]  Gonzalo López-Abente,et al.  Description of industrial pollution in Spain , 2007, BMC public health.

[31]  John Aitchison,et al.  Principles of compositional data analysis , 1994 .

[32]  P. Filzmoser,et al.  The bivariate statistical analysis of environmental (compositional) data. , 2010, The Science of the total environment.