Penalized estimation of sparse concentration matrices based on prior knowledge with applications to placenta elemental data

Identifying patterns of association or dependency among high-dimensional biological datasets with sparse precision matrices remains a challenge. In this paper, we introduce a weighted sparse Gaussian graphical model that can incorporate prior knowledge to infer the structure of the network of trace element concentrations, including essential elements as well as toxic metals and metaloids measured in the human placentas. We present the weighted L1 penalized regularization procedure for estimating the sparse precision matrix in the setting of Gaussian graphical models. First, we use simulation models to demonstrate that the proposed method yields a better estimate of the precision matrix than the procedures that fail to account for the prior knowledge of the network structure. Then, we apply this method to estimate sparse element concentration matrices of placental biopsies from the New Hampshire Birth Cohort Study. The chemical architecture for elements is complex; thus, the method proposed herein was applied to infer the dependency structures of the elements using prior knowledge of their biological roles.

[1]  A. Belger,et al.  Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia , 2014, NeuroImage: Clinical.

[2]  Hongzhe Li,et al.  Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks. , 2006, Biostatistics.

[3]  M. West,et al.  Sparse graphical models for exploring gene expression data , 2004 .

[4]  N. Ballatori,et al.  Transport of toxic metals by molecular mimicry. , 2002, Environmental health perspectives.

[5]  M. Bottai,et al.  Sex-specific effects of early life cadmium exposure on DNA methylation and implications for birth weight , 2013, Epigenetics.

[6]  L. Wasserman,et al.  False discovery control with p-value weighting , 2006 .

[7]  Jiang Gui,et al.  Testing multiple hypotheses through IMP weighted FDR based on a genetic functional network with application to a new zebrafish transcriptome study , 2014, BioData Mining.

[8]  D. Eide,et al.  A novel iron-regulated metal transporter from plants identified by functional expression in yeast. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Peter A. C. 't Hoen,et al.  Consistency of biological networks inferred from microarray and sequencing data , 2016, BMC Bioinformatics.

[10]  R. H. Washerman Transfer of Calcium and Strontium across Biological Membranes , 1961, Nature.

[11]  Stan Szpakowicz,et al.  Beyond Accuracy, F-Score and ROC: A Family of Discriminant Measures for Performance Evaluation , 2006, Australian Conference on Artificial Intelligence.

[12]  C. Marsit,et al.  Placental Metal Concentrations in Relation to Maternal and Infant Toenails in a U.S. Cohort. , 2016, Environmental science & technology.

[13]  N. Meinshausen,et al.  Consistent neighbourhood selection for sparse high-dimensional graphs with the Lasso , 2004 .

[14]  Korbinian Strimmer,et al.  An empirical Bayes approach to inferring large-scale gene association networks , 2005, Bioinform..

[15]  R. Tibshirani,et al.  Sparse inverse covariance estimation with the graphical lasso. , 2008, Biostatistics.

[16]  Daphne Koller,et al.  Sharing and Specificity of Co-expression Networks across 35 Human Tissues , 2014, PLoS Comput. Biol..

[17]  D. Eide,et al.  The IRT1 protein from Arabidopsis thaliana is a metal transporter with a broad substrate range , 1999, Plant Molecular Biology.

[18]  Pradeep Ravikumar,et al.  Sparse inverse covariance matrix estimation using quadratic approximation , 2011, MLSLP.

[19]  A. R. Twardock Studies on the Movement of Calcium and Strontium Across the Bovine Mammary Gland1 , 1963 .

[20]  Yan-Li Ji,et al.  Maternal cadmium exposure reduces placental zinc transport and induces fetal growth restriction in mice. , 2016, Reproductive toxicology.

[21]  Robert J.P. Williams,et al.  The Biological Chemistry of the Elements: The Inorganic Chemistry of Life , 2001 .

[22]  B. Rosen,et al.  Transport pathways for arsenic and selenium: a minireview. , 2009, Environment international.

[23]  T. Simons,et al.  Active transport of lead by the calcium pump in human red cell ghosts. , 1988, The Journal of physiology.

[24]  Rudolfs K. Zalups,et al.  Molecular and ionic mimicry and the transport of toxic metals. , 2005, Toxicology and applied pharmacology.

[25]  C. Marsit,et al.  Placental arsenic concentrations in relation to both maternal and infant biomarkers of exposure in a US cohort , 2015, Journal of Exposure Science and Environmental Epidemiology.