Of mice and men: Sparse statistical modeling in cardiovascular genomics

In high-throughput genomics, large-scale designed experiments are becoming common, and analysis approaches based on highly multivariate regression and anova concepts are key tools. Shrinkage models of one form or another can provide comprehensive approaches to the problems of simultaneous inference that involve implicit multiple comparisons over the many, many parameters representing effects of design factors and covariates. We use such approaches here in a study of cardiovascular genomics. The primary experimental context concerns a carefully designed, and rich, gene expression study focused on gene-environment interactions, with the goals of identifying genes implicated in connection with disease states and known risk factors, and in generating expression signatures as proxies for such risk factors. A coupled exploratory analysis investigates cross-species extrapolation of gene expression signatures--how these mouse-model signatures translate to humans. The latter involves exploration of sparse latent factor analysis of human observational data and of how it relates to projected risk signatures derived in the animal models. The study also highlights a range of applied statistical and genomic data analysis issues, including model specification, computational questions and model-based correction of experimental artifacts in DNA microarray data.

[1]  K. Possinger,et al.  Increased expression of the tetraspanins CD53 and CD63 on apoptotic human neutrophils , 2000, Journal of leukocyte biology.

[2]  J. Pedersen-Lane,et al.  Analysis of the thiol status of peripheral blood leukocytes in rheumatoid arthritis patients , 2007, Journal of leukocyte biology.

[3]  P. Müller,et al.  A Bayesian mixture model for differential gene expression , 2005 .

[4]  M. West,et al.  Gene expression phenotypic models that predict the activity of oncogenic pathways , 2003, Nature Genetics.

[5]  F. Ohsuzu,et al.  The effect of interleukin-1 receptor antagonist on arteries and cholesterol metabolism. , 2006, Journal of atherosclerosis and thrombosis.

[6]  L. Wasserman,et al.  Operating characteristics and extensions of the false discovery rate procedure , 2002 .

[7]  R. Spang,et al.  Predicting the clinical status of human breast cancer by using gene expression profiles , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M. Clyde,et al.  Model Uncertainty , 2003 .

[9]  J F Cornhill,et al.  Topographic Study of Sudanophilic Lesions in Cholesterol‐Fed Minipigs by Image Analysis , 1985, Arteriosclerosis.

[10]  A. Raz On the Role of Galectin-3 in Cancer Metastasis : From the Bench to the Clinic and Back , 2008 .

[11]  C. Devlin,et al.  Genetic alterations of IL-1 receptor antagonist in mice affect plasma cholesterol level and foam cell lesion size , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[12]  R. Mahley,et al.  Apolipoprotein E: cholesterol transport protein with expanding role in cell biology. , 1988, Science.

[13]  H. Hamm,et al.  Thrombin Modulates the Expression of a Set of Genes Including Thrombospondin-1 in Human Microvascular Endothelial Cells* , 2005, Journal of Biological Chemistry.

[14]  K. Nath,et al.  Clusterin protects against oxidative stress in vitro through aggregative and nonaggregative properties. , 1998, Kidney international.

[15]  Matthew West,et al.  Bayesian factor regression models in the''large p , 2003 .

[16]  M. West,et al.  High-Dimensional Sparse Factor Modeling: Applications in Gene Expression Genomics , 2008, Journal of the American Statistical Association.

[17]  J. S. Rao,et al.  Spike and Slab Gene Selection for Multigroup Microarray Data , 2005 .

[18]  Zhigang Zhou,et al.  TNFR1-induced NF-kappaB, but not ERK, p38MAPK or JNK activation, mediates TNF-induced ICAM-1 and VCAM-1 expression on endothelial cells. , 2007, Cellular signalling.

[19]  S. Fazio,et al.  Prevention of atherosclerosis in apolipoprotein E-deficient mice by bone marrow transplantation , 1995, Science.

[20]  Carlos M. Carvalho,et al.  Sparse Statistical Modelling in Gene Expression Genomics , 2006 .

[21]  Rafael A Irizarry,et al.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data. , 2003, Biostatistics.

[22]  M. West,et al.  Gene expression profiling and genetic markers in glioblastoma survival. , 2005, Cancer research.

[23]  J. S. Rao,et al.  Detecting Differentially Expressed Genes in Microarrays Using Bayesian Model Selection , 2003 .

[24]  P. Delafontaine,et al.  Differential effects of low density lipoproteins on insulin-like growth factor-1 (IGF-1) and IGF-1 receptor expression in vascular smooth muscle cells. , 2000, The Journal of biological chemistry.

[25]  James T. Wu,et al.  Advanced glycosylation end products: A new disease marker for diabetes and aging , 1993, Journal of clinical laboratory analysis.

[26]  A. Bayés‐Genís,et al.  Effects of insulin-like growth factor-I on cultured human coronary artery smooth muscle cells. , 2003, Growth hormone & IGF research : official journal of the Growth Hormone Research Society and the International IGF Research Society.

[27]  M. West,et al.  Gene expression predictors of breast cancer outcomes , 2003, The Lancet.

[28]  Mike West,et al.  Distinctions in the specificity of E2F function revealed by gene expression signatures. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Rainer Spang,et al.  DNA Microarray Data Analysis and Regression Modeling for Genetic Expression Profiling , 2000 .

[30]  D. Kaul Molecular link between cholesterol, cytokines and atherosclerosis , 2001, Molecular and Cellular Biochemistry.

[31]  M. West,et al.  Gene Expression Phenotypes of Oncogenic Signaling Pathways , 2003, Cell cycle.

[32]  D. Madigan,et al.  Bayesian Model Averaging for Linear Regression Models , 1997 .

[33]  M. Boes,et al.  Insulin, insulin-like growth factors, and vascular endothelium. , 1988, The American journal of medicine.

[34]  G. Norata,et al.  Oxidized Lipoproteins and Endothelium , 2000, Clinical chemistry and laboratory medicine.

[35]  In Gyu Kim,et al.  LPS-induced CD53 expression: a protection mechanism against oxidative and radiation stress. , 2004, Molecules and cells.

[36]  T. Speed,et al.  Summaries of Affymetrix GeneChip probe level data. , 2003, Nucleic acids research.

[37]  M. West,et al.  Molecular evidence for arterial repair in atherosclerosis. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  M. Cybulsky,et al.  Adhesion of Monocytes to Arterial Endothelium and Initiation of Atherosclerosis Are Critically Dependent on Vascular Cell Adhesion Molecule-1 Gene Dosage , 2001, Arteriosclerosis, thrombosis, and vascular biology.

[39]  M. West,et al.  Distinct gene expression phenotypes of cells lacking Rb and Rb family members. , 2003, Cancer research.

[40]  James G. Scott,et al.  An exploration of aspects of Bayesian multiple testing , 2006 .

[41]  J F Cornhill,et al.  Topography of human aortic sudanophilic lesions. , 1990, Monographs on atherosclerosis.

[42]  M. West,et al.  Gene Expression Phenotypes of Atherosclerosis , 2004, Arteriosclerosis, thrombosis, and vascular biology.

[43]  L. Fabbri,et al.  Clusterin decreases oxidative stress in lung fibroblasts exposed to cigarette smoke. , 2006, American journal of respiratory and critical care medicine.

[44]  M. West,et al.  Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Y. Shen,et al.  p53-dependent apoptosis pathways. , 2001, Advances in cancer research.

[46]  D. Green,et al.  Mechanisms of p53-dependent apoptosis. , 2001, Biochemical Society transactions.

[47]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[48]  G. Romeo,et al.  Role of galectin-3 as a receptor for advanced glycosylation end products. , 2000, Kidney international. Supplement.

[49]  S. Menini,et al.  Development of age-dependent glomerular lesions in galectin-3/AGE-receptor-3 knockout mice. , 2005, American journal of physiology. Renal physiology.

[50]  Jeffrey T. Chang,et al.  Oncogenic pathway signatures in human cancers as a guide to targeted therapies , 2006, Nature.

[51]  Sylvia Richardson,et al.  Bayesian Hierarchical Model for Identifying Changes in Gene Expression from Microarray Experiments , 2002, J. Comput. Biol..

[52]  G. Pugliese,et al.  Role of galectin-3 in diabetic nephropathy. , 2003, Journal of the American Society of Nephrology : JASN.

[53]  P. Wilson,et al.  Monocyte cytokine production in an elderly population: effect of age and inflammation. , 1998, The journals of gerontology. Series A, Biological sciences and medical sciences.

[54]  S. Schwartz,et al.  Osteopontin Expression in Cardiovascular Diseases a , 1995, Annals of the New York Academy of Sciences.