Multiplex meta-analysis of RNA expression to identify genes with variants associated with immune dysfunction

Objective: We demonstrate a genome-wide method, multiplex meta-analysis, for integrating many gene expression studies of phenotypically similar disease processes. We use immune dysfunction as the example disease process.

Design: We use a heterogeneous collection of datasets from human and mouse samples, spanning a range of tissues and different forms of immunodeficiency. Our method uses Tibshirani's modified t-test (SAM) to interrogate differential expression within each study, and Fisher's method for omnibus meta-analysis to identify differentially expressed genes across studies. We evaluate the ability of the resulting overall gene expression profile to prioritize disease-associated genes by comparing it against the results of a recent genome-wide association study of common variable immunodeficiency (CVID).

Results: Our approach is able to prioritize genes associated with immunodeficiency in general (area under the ROC curve = 0.713) and with CVID in particular (area under the ROC curve = 0.643).

Conclusions: This approach may be used to investigate a larger range of failures of the immune system. The method may also be extended to other disease processes, using RNA levels to prioritize genes likely to contain disease-associated DNA variants.
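The three statistical ingredients named in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the function names, the fudge constant `s0 = 0.1`, and the input layout are all hypothetical. SAM derives its significance estimates from permutations, which are not implemented here, so the per-study p-values passed to Fisher's method are assumed to be already available and strictly greater than zero.

```python
import math
from statistics import mean, variance


def modified_t(group_a, group_b, s0=0.1):
    """SAM-style statistic for one gene in one study.

    Difference in group means divided by (pooled standard error + s0).
    The constant s0 damps genes with very small variance; 0.1 is an
    arbitrary illustrative value, not one taken from the paper.
    """
    na, nb = len(group_a), len(group_b)
    pooled_var = ((na - 1) * variance(group_a)
                  + (nb - 1) * variance(group_b)) / (na + nb - 2)
    std_err = math.sqrt(pooled_var * (1 / na + 1 / nb))
    return (mean(group_a) - mean(group_b)) / (std_err + s0)


def fisher_combined_p(p_values):
    """Fisher's method: combine k independent p-values for one gene.

    X = -2 * sum(ln p_i) follows a chi-square distribution with 2k
    degrees of freedom under the null. For even degrees of freedom the
    upper tail has the closed form exp(-X/2) * sum_{i<k} (X/2)^i / i!.
    """
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # this is X / 2
    term, total = 1.0, 1.0
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


def roc_auc(scores, labels):
    """Area under the ROC curve as the probability that a positive
    gene outscores a negative gene, counting ties as one half."""
    pos = [s for s, y in zip(scores, labels) if y]
    neg = [s for s, y in zip(scores, labels) if not y]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))
```

To rank genes, one option is to score each gene by the negative logarithm of its combined p-value, so that smaller p-values rank higher, and to pass those scores to `roc_auc` together with 0/1 labels marking the genes in a reference set such as GWAS hits.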
