Disease signatures are robust across tissues and experiments

Meta‐analyses combining gene expression microarray experiments offer new insights into the molecular pathophysiology of disease not evident from individual experiments. Although the established technical reproducibility of microarrays serves as a basis for meta‐analysis, pathophysiological reproducibility across experiments is not well established. In this study, we carried out a large‐scale analysis of disease‐associated experiments obtained from NCBI GEO, and evaluated their concordance across a broad range of diseases and tissue types. On evaluating 429 experiments, representing 238 diseases and 122 tissues from 8435 microarrays, we find evidence for a general, pathophysiological concordance between experiments measuring the same disease condition. Furthermore, we find that the molecular signature of disease across tissues is overall more prominent than the signature of tissue expression across diseases. The results offer new insight into the quality of public microarray data using pathophysiological metrics, and support new directions in meta‐analysis that include characterization of the commonalities of disease irrespective of tissue, as well as the creation of multi‐tissue systems models of disease pathology using public data.
