The functional landscape of mouse gene expression

Background: Large-scale quantitative analysis of transcriptional co-expression has been used to dissect regulatory networks and to predict the functions of new genes discovered by genome sequencing in model organisms such as yeast. Although the idea that tissue-specific expression is indicative of gene function in mammals is widely accepted, it has been neither objectively tested nor compared with the related but distinct strategy of using gene co-expression to predict gene function.

Results: We generated microarray expression data for nearly 40,000 known and predicted mRNAs in 55 mouse tissues, using custom-built oligonucleotide arrays. We show that quantitative transcriptional co-expression is a powerful predictor of gene function. Hundreds of functional categories, as defined by Gene Ontology 'Biological Processes', are associated with characteristic expression patterns across all tissues, including categories that bear no overt relationship to the tissue of origin. In contrast, simple tissue-specific restriction of expression is a poor predictor of which genes belong to which functional categories. As an example, the highly conserved mouse gene PWP1 is widely expressed across different tissues but is co-expressed with many RNA-processing genes; we show that the uncharacterized yeast homolog of PWP1 is required for rRNA biogenesis.

Conclusions: We conclude that 'functional genomics' strategies based on quantitative transcriptional co-expression will be as fruitful in mammals as they have been in simpler organisms, and that transcriptional control of mammalian physiology is more modular than is generally appreciated. Our data and analyses provide a public resource for mammalian functional genomics.
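The contrast drawn above, between co-expression and simple tissue restriction, can be illustrated with a toy calculation. The abstract does not spell out the scoring procedure, so the following is only a minimal sketch that assumes Pearson correlation as the co-expression measure. The expression values, the five tissues, the category names, and the helper functions `pearson` and `coexpression_score` are all hypothetical and are not taken from the paper's data or methods.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    # A flat profile has no defined correlation; treat it as uncorrelated.
    return cov / (sx * sy) if sx and sy else 0.0


def coexpression_score(query, category_profiles):
    """Mean correlation of a query gene with the genes annotated to a category."""
    return mean(pearson(query, p) for p in category_profiles)


# Toy expression levels across five hypothetical tissues (made-up numbers).
categories = {
    # Broadly expressed genes that rise and fall together across tissues.
    "rna_processing": [
        [5.0, 6.0, 9.0, 7.0, 5.5],
        [4.8, 6.2, 8.7, 7.1, 5.2],
    ],
    # Genes restricted to a single tissue (the fourth one).
    "muscle_contraction": [
        [0.1, 0.2, 0.1, 9.0, 0.1],
        [0.2, 0.1, 0.3, 8.5, 0.2],
    ],
}

# An unannotated gene expressed in every tissue, so it is not tissue-specific.
query = [5.1, 6.1, 9.2, 6.8, 5.4]

scores = {name: coexpression_score(query, profiles)
          for name, profiles in categories.items()}
best = max(scores, key=scores.get)
print(best, scores)
```

In this toy case the query gene is present in every tissue, so a tissue-restriction rule gives no hint about its function, while its quantitative profile tracks the hypothetical RNA-processing genes closely. That mirrors the PWP1 example in spirit only; a real analysis would need many more tissues, proper normalization, and multiple-testing control.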
