Rapid and asymmetric divergence of duplicate genes in the human gene coexpression network

Background: While gene duplication is known to be one of the most common mechanisms of genome evolution, the fates of genes after duplication are still being debated. In particular, it is presently unknown whether most duplicate genes preserve (or subdivide) the functions of the parental gene or acquire new functions. One aspect of gene function, namely the expression profile in a gene coexpression network, has been largely unexplored for duplicate genes.

Results: Here we build a human gene coexpression network using human tissue-specific microarray data and investigate the divergence of duplicate genes in it. The topology of this network is scale-free. Interestingly, our analysis indicates that duplicate genes rapidly lose shared coexpressed partners: approximately 50 million years after duplication, the two duplicate genes in a pair have only a slightly higher number of shared partners than two random singletons. We also show that duplicate gene pairs quickly acquire new coexpressed partners: the average number of partners for a duplicate gene pair is significantly greater than that for a singleton (the latter number can be used as a proxy for the number of partners of the parental singleton gene before duplication). The divergence in gene expression between the two duplicates in a pair occurs asymmetrically: one gene usually has more partners than the other. The network is resilient to both random and degree-based in silico removal of either singletons or duplicate genes. In contrast, the network is especially vulnerable to the removal of highly connected genes when duplicate genes and singletons are considered together.

Conclusion: Duplicate genes rapidly diverge in their expression profiles in the network and play a role similar to that of singletons in maintaining network robustness.

Contact: kdm16@psu.edu

Supplementary information: Please see additional files.
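The quantities discussed in the Results (coexpressed partners, shared partners of a duplicate pair, and asymmetry in partner number) can be illustrated with a minimal sketch. Everything below is an assumption for illustration and not the authors' pipeline: the abstract does not state how edges were defined, so the sketch links two genes when the Pearson correlation of their tissue expression profiles reaches a hypothetical threshold of 0.7, and the gene names and expression values are toy data.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # a flat profile carries no coexpression signal
    return cov / sqrt(vx * vy)


def build_network(expression, threshold=0.7):
    """Map each gene to its set of coexpressed partners.

    The threshold is a hypothetical choice, not taken from the paper.
    """
    partners = {gene: set() for gene in expression}
    for g1, g2 in combinations(expression, 2):
        if pearson(expression[g1], expression[g2]) >= threshold:
            partners[g1].add(g2)
            partners[g2].add(g1)
    return partners


def shared_partners(partners, a, b):
    """Number of partners common to genes a and b (excluding a and b)."""
    return len((partners[a] & partners[b]) - {a, b})


def degree_asymmetry(partners, a, b):
    """Absolute difference in partner number between genes a and b."""
    return abs(len(partners[a]) - len(partners[b]))


# Toy expression profiles across five tissues (invented values).
expression = {
    "dupA": [1, 2, 3, 4, 5],
    "dupB": [5, 4, 3, 2, 1],
    "g1": [2, 4, 6, 8, 10],
    "g2": [1, 2, 3, 5, 4],
    "g3": [10, 8, 6, 4, 2],
}

network = build_network(expression)
n_shared = shared_partners(network, "dupA", "dupB")
asymmetry = degree_asymmetry(network, "dupA", "dupB")
print(network["dupA"], network["dupB"], n_shared, asymmetry)
```

In this toy case the two hypothetical duplicates have diverged completely: they share no partners, and one has more partners than the other, which is the kind of asymmetric pattern the Results describe.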
