Impacts of gene essentiality, expression pattern, and gene compactness on the evolutionary rate of mammalian proteins.

Understanding the determinants of the rate of protein sequence evolution is of fundamental importance in evolutionary biology. Many recent studies have focused on the yeast because of the availability of many genome-wide expressional and functional data. Yeast studies revealed a predominant role of gene expression level and a minor role of gene essentiality in determining the rate of protein sequence evolution. Whether these rules apply to complex organisms such as mammals is unclear. Here we assemble a list of 1,138 essential and 2,341 nonessential mouse genes based on targeted gene deletion experiments and report a significant impact of gene essentiality on the rate of mammalian protein evolution. Gene expression level has virtually no effect, although tissue specificity in expression pattern has a strong influence. Unexpectedly, gene compactness, measured by average intron size and untranslated region length, has the greatest influence. Hence, the relative importance of the various factors in determining the rate of mammalian protein evolution is gene compactness > gene essentiality approximately tissue specificity > expression level. Our results suggest a considerable variation in rate determinants between unicellular organisms such as the yeast and multicellular organisms such as mammals.

[1]  T. Ohta,et al.  On some principles governing molecular evolution. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Jian Lu,et al.  Weak selection revealed by the whole-genome comparison of the X chromosome and autosomes of human and chimpanzee. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[3]  C. Wilke,et al.  A single determinant dominates the rate of yeast protein evolution. , 2006, Molecular biology and evolution.

[4]  Michal Galdzicki,et al.  Mammalian overlapping genes: the comparative perspective. , 2004, Genome research.

[5]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[6]  M. Kimura The Neutral Theory of Molecular Evolution: Introduction , 1983 .

[7]  Jianzhi Zhang,et al.  Significant impact of protein dispensability on the instantaneous rate of protein evolution. , 2005, Molecular biology and evolution.

[8]  A. Vinogradov Compactness of human housekeeping genes: selection for economy or genomic design? , 2004, Trends in genetics : TIG.

[9]  A. Minton Implications of macromolecular crowding for protein assembly. , 2000, Current opinion in structural biology.

[10]  Bernardo Lemos,et al.  Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions. , 2005, Molecular biology and evolution.

[11]  Doron Lancet,et al.  Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification , 2005, Bioinform..

[12]  Eduardo P C Rocha,et al.  An analysis of determinants of amino acids substitution rates in bacterial proteins. , 2004, Molecular biology and evolution.

[13]  Takashi Makino,et al.  The evolutionary rate of a protein is influenced by features of the interacting partners. , 2006, Molecular biology and evolution.

[14]  Jianzhi Zhang,et al.  Low rates of expression profile divergence in highly expressed genes and tissue-specific genes during mammalian evolution. , 2006, Molecular biology and evolution.

[15]  A. E. Hirsh,et al.  Functional genomic analysis of the rates of protein evolution. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[16]  E. Koonin,et al.  Essential genes are more evolutionarily conserved than are nonessential genes in bacteria. , 2002, Genome research.

[17]  Wen-Hsiung Li,et al.  Rate of protein evolution versus fitness effect of gene deletion. , 2003, Molecular biology and evolution.

[18]  E. Birney,et al.  EnsMart: a generic system for fast and flexible access to biological data. , 2003, Genome research.

[19]  A. E. Hirsh,et al.  Evolutionary Rate in the Protein Interaction Network , 2002, Science.

[20]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[21]  M. Kreitman,et al.  Population, evolutionary and genomic consequences of interference selection. , 2002, Genetics.

[22]  R. Ellis,et al.  Medicine: Danger — misfolding proteins , 2002, Nature.

[23]  Jianzhi Zhang,et al.  Evolutionary conservation of expression profiles between human and mouse orthologous genes. , 2006, Molecular biology and evolution.

[24]  L. Orgel,et al.  Biochemical Evolution , 1971, Nature.

[25]  Cristian I. Castillo-Davis,et al.  Selection for short introns in highly expressed genes , 2002, Nature Genetics.

[26]  T. Ohta THE NEARLY NEUTRAL THEORY OF MOLECULAR EVOLUTION , 1992 .

[27]  C. Ponting,et al.  Elevated rates of protein secretion, evolution, and disease among tissue-specific genes. , 2003, Genome research.

[28]  C. Pál,et al.  Does the recombination rate affect the efficiency of purifying selection? The yeast genome provides a partial answer. , 2001, Molecular biology and evolution.

[29]  Matthew W. Hahn,et al.  Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. , 2005, Molecular biology and evolution.

[30]  A. Eyre-Walker,et al.  Human disease genes: patterns and predictions. , 2003, Gene.

[31]  T. Jukes,et al.  The neutral theory of molecular evolution. , 2000, Genetics.

[32]  Aleksey Y Ogurtsov,et al.  Bioinformatical assay of human gene morbidity. , 2004, Nucleic acids research.

[33]  C. Wilke,et al.  Why highly expressed proteins evolve slowly. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[34]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[35]  J. Kelso,et al.  Impact of the presence of paralogs on sequence divergence in a set of mouse-human orthologs. , 2002, Genome research.

[36]  Fang Yang,et al.  An abundance of X-linked genes expressed in spermatogonia , 2001, Nature Genetics.

[37]  C. Pál,et al.  Genomic function: Rate of evolution and gene dispensability. , 2003, Nature.

[38]  Cristian I. Castillo-Davis,et al.  Conservation, relocation and duplication in genome evolution. , 2003, Trends in genetics : TIG.

[39]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[40]  A. E. Hirsh,et al.  Protein dispensability and rate of evolution , 2001, Nature.

[41]  Sudhir Kumar,et al.  Gene Expression Intensity Shapes Evolutionary Rates of the Proteins Encoded by the Vertebrate Genome , 2004, Genetics.

[42]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Wen-Hsiung Li,et al.  Mammalian housekeeping genes evolve more slowly than tissue-specific genes. , 2004, Molecular biology and evolution.

[44]  Laurence D. Hurst,et al.  Genomic function (communication arising): Rate of evolution and gene dispensability , 2003, Nature.

[45]  Laurence D. Hurst,et al.  Do essential genes evolve slowly? , 1999, Current Biology.

[46]  Hunter B. Fraser,et al.  Modularity and evolutionary constraint on proteins , 2005, Nature Genetics.

[47]  J. Zhang,et al.  Protein-length distributions for the three domains of life. , 2000, Trends in genetics : TIG.

[48]  M. Nei,et al.  Pattern of nucleotide substitution at major histocompatibility complex class I loci reveals overdominant selection , 1988, Nature.

[49]  M. Goodman,et al.  Phylogenetic footprinting reveals unexpected complexity in trans factor binding upstream from the epsilon-globin gene. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[50]  Wei-Min Liu,et al.  Robust estimators for expression analysis , 2002, Bioinform..

[51]  R. DeSalle,et al.  Adaptive Evolution of Genes and Genomes , 2000, Heredity.

[52]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[53]  K. Hastings Strong evolutionary conservation of broadly expressed protein isoforms in the troponin I gene family and other vertebrate gene families , 1996, Journal of Molecular Evolution.

[54]  Hiroshi Akashi,et al.  Translational selection and yeast proteome evolution. , 2003, Genetics.

[55]  F. Sherman Getting started with yeast. , 1991, Methods in enzymology.

[56]  Bruce T Lahn,et al.  Genic mutation rates in mammals: local similarity, chromosomal heterogeneity, and X-versus-autosome disparity. , 2003, Molecular biology and evolution.

[57]  L. Duret,et al.  Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. , 2000, Molecular biology and evolution.

[58]  Jianzhi Zhang,et al.  Toward a Molecular Understanding of Pleiotropy , 2006, Genetics.

[59]  J. Brotherton The counting and sizing of spermatozoa from ten animal species using a coulter counter , 2009, Andrologia.