Connectivity of Neutral Networks, Overdispersion, and Structural Conservation in Protein Evolution

Protein structures are much more conserved than sequences during evolution. Based on this observation, we investigate the consequences of structural conservation on protein evolution. We study seven of the most studied protein folds, determining that an extended neutral network in sequence space is associated with each of them. Within our model, neutral evolution leads to a non-Poissonian substitution process, due to the broad distribution of connectivities in neutral networks. The observation that the substitution process has non-Poissonian statistics has been used to argue against the original Kimura neutral theory, while our model shows that this is a generic property of neutral evolution with structural conservation. Our model also predicts that the substitution rate can strongly fluctuate from one branch to another of the evolutionary tree. The average sequence similarity within a neutral network is close to the threshold of randomness, as observed for families of sequences sharing the same fold. Nevertheless, some positions are more difficult to mutate than others. We compare such structurally conserved positions to positions conserved in protein evolution, suggesting that our model can be a valuable tool to distinguish structural from functional conservation in databases of protein families. These results indicate that a synergy between database analysis and structurally based computational studies can increase our understanding of protein evolution.

[1]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[2]  E. Shakhnovich,et al.  Engineering of stable and fast-folding sequences of model proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[3]  E. Koonin,et al.  The Impact of Comparative Genomics on Our Understanding of Evolution , 2000, Cell.

[4]  O. Ptitsyn,et al.  Non-functional conserved residues in globins and their possible role as a folding nucleus. , 1999, Journal of molecular biology.

[5]  J M Blackburn,et al.  Directed evolution of new catalytic activity using the alpha/beta-barrel scaffold. , 2000, Nature.

[6]  D. Thirumalai,et al.  Thermodynamic stability of folded proteins against mutations , 1997, cond-mat/9709078.

[7]  E. Shakhnovich,et al.  Folding and misfolding of designed proteinlike chains with mutations , 1998 .

[8]  Frances H. Arnold,et al.  Computational method to reduce the search space for directed protein evolution , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Michele Vendruscolo,et al.  Neutral evolution of model proteins: diffusion in sequence space and overdispersion. , 1998, Journal of theoretical biology.

[10]  N. Kannan,et al.  Aromatic clusters: a determinant of thermal stability of thermophilic proteins. , 2000, Protein engineering.

[11]  P. Schuster,et al.  From sequences to shapes and back: a case study in RNA secondary structures , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[12]  T. Ohta Role of very slightly deleterious mutations in molecular evolution and polymorphism. , 1976, Theoretical population biology.

[13]  M. Huynen,et al.  Smoothness within ruggedness: the role of neutrality in adaptation. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[14]  N. Takahata,et al.  On the overdispersed molecular clock. , 1987, Genetics.

[15]  M. Kimura Preponderance of synonymous changes as evidence for the neutral theory of molecular evolution , 1977, Nature.

[16]  Ugo Bastolla,et al.  Un modèle statistique d'évolution avec sélection stabilisante , 1991 .

[17]  E. Shakhnovich,et al.  Understanding hierarchical protein evolution from first principles. , 2001, Journal of molecular biology.

[18]  S. Vishveshwara,et al.  Identification of side-chain clusters in protein structures by a graph spectral method. , 1999, Journal of molecular biology.

[19]  E. Bornberg-Bauer,et al.  Modeling evolutionary landscapes: mutational stability, topology, and superfunnels in sequence space. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[20]  Frances M. G. Pearl,et al.  Protein folds, functions and evolution. , 1999, Journal of molecular biology.

[21]  E. Bornberg-Bauer,et al.  How are model protein structures distributed in sequence space? , 1997, Biophysical journal.

[22]  L. Hood,et al.  Gene families: the taxonomy of protein paralogs and chimeras. , 1997, Science.

[23]  U. Bastolla,et al.  Structurally constrained protein evolution: results from a lattice simulation , 1999, cond-mat/9912331.

[24]  R. Britten,et al.  Rates of DNA sequence evolution differ between taxonomic groups. , 1986, Science.

[25]  F. Ayala,et al.  Vagaries of the molecular clock. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Jon E. Ness,et al.  Predicting the emergence of antibiotic resistance by directed evolution and structural analysis , 2001, Nature Structural Biology.

[27]  Linda M. Wills,et al.  Reverse Engineering , 1996, Springer US.

[28]  U Bastolla,et al.  A statistical mechanical method to optimize energy functions for protein folding. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[29]  P. Schuster,et al.  IR-98-039 / April Continuity in Evolution : On the Nature of Transitions , 1998 .

[30]  A Maritan,et al.  Molecular dynamics studies on HIV‐1 protease: Drug resistance and folding pathways , 2001, Proteins.

[31]  Charles H. Langley,et al.  An examination of the constancy of the rate of molecular evolution , 2005, Journal of Molecular Evolution.

[32]  R A Goldstein,et al.  The foldability landscape of model proteins , 1997, Biopolymers.

[33]  J. L. King,et al.  Non-Darwinian evolution. , 1969, Science.

[34]  T. Ohta,et al.  On the constancy of the evolutionary rate of cistrons , 2005, Journal of Molecular Evolution.

[35]  R A Goldstein,et al.  On the thermodynamic hypothesis of protein folding. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[36]  O. Ptitsyn,et al.  Protein folding and protein evolution: common folding nucleus in different subfamilies of c-type cytochromes? , 1998, Journal of molecular biology.

[37]  L. Mirny,et al.  Universally conserved positions in protein folds: reading evolutionary signals about stability, folding kinetics and function. , 1999, Journal of molecular biology.

[38]  Anthony D. Keefe,et al.  Functional proteins from a random-sequence library , 2001, Nature.

[39]  L. Mirny,et al.  Evolutionary conservation of the folding nucleus. , 2000, Journal of molecular biology.

[40]  J. Gillespie The causes of molecular evolution , 1991 .

[41]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[42]  E. Shakhnovich,et al.  Conserved residues and the mechanism of protein folding , 1996, Nature.

[43]  J H Gillespie,et al.  Lineage effects and the index of dispersion of molecular evolution. , 1989, Molecular biology and evolution.

[44]  P. Harbury,et al.  Reverse engineering the (β/α)8 barrel fold , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Manfred J. Sippl,et al.  Neutral Networks in Protein Space , 1996 .

[46]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[47]  U Bastolla,et al.  How to guarantee optimal stability for most representative structures in the protein data bank , 2001, Proteins.

[48]  B. Rost,et al.  Protein structures sustain evolutionary drift. , 1997, Folding & design.

[49]  M. Gerstein Patterns of protein‐fold usage in eight microbial genomes: A comprehensive structural census , 1998, Proteins.

[50]  P. Wolynes,et al.  Optimal protein-folding codes from spin-glass theory. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[51]  L A Mirny,et al.  How to derive a protein folding potential? A new approach to an old problem. , 1996, Journal of molecular biology.

[52]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[53]  E. Shakhnovich,et al.  Proteins with selected sequences fold into unique native conformation. , 1994, Physical review letters.

[54]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[55]  Alan R. Fersht,et al.  Directed evolution of new catalytic activity using the α/β-barrel scaffold , 2000, Nature.