Biological roles of specific peptides in enzymes

It has recently been shown (Kunik et al., PLOS Comput Biol 2007;3(8):e167) that the occurrence of specific peptides (SPs) on sequences of enzymes allows for accurate EC classification of enzymes. We inquire whether these SPs play important roles in bringing about the enzymatic function. This is assessed by cross‐checking the occurrence of SPs on enzymes with Swiss‐Prot annotations and PDB spatial structures of enzymes. Analyzing the coverage of functional annotations of enzymes, we demonstrate that SPs contain major fractions of all annotated features. This result is statistically highly significant and associates over 10% of all SPs with important biological markers. Concentrating on DNA binding regions, relevant to LexA repressor enzymes, we find interesting coverage patterns. Moreover, for the same data, we demonstrate that SPs allow for subclassification of the relevant bacteria into phylogenetic classes. An analysis of mutagen annotations on SPs appearing on all enzymes leads to the conclusion that mutations on SPs tend to damage the enzymatic function much more than expected from a background model, hence SPs are of high importance to enzymatic functions. SPs that lie in 3D pockets that are shared by active and binding sites, are shown to be significantly enriched by glycine, leading to the hypothesis that they are responsible for conformational plasticity. Finally we show that SPs can partially resolve outstanding difficult problems of convergent evolution by representing correctly enzyme functions in spite of remote homologies in sequence and in structure. Proteins 2008. © 2008 Wiley‐Liss, Inc.

[1]  Vladimir Poroikov,et al.  Prediction of protein functional specificity without an alignment. , 2006, Omics : a journal of integrative biology.

[2]  R. Kaptein,et al.  Solution structure of the LexA repressor DNA binding domain determined by 1H NMR spectroscopy. , 1994, The EMBO journal.

[3]  M. Gerstein,et al.  The relationship between protein structure and function: a comprehensive survey with application to the yeast genome. , 1999, Journal of molecular biology.

[4]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[5]  Eytan Ruppin,et al.  Unsupervised learning of natural languages , 2006 .

[6]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[7]  N. Xuong,et al.  A binary complex of the catalytic subunit of cAMP-dependent protein kinase and adenosine further defines conformational flexibility. , 1997, Structure.

[8]  B X Yan,et al.  Glycine Residues Provide Flexibility for Enzyme Active Sites* , 1997, The Journal of Biological Chemistry.

[9]  R. Abagyan,et al.  Large‐scale prediction of protein geometry and stability changes for arbitrary single point mutations , 2004, Proteins.

[10]  M Ptashne,et al.  Mechanism of action of the lexA gene product. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[11]  F A Quiocho,et al.  A Pro to Gly mutation in the hinge of the arabinose-binding protein enhances binding and alters specificity. Sugar-binding and crystallographic studies. , 1992, The Journal of biological chemistry.

[12]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[13]  R. Bywater,et al.  Computational analysis of chain flexibility and fluctuations in Rhizomucor miehei lipase. , 1999, Protein engineering.

[14]  David A. Agard,et al.  Structural plasticity broadens the specificity of an engineered protease , 1989, Nature.

[15]  C. Sander,et al.  The amino-acid mutational spectrum of human genetic disease , 2003, Genome Biology.

[16]  Eytan Ruppin,et al.  Functional Representation of Enzymes by Specific Peptides , 2007, PLoS Comput. Biol..

[17]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[18]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[19]  Jie Liang,et al.  CASTp: Computed Atlas of Surface Topography of proteins , 2003, Nucleic Acids Res..

[20]  M. Page How do enzymes work? , 1985, Nature.

[21]  T. Horii,et al.  Nucleotide sequence of the lexA gene of E. coli , 1981, Cell.

[22]  D. Koshland Application of a Theory of Enzyme Specificity to Protein Synthesis. , 1958, Proceedings of the National Academy of Sciences of the United States of America.

[23]  L Serrano,et al.  Alpha-helix stability in proteins. I. Empirical correlations concerning substitution of side-chains at the N and C-caps and the replacement of alanine by glycine or serine at solvent-exposed surfaces. , 1992, Journal of molecular biology.

[24]  A Teplyakov,et al.  Crystal structure of bacteriophage T4 deoxynucleotide kinase with its substrates dGMP and ATP. , 1996, The EMBO journal.