Systematic elicitation of sequence patterns associated with non-proline cis peptide bonds

Non-proline cis peptide bonds have been quite underrated for many years, due to the limited amount of structural information available. There is now significant evidence that non-proline cis peptide bonds occur more frequently than previously thought, and that they are often located at or near important sites of the protein molecule. In this work, we employ a combinatorial pattern discovery algorithm in order to identify simple and specific amino acid patterns, associated with the occurrence of non-proline cis peptide bonds in proteins. The derived patterns after careful validation help in gaining insight into the factors that influence the formation of non-proline cis peptide bonds.

[1]  Robert Preissner,et al.  Prediction of prolyl residues in cis‐conformation in protein structures on the basis of the amino acid sequence , 1990, FEBS letters.

[2]  Aris Floratos,et al.  Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm [published erratum appears in Bioinformatics 1998;14(2): 229] , 1998, Bioinform..

[3]  G Fischer,et al.  Regulation of peptide bond cis/trans isomerization by enzyme catalysis and its implication in physiological processes. , 2003, Reviews of physiology, biochemistry and pharmacology.

[4]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[5]  David S. Wishart,et al.  VADAR: a web server for quantitative evaluation of protein structure quality , 2003, Nucleic Acids Res..

[6]  A. Jabs,et al.  Peptide bonds revisited , 1998, Nature Structural &Molecular Biology.

[7]  S. Wodak,et al.  Relations between protein sequence and structure and their significance. , 1990, Journal of molecular biology.

[8]  K. Wüthrich,et al.  Nmr studies of the rates of proline cis–trans isomerization in oligopeptides , 1981 .

[9]  M J Rooman,et al.  Weak correlation between predictive power of individual sequence patterns and overall prediction accuracy in proteins , 1991, Proteins.

[10]  Amalio Telenti,et al.  Crystal structure of GyrA intein from Mycobacterium xenopi reveals structural basis of protein splicing , 1998, Nature Structural Biology.

[11]  A. Jabs,et al.  Non-proline cis peptide bonds in proteins. , 1999, Journal of molecular biology.

[12]  D. T. Jones,et al.  Sequence patterns associated with disordered regions in proteins , 2004, Proteins.

[13]  Dimitrios I. Fotiadis,et al.  Prediction of cis/trans isomerization using feature selection and support vector machines , 2009, J. Biomed. Informatics.

[14]  J E Wampler,et al.  Occurrence and role of cis peptide bonds in protein structures. , 1990, Journal of molecular biology.

[15]  D. Pal,et al.  Cis peptide bonds in proteins: residues involved, their conformations, interactions and locations. , 1999, Journal of molecular biology.

[16]  Dirk Labudde,et al.  COPS - Cis/trans peptide bond conformation prediction of amino acids on the basis of secondary structure information , 2005, Bioinform..

[17]  Jiangning Song,et al.  Prediction of cis/trans isomerization in proteins using PSI-BLAST profiles and secondary structure information , 2006, BMC Bioinformatics.

[18]  Wolfgang Stroebe Breaking up is hard to do. , 1983 .

[19]  C Ouzounis,et al.  Dictionary building via unsupervised hierarchical motif discovery in the sequence space of natural proteins , 1999, Proteins.

[20]  M-L Wang,et al.  Support vector machines for prediction of peptidyl prolyl cis/trans isomerization. , 2008, The journal of peptide research : official journal of the American Peptide Society.