Wrap-and-Pack: A New Paradigm for Beta Structural Motif Recognition with Application to Recognizing Beta Trefoils

A method is presented that uses beta-strand interactions at both the sequence and the atomic level to predict beta-structural motifs in protein sequences. A program called Wrap-and-Pack implements this method and is shown to recognize beta-trefoils, an important class of globular beta-structures, in the Protein Data Bank with 92% specificity and 92.3% sensitivity in cross-validation. It is demonstrated that Wrap-and-Pack learns each of the ten known SCOP beta-trefoil families when trained primarily on beta-structures that are not beta-trefoils, together with three-dimensional structures of known beta-trefoils from outside the family being tested. Wrap-and-Pack also predicts many proteins of unknown structure to be beta-trefoils. The computational method used here may generalize to other beta-structures for which strand topology and profiles of residue accessibility are well conserved.
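As an illustration of how the reported specificity and sensitivity figures are defined, the following minimal sketch computes both from binary predictions. The function name, the toy labels, and the toy predictions are hypothetical and are not taken from the Wrap-and-Pack program or its data.

```python
def sensitivity_specificity(labels, predictions):
    """Return (sensitivity, specificity) for binary classifications.

    labels, predictions: equal-length sequences of bools,
    where True means "beta-trefoil" (assumed encoding).
    """
    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y and p)          # trefoils correctly recognized
    fn = sum(1 for y, p in pairs if y and not p)      # trefoils missed
    tn = sum(1 for y, p in pairs if not y and not p)  # non-trefoils correctly rejected
    fp = sum(1 for y, p in pairs if not y and p)      # non-trefoils wrongly accepted
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy example (made-up values): 3 true trefoils, 4 non-trefoils.
labels = [True, True, True, False, False, False, False]
predictions = [True, True, False, False, False, False, True]
sens, spec = sensitivity_specificity(labels, predictions)
print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

In a cross-validation setting such as the one described above, the predictions for each held-out family would be pooled before these ratios are computed; how the original study pooled its results is not stated here.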
