A brief review of protein sequence pattern matching

Of the many current and abandoned methods for predicting a protein’s tertiary structure from its amino acid sequence, the diverse yet related set of pattern matching techniques offers perhaps the best potential for solving this fundamental problem in molecular biology. The need for reliable structure prediction techniques has never been greater, and with the prospect of sequencing the entire human genome, the need for a workable solution can only become more acute.
