Iterated profile searches with PSI-BLAST--a tool for discovery in protein databases.

We thank the developers of PSI-BLAST, who include D. J. Lipman, T. L. Madden, W. Miller, A. A. Schaffer, J. Zhang and Z. Zhang. We also thank L. Aravind for his collaboration on the application of PSI-BLAST to the detection of subtle relationships among proteins.

[1]  S. Shuman,et al.  RNA capping enzyme and DNA ligase: a superfamily of covalent nucleotidyl transferases , 1995, Molecular microbiology.

[2]  Durbin,et al.  Biological Sequence Analysis , 1998 .

[3]  Kevin Karplus,et al.  A Flexible Motif Search Technique Based on Generalized Profiles , 1996, Comput. Chem..

[4]  W. Pearson Empirical statistical estimates for sequence similarity searches. , 1998, Journal of molecular biology.

[5]  J. Wootton,et al.  Analysis of compositionally biased regions in sequence databases. , 1996, Methods in enzymology.

[6]  M. Uhlén,et al.  The sequence of a 30 kb fragment on the left arm of chromosome XV from Saccharomyces cerevisiae reveals 15 open reading frames, five of which correspond to previously identified genes , 1996, Yeast.

[7]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[8]  S F Altschul,et al.  Local alignment statistics. , 1996, Methods in enzymology.

[9]  C Sander,et al.  New structure--novel fold? , 1997, Structure.

[10]  E S Lander,et al.  Recognition of related proteins by iterative template refinement (ITR) , 1994, Protein science : a publication of the Protein Society.

[11]  W. Pearson Comparison of methods for searching protein sequence databases , 1995, Protein science : a publication of the Protein Society.

[12]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[13]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[14]  T. D. Schneider,et al.  Information content of binding sites on nucleotide sequences. , 1986, Journal of molecular biology.

[15]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[16]  A. Lupas Prediction and analysis of coiled-coil structures. , 1996, Methods in enzymology.

[17]  M Gribskov,et al.  Translational initiation factors IF-1 and eIF-2 alpha share an RNA-binding motif with prokaryotic ribosomal protein S1 and polynucleotide phosphorylase. , 1992, Gene.

[18]  Martin Vingron,et al.  Sequence Comparison Significance and Poisson Approximation , 1994 .

[19]  P Bork,et al.  Homology-based fold predictions for Mycoplasma genitalium proteins. , 1998, Journal of molecular biology.

[20]  Thomas L. Madden,et al.  Protein sequence similarity searches using patterns as seeds. , 1998, Nucleic acids research.

[21]  A. Dembo,et al.  Limit Distribution of Maximal Non-Aligned Two-Sequence Segmental Score , 1994 .