A fast and sensitive multiple sequence alignment algorithm

A two-step multiple alignment strategy is presented that allows rapid alignment of a set of homologous sequences and comparison of pre-aligned groups of sequences. Examples are given demonstrating the improvement in the quality of alignments when comparing entire groups instead of single sequences. The modular design of computer programs based on this algorithm allows for storage of aligned sequences and successive alignment of any number of sequences.
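
The abstract does not spell out the scoring scheme, so the following is only a minimal sketch of the general idea of comparing pre-aligned groups rather than single sequences. It assumes a simple match/mismatch score averaged over all residue pairs in two columns, a linear gap penalty, and global dynamic programming. The names `column_score` and `align_groups` and all parameter values are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

GAP = "-"


def column_score(col_a: str, col_b: str,
                 match: float = 1.0, mismatch: float = -1.0) -> float:
    """Average pairwise score between two alignment columns.

    Assumption for illustration: residue pairs involving a gap score 0.
    """
    total = 0.0
    for x in col_a:
        for y in col_b:
            if x == GAP or y == GAP:
                continue
            total += match if x == y else mismatch
    return total / (len(col_a) * len(col_b))


def align_groups(group_a: List[str], group_b: List[str],
                 gap_penalty: float = -1.0) -> Tuple[List[str], List[str]]:
    """Globally align two pre-aligned groups, column against column.

    Each group is a list of equal-length strings. Gaps are only ever
    inserted as whole columns, so the alignment inside each group is kept.
    """
    cols_a = ["".join(c) for c in zip(*group_a)]
    cols_b = ["".join(c) for c in zip(*group_b)]
    n, m = len(cols_a), len(cols_b)

    # Dynamic programming matrix with cumulative gap costs on the borders.
    score = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = score[i - 1][0] + gap_penalty
    for j in range(1, m + 1):
        score[0][j] = score[0][j - 1] + gap_penalty
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + column_score(cols_a[i - 1], cols_b[j - 1])
            up = score[i - 1][j] + gap_penalty
            left = score[i][j - 1] + gap_penalty
            score[i][j] = max(diag, up, left)

    # Traceback, building the merged alignment from the end.
    out_a: List[List[str]] = [[] for _ in group_a]
    out_b: List[List[str]] = [[] for _ in group_b]
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0 and score[i][j] ==
                score[i - 1][j - 1] + column_score(cols_a[i - 1], cols_b[j - 1])):
            col_a, col_b = cols_a[i - 1], cols_b[j - 1]
            i, j = i - 1, j - 1
        elif i > 0 and (j == 0 or score[i][j] == score[i - 1][j] + gap_penalty):
            col_a, col_b = cols_a[i - 1], GAP * len(group_b)
            i -= 1
        else:
            col_a, col_b = GAP * len(group_a), cols_b[j - 1]
            j -= 1
        for row, ch in zip(out_a, col_a):
            row.append(ch)
        for row, ch in zip(out_b, col_b):
            row.append(ch)

    return (["".join(reversed(r)) for r in out_a],
            ["".join(reversed(r)) for r in out_b])
```

For example, `align_groups(["ACGT", "ACGT"], ["AGT"])` aligns the single sequence against the two-sequence group as a whole. Because the output is itself a pre-aligned group, it can be stored and passed back in as `group_a` to add further sequences one at a time, which mirrors the successive alignment the abstract describes.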
