论文信息 - Reconstruction of Genuine Pair-Wise Sequence Alignment

Reconstruction of Genuine Pair-Wise Sequence Alignment

In many applications, the algorithmically obtained alignment ideally should restore the "golden standard" (GS) alignment, which superimposes positions originating from the same position of the common ancestor of the compared sequences. The average similarity between the algorithmically obtained and GS alignments ("the quality") is an important characteristic of an alignment algorithm. We proposed to determine the quality of an algorithm, using sequences that were artificially generated in accordance with an appropriate evolution model; the approach was applied to the global version of the Smith-Waterman algorithm (SWA). The quality of SWA is between 97% (for a PAM distance of 60) and 70% (for a PAM distance of 300). The percentage of identical aligned residues is the same for algorithmic and GS alignments. The total length of indels in algorithmic alignments is less than in the GS-mainly due to a substantial decrease in the number of indels in algorithmic alignments.

Valery Polyanovsky | Mikhail A. Roytberg | Vladimir G. Tumanyan

[1] M. Vingron,et al. Quantifying the local reliability of a sequence alignment. , 1996, Protein engineering.

[2] P. Argos,et al. An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited. , 1995, Journal of molecular biology.

[3] A. Finkelstein,et al. From analysis of protein structural alignments toward a novel approach to align protein sequences , 2004, Proteins.

[4] M J Sippl,et al. Structure-based evaluation of sequence comparison and fold recognition alignment accuracy. , 2000, Journal of molecular biology.

[5] Folker Meyer,et al. Rose: generating sequence families , 1998, Bioinform..

[6] M S Waterman,et al. Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[7] D. Lipman,et al. Rapid and sensitive protein similarity searches. , 1985, Science.

[8] N. Saitou,et al. The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[9] William R. Pearson,et al. Empirical determination of effective gap penalties for sequence comparison , 2002, Bioinform..

[10] I. I. Litvinov,et al. Information on the secondary structure improves the quality of protein sequence alignment , 2006, Molecular Biology.

[11] G. Gonnet,et al. Empirical and structural models for insertions and deletions in the divergent evolution of proteins. , 1993, Journal of molecular biology.