Solving Sequence Alignment Problem using Pipeline Approach

This paper presents two models, both based on a pipeline approach, for determining the pairwise alignment of two molecular sequences. One model uses a variation of the Needleman-Wunsch method as its basic algorithm; the other is based on a scoring matrix. The purpose of the pipelines is to significantly reduce the time complexity of alignment. The paper also discusses the design and implementation of the basic linear versions of the algorithms in our software tool, the "Sequence Comparison and Analysis Tool (SCAT)". The tool also offers sequence alignment based on common groupings such as chemical, functional, and structural. It is implemented in Visual Basic 6 with a user-friendly Windows interface.
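For orientation, the following is a minimal Python sketch of the standard Needleman-Wunsch global alignment, not the variation or the pipelined models the paper proposes (the abstract does not specify them). The function name `needleman_wunsch` and the scoring values (match = 1, mismatch = -1, gap = -1) are assumptions chosen for illustration only.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align strings a and b; return (score, aligned_a, aligned_b).

    Scoring values are illustrative assumptions, not taken from the paper.
    """
    rows, cols = len(a) + 1, len(b) + 1

    def sub(i, j):
        # Substitution score for aligning a[i-1] with b[j-1].
        return match if a[i - 1] == b[j - 1] else mismatch

    # score[i][j] = best score for aligning a[:i] with b[:j].
    score = [[0] * cols for _ in range(rows)]
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap

    # Fill the table; each cell depends on its upper, left, and
    # upper-left neighbours only.
    for i in range(1, rows):
        for j in range(1, cols):
            score[i][j] = max(
                score[i - 1][j - 1] + sub(i, j),  # align two residues
                score[i - 1][j] + gap,            # gap in b
                score[i][j - 1] + gap,            # gap in a
            )

    # Trace back from the bottom-right corner to recover one optimal alignment.
    i, j = len(a), len(b)
    out_a, out_b = [], []
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub(i, j):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return score[-1][-1], "".join(reversed(out_a)), "".join(reversed(out_b))


if __name__ == "__main__":
    print(needleman_wunsch("AC", "AGC"))
```

Because each cell depends only on its upper, left, and upper-left neighbours, all cells on the same anti-diagonal can be computed independently of one another. That property is the usual basis for pipelined or wavefront evaluation of this table; whether the paper's models exploit it in exactly this way is not stated in the abstract.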
