Pairwise local alignment using wavelet transform

The first fact of sequence analysis is sequence alignment, which is in turn pave the way for structural and functional analysis of the molecular sequence. Owing to the increase in biological data, the alignment approaches take more time for computation. Focusing this issue, in this work the local alignment is made for pairwise molecular sequences by applying a wavelet transform based approach (WMSA) to solve the alignment problem. Like global alignment, local alignment also gives more peculiar information of molecular sequences. Here, the sequence is converted into numerical values using the electron-ion interaction potential model. This in turn decomposed using one of the wavelet transform types and the similarity between the sequences is found using the cross-correlation measure. The significance of the similarity is evaluated using two scoring function namely Position Weight Matrix (PSM) and a new function called Count Score. The work is compared with the standard Smith-Waterman algorithm. The result shows that the proposed approach improves the speed without sacrificing the accuracy.

[1]  I. Cosic Macromolecular bioactivity: is it resonant interaction between macromolecules?-theory and applications , 1994, IEEE Transactions on Biomedical Engineering.

[2]  Hans A. Kestler,et al.  Extended pairwise local alignment of wild card DNA/RNA sequences using dynamic programming , 2015 .

[3]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[4]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[5]  Biswanath Chowdhury,et al.  A cascaded pairwise biomolecular sequence alignment technique using evolutionary algorithm , 2015, Inf. Sci..

[6]  Hong Qu,et al.  One-dimensional pairwise CNN for the global alignment of two DNA sequences , 2015, Neurocomputing.

[7]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[8]  Yi Yang,et al.  Analyzing functional similarity of protein sequences with discrete wavelet transform , 2005, Comput. Biol. Chem..

[9]  Thomas Kiel Rasmussen,et al.  Improved Hidden Markov Model training for multiple sequence alignment by a particle swarm optimization-evolutionary algorithm hybrid. , 2003, Bio Systems.

[10]  Nirupam Chakraborti,et al.  A novel method for alignment of two nucleic acid sequences using ant colony optimization and genetic algorithms , 2007, Appl. Soft Comput..

[11]  Qiang Fang,et al.  Protein sequence comparison based on the wavelet transform approach. , 2002, Protein engineering.

[12]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[13]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[14]  Reda Alhajj,et al.  Multiple sequence alignment with affine gap by using multi-objective genetic algorithm , 2014, Comput. Methods Programs Biomed..