Aligning Molecular Sequences by Wavelet Transform using Cross Correlation Similarity Metric

The first fact of sequence analysis is sequence alignment for the study of structural and functional analysis of the molecular sequence. Owing to the increase in biological data, there is a trade-off between accuracy and the computation of sequence alignment process. Sequences can be aligned both in locally and globally to gives vital information for biologists. Focusing these issues, in this work the local and global alignment are focused on aligning multiple molecular sequences by applying a wavelet transform. Here, the sequence is converted into numerical values using the electron-ion interaction potential model. This is decomposed using a type of wavelet transform and the similarity between the sequences is found using the crosscorrelation measure. The significance of the similarity is evaluated using two scoring function namely Position Specific Matrix and a new function called Count score. The work is compared with Fast Fourier Transform based approach and the result shows that the proposed method improves the alignment.

[1]  Tripti Negi,et al.  Time Series : Similarity Search and its Applications , 2004 .

[2]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[3]  Amit Konar,et al.  Swarm Intelligence Algorithms in Bioinformatics , 2008, Computational Intelligence in Bioinformatics.

[4]  Yi Yang,et al.  Analyzing functional similarity of protein sequences with discrete wavelet transform , 2005, Comput. Biol. Chem..

[5]  Qiang Fang,et al.  Protein sequence comparison based on the wavelet transform approach. , 2002, Protein engineering.

[6]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[7]  Alan L Rockwood,et al.  Sequence alignment by cross-correlation. , 2005, Journal of biomolecular techniques : JBT.

[8]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[9]  M. Khalil A New Heuristic Approach for DNA Sequences Alignment , 2015 .

[10]  Reda Alhajj,et al.  Multiple sequence alignment with affine gap by using multi-objective genetic algorithm , 2014, Comput. Methods Programs Biomed..

[11]  Thomas Kiel Rasmussen,et al.  Improved Hidden Markov Model training for multiple sequence alignment by a particle swarm optimization-evolutionary algorithm hybrid. , 2003, Bio Systems.

[12]  K. Katoh,et al.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.

[13]  J. Jayapriya,et al.  A Novel Distance Metric for Aligning Multiple Sequences Using DNA Hybridization Process , 2016 .

[14]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[15]  I. Cosic Macromolecular bioactivity: is it resonant interaction between macromolecules?-theory and applications , 1994, IEEE Transactions on Biomedical Engineering.

[16]  Ruhul A. Sarker,et al.  Progressive Alignment Method Using Genetic Algorithm for Multiple Sequence Alignment , 2012, IEEE Transactions on Evolutionary Computation.

[17]  Arabi Keshk,et al.  Enhanced Dynamic Algorithm of Genome Sequence Alignments , 2014 .

[18]  Zne-Jung Lee,et al.  Genetic algorithm with ant colony optimization (GA-ACO) for multiple sequence alignment , 2008, Appl. Soft Comput..

[19]  Fernando Guirado,et al.  Improving multiple sequence alignment biological accuracy through genetic algorithms , 2012, The Journal of Supercomputing.