Distance based methods of DNA sequence analysis in phylogenetics

Alignment based methods are considered under NP hard problems they does not perform well when database becomes largeand complex. This paper discusses alignment free methods based on k-tuple, frequency and composition of bases and k-mer. All the methods are discussed taking pair of DNA sequence of same length and different length giving distance meaure between them. In the results we have shown all the discussed alignment free methods are almost same in calculating the distance measure of same length, but for different length sequences k-tuple based method and frequency and compositional based method's result are more close as compared to k-mer based method. Role of entropy in sequence analysis and various distance methods like Euclidean, Manhattan UPGMA and neighbor joining are also discussed.

[1]  Shi Feng,et al.  A new distance metric and its application in phylogenetic tree construction , 2004, 2004 Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[2]  M. Ragan,et al.  Is Multiple-Sequence Alignment Required for Accurate Inference of Phylogeny? , 2007, Systematic biology.

[3]  Cheng-Yan Kao,et al.  The Impact of Normalization and Phylogenetic Information on Estimating the Distance for Metagenomes , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[4]  Khalid Sayood,et al.  A new sequence distance measure for phylogenetic tree construction , 2003, Bioinform..

[5]  Wei You,et al.  Similarity Analysis of DNA Sequences Using “Molecular Connectivity Indices” Method , 2009, 2009 Sixth International Conference on Fuzzy Systems and Knowledge Discovery.

[6]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[7]  B. Kozarzewski A Method for Nucleotide Sequence Analysis , 2012 .

[8]  Qingshan Jiang,et al.  A DNA sequence distance measure approach for phylogenetic tree construction , 2010, 2010 IEEE Fifth International Conference on Bio-Inspired Computing: Theories and Applications (BIC-TA).

[9]  Chun Li,et al.  Relative entropy of DNA and its application , 2005 .

[10]  Daryl Essam,et al.  Iterative progressive alignment method (IPAM) for multiple sequence alignment , 2009, 2009 International Conference on Computers & Industrial Engineering.