Comparison of nucleotide DNA alignment search programmes

This paper evaluates the performance of search alignment tools using Blast and MegaBlast obtained from the National Center for Biotechnology Information (NCBI) website. The main objective is to articulate clearly the dependencies of the different parameters on the performance of the search alignment tool. The parameters that are of interest will be the seed size used as well as the size of the nucleotide input sequence. Four experiments were conducted using programmes available on NCBI website. The first two experiments preselect two input nucleotide query sequences of various lengths belonging to the rat's organism and searched the source databases for matches. The source databases selected were the RAT EST and the MOUSE EST. The third experiment used an input query sequence string of a large nucleotide belonging to the rat's organism and searched against the Mouse genome. The fourth experiment used an input query sequence string belonging to another Hessian fly species and it was searched against the MOUSE EST. Experimental results using Gapped Blast version 2.2.6 and MegaBlast showed that the performance timing of both search engines was comparable. Also, the number of sequences found for a single seed size was much higher in MegaBlast compared to Blast.

[1]  Jian Ye,et al.  BLAST: improvements for better sequence analysis , 2006, Nucleic Acids Res..

[2]  Daniel G. Brown,et al.  Vector seeds: An extension to spaced seeds , 2005, J. Comput. Syst. Sci..

[3]  Mark Johnson,et al.  NCBI BLAST: a better web interface , 2008, Nucleic Acids Res..

[4]  Yu Chen,et al.  An iterative refinement algorithm for consistency based multiple structural alignment methods , 2006, Bioinform..

[5]  M. Cameron,et al.  Improved gapped alignment in BLAST , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[6]  Tao Jiang,et al.  DNA sequencing and string learning , 2005, Mathematical systems theory.

[7]  Bonnie Berger,et al.  A Parameterized Algorithm for Protein Structure Alignment , 2007, J. Comput. Biol..

[8]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[9]  Dennis Shasha,et al.  New techniques for extracting features from protein sequences , 2001, IBM Syst. J..