Mapping short reads to a genome without using hash look-up table algorithm and Burrows Wheeler Transformation

Many new sequencing techniques have been proposed in the last few years. A number of tools have been developed for mapping short reads to the genome. However, these tools used the hash look-up table algorithm for reads or for the genome. With more and more reads sequenced and the length of reads increases; these tools will require huge memory usage and may be not practical for biologists and applications. Recently, new tools were designed to overcome this problem by using Burrows Wheeler Transformation. In this paper, we propose a program, FRESCO, without using hash look-up table algorithm and Burrows Wheeler Transformation. By well-designed techniques in FRESCO, the memory usage can be controlled by users. A case study of micro-RNAs (miRNAs) was applied into FRESCO and the results showed that FRESCO can only used 268MB memory and mapped reads to all of known miRNAs and 98.6% novel miRNAs candidates.

[1]  Richard D. Hayes,et al.  Draft Genome Sequence of the Sexually Transmitted Pathogen Trichomonas vaginalis , 2007, Science.

[2]  Ambuj K. Singh,et al.  Speeding up whole-genome alignment by indexing frequency vectors , 2004, Bioinform..

[3]  Colin N. Dewey,et al.  Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution , 2004, Nature.

[4]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[5]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[6]  Ruiqiang Li,et al.  SOAP: short oligonucleotide alignment program , 2008, Bioinform..

[7]  Michael Q. Zhang,et al.  Using quality scores and longer reads improves accuracy of Solexa read mapping , 2008, BMC Bioinformatics.

[8]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[9]  Wing Hung Wong,et al.  SeqMap: mapping massive amount of oligonucleotides to the genome , 2008, Bioinform..

[10]  Siu-Ming Yiu,et al.  SOAP2: an improved ultrafast tool for short read alignment , 2009, Bioinform..

[11]  Bin Ma,et al.  ZOOM! Zillions of oligos mapped , 2008, Bioinform..

[12]  Robert J. Moore,et al.  A microRNA catalog of the developing chicken embryo identified by a deep sequencing approach. , 2008, Genome research.

[13]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.