Adaptive reference-free compression of sequence quality scores
暂无分享,去创建一个
[1] Jens Stoye,et al. metaBEETL: high-throughput analysis of heterogeneous microbial populations from shotgun DNA sequences , 2013, BMC Bioinformatics.
[2] Giovanna Rosone,et al. Lightweight LCP Construction for Next-Generation Sequencing Datasets , 2013, WABI.
[3] Gabor T. Marth,et al. A general approach to single-nucleotide polymorphism discovery , 1999, Nature Genetics.
[4] Giovanna Rosone,et al. Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform , 2012, Bioinform..
[5] R. Durbin,et al. Efficient de novo assembly of large genomes using compressed data structures. , 2012, Genome research.
[6] Markus Hsi-Yang Fritz,et al. Efficient storage of high throughput DNA sequencing data using reference-based compression. , 2011, Genome research.
[7] Richard Durbin,et al. Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .
[8] Heng Li,et al. Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly , 2012, Bioinform..
[9] Lucian Ilie,et al. HiTEC: accurate error correction in high-throughput sequencing data , 2011, Bioinform..
[10] Shoshana Neuburger,et al. The Burrows-Wheeler transform: data compression, suffix arrays, and pattern matching by Donald Adjeroh, Timothy Bell and Amar Mukherjee Springer, 2008 , 2010 .
[11] Giovanna Rosone,et al. Comparing DNA Sequence Collections by Direct Comparison of Compressed Text Indexes , 2012, WABI.
[12] Walter L. Ruzzo,et al. Compression of next-generation sequencing reads aided by highly efficient de novo assembly , 2012, Nucleic acids research.
[13] Faraz Hach,et al. SCALCE: boosting sequence compression algorithms using locally consistent encoding , 2012, Bioinform..
[14] Antonio Restivo,et al. Balancing and clustering of words in the Burrows-Wheeler transform , 2011, Theor. Comput. Sci..
[15] Giovanna Rosone,et al. Lightweight algorithms for constructing and inverting the BWT of string collections , 2013, Theor. Comput. Sci..
[16] Peter M. Rice,et al. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants , 2009, Nucleic acids research.
[17] Amar Mukherjee,et al. The Burrows-Wheeler Transform:: Data Compression, Suffix Arrays, and Pattern Matching , 2008 .
[18] James K. Bonfield,et al. Compression of FASTQ and SAM Format Sequencing Data , 2013, PloS one.
[19] Giovanni Manzini,et al. An analysis of the Burrows-Wheeler transform , 2001, SODA '99.
[20] P Green,et al. Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.
[21] C. E. SHANNON,et al. A mathematical theory of communication , 1948, MOCO.
[22] Chiara Epifanio,et al. Novel Combinatorial and Information‐Theoretic Alignment‐Free Distances for Biological Data Mining , 2010 .
[23] Alexander J. Hartemink,et al. A generalized model for multi-marker analysis of cell cycle progression in synchrony experiments , 2011, Bioinform..
[24] George Varghese,et al. Compressing Genomic Sequence Fragments Using SlimGene , 2010, RECOMB.
[25] Kiyoshi Asai,et al. Transformations for the compression of FASTQ quality scores of next-generation sequencing data , 2012, Bioinform..
[26] Srinivas Aluru,et al. A survey of error-correction methods for next-generation sequencing , 2013, Briefings Bioinform..
[27] Giovanna Rosone,et al. Lightweight BWT Construction for Very Large String Collections , 2011, CPM.
[28] P. Green,et al. Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.
[29] Michael Q. Zhang,et al. Using quality scores and longer reads improves accuracy of Solexa read mapping , 2008, BMC Bioinformatics.
[30] M. DePristo,et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.
[31]
R. Durbin,et al.
Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P ,
2022
.
[32]
R Staden,et al.
The application of numerical estimates of base calling accuracy to DNA sequencing projects.
,
1995,
Nucleic acids research.