ADAM: Genomics Formats and Processing Patterns for Cloud Scale Computing
David A. Patterson | Anthony D. Joseph | Frank Austin Nothaft | Matt Massie | André Schumacher | Christos Kozanitis | Christopher Hartl
[1] H. Zimmermann,et al. OSI Reference Model - The ISO Model of Architecture for Open Systems Interconnection , 1980, IEEE Transactions on Communications.
[2] Temple F. Smith,et al. Comparison of biosequences , 1981 .
[3] Michael Stonebraker,et al. C-Store: A Column-oriented DBMS , 2005, VLDB.
[4] Daniel J. Abadi,et al. Integrating compression and execution in column-oriented database systems , 2006, SIGMOD Conference.
[5] Gonçalo R. Abecasis,et al. The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..
[6] M. DePristo,et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.
[7] Vladimir Yanovsky. ReCoil - an algorithm for compression of extremely large datasets of dna data , 2010, Algorithms for Molecular Biology.
[8] J. Kitzman,et al. Whole exome capture in solution with 3 Gbp of data , 2010 .
[9] George Varghese,et al. Compressing Genomic Sequence Fragments Using SlimGene , 2010, RECOMB.
[10] Scott Shenker,et al. Spark: Cluster Computing with Working Sets , 2010, HotCloud.
[11] Daniel Rios,et al. Deriving the consequences of genomic variants with the Ensembl API and SNP Effect Predictor , 2010, Bioinform..
[12] M. DePristo,et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.
[13] Markus Hsi-Yang Fritz,et al. Efficient storage of high throughput DNA sequencing data using reference-based compression. , 2011, Genome research.
[14] V. Bafna. Abstractions for Genomics: Or, which way to the Genomic Information Age? , 2011 .
[15] Walter L. Ruzzo,et al. Compression of next-generation sequencing reads aided by highly efficient de novo assembly , 2012, Nucleic acids research.
[16] Idoia Ochoa,et al. Lossy Compression of Quality Values via Rate Distortion Theory , 2012, ArXiv.
[17] Ramakrishna Varadarajan,et al. The Vertica Analytic Database: C-Store 7 Years Later , 2012, Proc. VLDB Endow..
[18] Eija Korpelainen,et al. Hadoop-BAM: directly manipulating next generation sequencing data in the cloud , 2012, Bioinform..
[19] Giovanna Rosone,et al. Large-scale compression of genomic sequence databases with the Burrows-Wheeler transform , 2012, Bioinform..
[20] Kiyoshi Asai,et al. Transformations for the compression of FASTQ quality scores of next-generation sequencing data , 2012, Bioinform..
[21] George Varghese,et al. Abstractions for genomics , 2013, CACM.
[22] N. Popitsch,et al. NGC: lossless and lossy compression of aligned high-throughput sequencing data , 2012, Nucleic acids research.
[23] Scott Shenker,et al. Shark: SQL and rich analytics at scale , 2012, SIGMOD '13.
[24] Giovanna Rosone,et al. Adaptive reference-free compression of sequence quality scores , 2014, Bioinform..
[25] George Varghese,et al. Using Genome Query Language to uncover genetic variation , 2014, Bioinform..