Will solid-state drives accelerate your bioinformatics? In-depth profiling, performance analysis and beyond
暂无分享,去创建一个
[1] Andreas Holzinger. Biomedical Informatics: Discovering Knowledge in Big Data , 2014 .
[2] María Martín,et al. The Universal Protein Resource (UniProt) in 2010 , 2010 .
[3] J. Rinn,et al. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature biotechnology.
[4] Gonçalo R. Abecasis,et al. The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..
[5] Steven L Salzberg,et al. Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.
[6] Satoru Miyano,et al. Open source clustering software , 2004 .
[7] Inanç Birol,et al. De novo transcriptome assembly with ABySS , 2009, Bioinform..
[8] Helga Thorvaldsdóttir,et al. Integrative Genomics Viewer , 2011, Nature Biotechnology.
[9] Elon Portugaly,et al. Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space , 2008, ISMB.
[10] Ronald C. Taylor. An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics , 2010, BMC Bioinformatics.
[11] Jesse Gillis,et al. Gene function analysis in complex data sets using ErmineJ , 2010, Nature Protocols.
[12] Baris E. Suzek,et al. The Universal Protein Resource (UniProt) in 2010 , 2009, Nucleic Acids Res..
[13] William Stafford Noble,et al. Assessing computational tools for the discovery of transcription factor binding sites , 2005, Nature Biotechnology.
[14] Sven Rahmann,et al. Efficient exact motif discovery , 2009, Bioinform..
[15] Sanjay Ghemawat,et al. MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.
[16] Jongmoo Choi,et al. IO Workload Characterization Revisited: A Data-Mining Approach , 2014, IEEE Transactions on Computers.
[17] Hai Jin,et al. Disk System Architectures for High Performance Computing , 2002 .
[18] Cheng Li,et al. Assert(!Defined(Sequential I/O)) , 2014, HotStorage.
[19] J. Rinn,et al. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs , 2010, Nature Biotechnology.
[20] Russell J. Davenport,et al. Removing Noise From Pyrosequenced Amplicons , 2011, BMC Bioinformatics.
[21] Steven Swanson,et al. Near-Data Processing: Insights from a MICRO-46 Workshop , 2014, IEEE Micro.
[22] Graziano Pesole,et al. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences , 2007, BMC Bioinformatics.
[23] Wilfred W. Li,et al. MEME: discovering and analyzing DNA and protein sequence motifs , 2006, Nucleic Acids Res..
[24] E. Myers,et al. Basic local alignment search tool. , 1990, Journal of molecular biology.
[25] Nikolai Joukov,et al. A nine year study of file system and storage benchmarking , 2008, TOS.
[26] Jaehwan Lee,et al. Introducing SSDs to the Hadoop MapReduce Framework , 2014, 2014 IEEE 7th International Conference on Cloud Computing.
[27] Steven J. M. Jones,et al. Abyss: a Parallel Assembler for Short Read Sequence Data Material Supplemental Open Access , 2022 .
[28] Minjae Lee,et al. RNA design rules from a massive open laboratory , 2014, Proceedings of the National Academy of Sciences.
[29] E. Schadt. The changing privacy landscape in the era of big data , 2012, Molecular systems biology.
[30] Srinivas Aluru,et al. Reptile: representative tiling for short read error correction , 2010, Bioinform..
[31] W. J. Kent,et al. BLAT--the BLAST-like alignment tool. , 2002, Genome research.
[32] D. Altshuler,et al. A map of human genome variation from population-scale sequencing , 2010, Nature.
[33] Paul Flicek,et al. Sense from sequence reads: methods for alignment and assembly , 2009, Nature Methods.
[34] Giovanni De Micheli,et al. Clustering protein environments for function prediction: finding PROSITE motifs in 3D , 2007, BMC Bioinformatics.
[35] Richard Durbin,et al. Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .
[36] Jae-Myung Kim,et al. A case for flash memory ssd in enterprise database applications , 2008, SIGMOD Conference.
[37] M. DePristo,et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.
[38] David A. Patterson,et al. Computer Architecture: A Quantitative Approach , 1969 .
[39]
R. Durbin,et al.
Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P ,
2022
.
[40]
M. Metzker.
Sequencing technologies — the next generation
,
2010,
Nature Reviews Genetics.
[41]
James C. Browne,et al.
Comprehensive job level resource usage measurement and analysis for XSEDE HPC systems
,
2013,
XSEDE.
[42]
B. Williams,et al.
Mapping and quantifying mammalian transcriptomes by RNA-Seq
,
2008,
Nature Methods.
[43]
J. Thompson,et al.
CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice.
,
1994,
Nucleic acids research.
[44]
M. Schatz,et al.
Algorithms Gage: a Critical Evaluation of Genome Assemblies and Assembly Material Supplemental
,
2008
.
[45]
S Miyano,et al.
Open source clustering software.
,
2004,
Bioinformatics.
[46]
Lior Pachter,et al.
Sequence Analysis
,
2020,
Definitions.
[47]
Andrea K. Bartram,et al.
Generation of Multimillion-Sequence 16S rRNA Gene Libraries from Complex Microbial Communities by Assembling Paired-End Illumina Reads
,
2011,
Applied and Environmental Microbiology.
[48]
Antony I. T. Rowstron,et al.
Scale-up vs scale-out for Hadoop: time to rethink?
,
2013,
SoCC.