HAlign-II: efficient ultra-large multiple sequence alignment and phylogenetic tree reconstruction with distributed and parallel computing
暂无分享,去创建一个
[1] Xiangxiang Zeng,et al. HPTree: Reconstructing phylogenetic trees for ultra-large unaligned DNA sequences via NJ model and Hadoop , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).
[2] Kai Wang,et al. BioPig: a Hadoop-based analytic toolkit for large-scale sequence data , 2013, Bioinform..
[3] Olivier Poch,et al. BAliBASE 3.0: Latest developments of the multiple sequence alignment benchmark , 2005, Proteins.
[4] Carlo Curino,et al. Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.
[5] Alexandros Stamatakis,et al. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..
[6] Yufeng Wu,et al. COALESCENT‐BASED SPECIES TREE INFERENCE FROM GENE TREE TOPOLOGIES UNDER INCOMPLETE LINEAGE SORTING BY MAXIMUM LIKELIHOOD , 2012, Evolution; international journal of organic evolution.
[7] K. Katoh,et al. MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. , 2002, Nucleic acids research.
[8] Michael I. Jordan,et al. SparkNet: Training Deep Networks in Spark , 2015, ICLR.
[9] Chih-Wei Huang,et al. CloudDOE: A User-Friendly Tool for Deploying Hadoop Clouds and Analyzing High-Throughput Sequencing Data with MapReduce , 2014, PloS one.
[10] Kevin Karplus,et al. Evaluation of protein multiple alignments by SAM-T99 using the BAliBASE multiple alignment test set , 2001, Bioinform..
[11] Erik L. L. Sonnhammer,et al. Kalign – an accurate and fast multiple sequence alignment algorithm , 2005, BMC Bioinformatics.
[12] Nikil D. Dutt,et al. SPARK: a high-level synthesis framework for applying parallelizing compiler transformations , 2003, 16th International Conference on VLSI Design, 2003. Proceedings..
[13] James G. Shanahan,et al. Large Scale Distributed Data Science using Apache Spark , 2015, KDD.
[14] Sudhir Kumar,et al. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. , 2016, Molecular biology and evolution.
[15] Maurits J. J. Dijkstra,et al. Multiple Sequence Alignment. , 2017, Methods in molecular biology.
[16] Cheng Ling,et al. SparkSW: Scalable Distributed Computing System for Large-Scale Biological Sequence Alignment , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.
[17] Eija Korpelainen,et al. Hadoop-BAM: directly manipulating next generation sequencing data in the cloud , 2012, Bioinform..
[18] Klaus Peter Schliep,et al. phangorn: phylogenetic analysis in R , 2010, Bioinform..
[19] C. Huttenhower,et al. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes , 2013, Nature Communications.
[20] Xiaohui Xie,et al. Interactive and fuzzy search: a dynamic way to explore MEDLINE , 2010, Bioinform..
[21] A. von Haeseler,et al. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies , 2014, Molecular biology and evolution.
[22] TölkeJonas. Implementation of a Lattice Boltzmann kernel using the Compute Unified Device Architecture developed by nVIDIA , 2010 .
[23] Hidetoshi Shimodaira,et al. Mitochondrial genome variation in eastern Asia and the peopling of Japan. , 2004, Genome research.
[24] P. J. Narayanan,et al. Accelerating Large Graph Algorithms on the GPU Using CUDA , 2007, HiPC.
[25] Qinghua Hu,et al. HAlign: Fast multiple similar DNA/RNA sequence alignment based on the centre star strategy , 2015, Bioinform..
[26] Amitava Datta,et al. Multiple sequence alignment in parallel on a workstation cluster , 2004, Bioinform..
[27] Nagesh V. Honnalli,et al. Hobbes: optimized gram-based methods for efficient read alignment , 2011, Nucleic acids research.
[28] N. Saitou,et al. The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.
[29] D. Higgins,et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega , 2011, Molecular systems biology.
[30] Eija Korpelainen,et al. SeqPig: simple and scalable scripting for large sequencing data sets in Hadoop , 2013, Bioinform..
[31] Anthony Skjellum,et al. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..
[32] Marek S. Wiewiórka,et al. SparkSeq: fast, scalable and cloud-ready tool for the interactive genomic data analysis with nucleotide precision , 2014, Bioinform..
[33] M S Waterman,et al. Identification of common molecular subsequences. , 1981, Journal of molecular biology.
[34] Xi Chen,et al. CMSA: a heterogeneous CPU/GPU computing system for multiple similar RNA/DNA sequence alignment , 2017, BMC Bioinformatics.
[35] Robert C. Edgar,et al. MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.
[36] Silvio C. E. Tosatto,et al. The PASTA server for protein aggregation prediction. , 2007, Protein engineering, design & selection : PEDS.
[37] W. J. Kent,et al. BLAT--the BLAST-like alignment tool. , 2002, Genome research.
[38] Michael J. Franklin,et al. Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing , 2012, NSDI.
[39] Ronald C. Taylor. An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics , 2010, BMC Bioinformatics.
[40] Jonas Tölke,et al. Implementation of a Lattice Boltzmann kernel using the Compute Unified Device Architecture developed by nVIDIA , 2009, Comput. Vis. Sci..
[41] Kuo-Bin Li,et al. ClustalW-MPI: ClustalW analysis using distributed and parallel computing , 2003, Bioinform..