aTRAM 2.0: An Improved, Flexible Locus Assembler for NGS Data

Massive strides have been made in technologies for collecting genome-scale data. However, tools for efficiently and flexibly assembling raw outputs into downstream analytical workflows are still nascent. aTRAM 1.0 was designed to assemble any locus from genome sequencing data but was neither optimized for efficiency nor able to serve as a single toolkit for all assembly needs. We have completely re-implemented aTRAM and redesigned its structure for faster read retrieval while adding a number of key features to improve flexibility and functionality. The software can now (1) assemble single- or paired-end data, (2) utilize both read directions in the database, (3) use an additional de novo assembly module, and (4) leverage new built-in pipelines to automate common workflows in phylogenomics. Owing to reimplementation of databasing strategies, we demonstrate that aTRAM 2.0 is much faster across all applications compared to the previous version.

[1]  Travis C Glenn,et al.  Ultraconserved elements anchor thousands of genetic markers spanning multiple evolutionary timescales. , 2012, Systematic biology.

[2]  Steven Salzberg,et al.  Beware of mis-assembled genomes , 2005, Bioinform..

[3]  R. T. Brumfield,et al.  Applications of next-generation sequencing to phylogeography and phylogenetics. , 2013, Molecular phylogenetics and evolution.

[4]  Kevin P. Johnson,et al.  aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data , 2015, BMC Bioinformatics.

[5]  Mihai Pop,et al.  Genome assembly reborn: recent computational challenges , 2009, Briefings Bioinform..

[6]  E. Birney,et al.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs. , 2008, Genome research.

[7]  H. Robertson,et al.  Next-generation phylogenomics using a Target Restricted Assembly Method. , 2013, Molecular phylogenetics and evolution.

[8]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[9]  Steven J. M. Jones,et al.  Abyss: a Parallel Assembler for Short Read Sequence Data Material Supplemental Open Access , 2022 .

[10]  E. Eichler,et al.  Limitations of next-generation genome sequence assembly , 2011, Nature Methods.

[11]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[12]  Kevin P. Johnson,et al.  Phylogenomics from Whole Genome Sequences Using aTRAM , 2017, Systematic biology.