ASaiM-MT: a validated and optimized ASaiM workflow for metatranscriptomics analysis within Galaxy framework

The Human Microbiome Project (HMP) aided in understanding the role of microbial communities and the influence of collective genetic material (the 'microbiome') in human health and disease. With the evolution of new sequencing technologies, researchers can now investigate the microbiome and map its influence on human health. Advances in bioinformatics methods for next-generation sequencing (NGS) data analysis have helped researchers to gain an in-depth knowledge about the taxonomic and genetic composition of microbial communities. Metagenomic-based methods have been the most commonly used approaches for microbiome analysis; however, it primarily extracts information about taxonomic composition and genetic potential of the microbiome under study, lacking quantification of the gene products (RNA and proteins). Conversely, metatranscriptomics, the study of a microbial community's RNA expression, can reveal the dynamic gene expression of individual microbial populations and the community as a whole, ultimately providing information about the active pathways in the microbiome.  In order to address the analysis of NGS data, the ASaiM analysis framework was previously developed and made available via the Galaxy platform. Although developed for both metagenomics and metatranscriptomics, the original publication demonstrated the use of ASaiM only for metagenomics, while thorough testing for metatranscriptomics data was lacking.  In the current study, we have focused on validating and optimizing the tools within ASaiM for metatranscriptomics data. As a result, we deliver a robust workflow that will enable researchers to understand dynamic functional response of the microbiome in a wide variety of metatranscriptomics studies. This improved and optimized ASaiM-metatranscriptomics (ASaiM-MT) workflow is publicly available via the ASaiM framework, documented and supported with training material so that users can interrogate and characterize metatranscriptomic data, as part of larger meta-omic studies of microbiomes.

[1]  Timothy L. Tickle,et al.  Compact graphical representation of phylogenetic data and metadata with GraPhlAn , 2015, PeerJ.

[2]  Handan Melike Dönertaş,et al.  Characterising Complex Enzyme Reaction Data , 2015, bioRxiv.

[3]  Luke R. Thompson,et al.  Species-level functional profiling of metagenomes and metatranscriptomes , 2018, Nature Methods.

[4]  Fernando Azpiroz,et al.  MetaTrans: an open-source pipeline for metatranscriptomics , 2016, Scientific Reports.

[5]  Po-E Li,et al.  Accurate read-based metagenome characterization using a hierarchical suite of unique signatures , 2015, Nucleic acids research.

[6]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[7]  Anthony Bretaudeau,et al.  Community-driven data analysis training for biology , 2017, bioRxiv.

[8]  Patrick S. G. Chain,et al.  Advances and Challenges in Metatranscriptomic Analysis , 2019, Front. Genet..

[9]  Wen J. Li,et al.  Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation , 2015, Nucleic Acids Res..

[10]  Markus Krummenacker,et al.  The MetaCyc database of metabolic pathways and enzymes , 2017, Nucleic acids research.

[11]  B. Henrissat,et al.  From proteins to polysaccharides: lifestyle and genetic evolution of Coprothermobacter proteolyticus , 2018, The ISME Journal.

[12]  P. Turnbaugh,et al.  An Invitation to the Marriage of Metagenomics and Metabolomics , 2008, Cell.

[13]  Derrick E. Wood,et al.  Kraken: ultrafast metagenomic sequence classification using exact alignments , 2014, Genome Biology.

[14]  R. Knight,et al.  Impact of fire on active layer and permafrost microbial communities and metagenomes in an upland Alaskan boreal forest , 2014, The ISME Journal.

[15]  H. Harmsen,et al.  The role of the microbiome for human health: from basic science to clinical applications , 2018, European Journal of Nutrition.

[16]  Adam M. Phillippy,et al.  Interactive metagenomic visualization in a Web browser , 2011, BMC Bioinformatics.

[17]  James E. Johnson,et al.  metaQuantome: An Integrated, Quantitative Metaproteomics Approach Reveals Connections Between Taxonomy and Protein Function in Complex Microbiomes. , 2019, Molecular & cellular proteomics : MCP.

[18]  Gerhard G. Thallinger,et al.  Wx Scout Fashion Sneaker Splash Navy Women's Keds qAS4tR1wn4 for bawln.com , 2009 .

[19]  Eran Elinav,et al.  Use of Metatranscriptomics in Microbiome Research , 2016, Bioinformatics and biology insights.

[20]  Hideaki Tanaka,et al.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads , 2011, BCB '11.

[21]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[22]  James E. Johnson,et al.  ASaiM-MT: a validated and optimized ASaiM workflow for metatranscriptomics analysis within Galaxy framework , 2021, F1000Research.

[23]  Hélène Touzet,et al.  SortMeRNA: fast and accurate filtering of ribosomal RNAs in metatranscriptomic data , 2012, Bioinform..

[24]  Davide Heller,et al.  eggNOG 5.0: a hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses , 2018, Nucleic Acids Res..

[25]  Chao Xie,et al.  Fast and sensitive protein alignment using DIAMOND , 2014, Nature Methods.

[26]  Bérénice Batut,et al.  ASaiM: a Galaxy-based framework to analyze microbiota data , 2018, GigaScience.

[27]  S. Tringe,et al.  Metagenome, metatranscriptome and single-cell sequencing reveal microbial response to Deepwater Horizon oil spill , 2012, The ISME Journal.

[28]  G. Panagiotou,et al.  COMAN: a web server for comprehensive metatranscriptomics analysis , 2016, BMC Genomics.

[29]  Haixu Tang,et al.  FragGeneScan: predicting genes in short and error-prone reads , 2010, Nucleic acids research.

[30]  Duy Tin Truong,et al.  MetaPhlAn2 for enhanced metagenomic taxonomic profiling , 2015, Nature Methods.

[31]  N. Friedman,et al.  Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data , 2011, Nature Biotechnology.

[32]  Minoru Kanehisa,et al.  KEGG as a reference resource for gene and protein annotation , 2015, Nucleic Acids Res..

[33]  Daniel J. Blankenberg,et al.  Galaxy: a platform for interactive large-scale genome analysis. , 2005, Genome research.

[34]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[35]  Yu Hen Hu,et al.  A decade of seasonal dynamics and co-occurrences within freshwater bacterioplankton communities from eutrophic Lake Mendota, WI, USA , 2012, The ISME Journal.

[36]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[37]  Paul Wilmes,et al.  Metaproteomics: studying functional gene expression in microbial ecosystems. , 2006, Trends in microbiology.

[38]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[39]  Siu-Ming Yiu,et al.  IDBA-MT: De Novo Assembler for Metatranscriptomic Data Generated from Next-Generation Sequencing Technology , 2013, J. Comput. Biol..

[40]  Francesco Asnicar,et al.  Author Correction: Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2 , 2019, Nature Biotechnology.