Compositional analysis: a valid approach to analyze microbiome high-throughput sequencing data.

A workshop held at the 2015 annual meeting of the Canadian Society of Microbiologists highlighted compositional data analysis methods and the importance of exploratory data analysis for the analysis of microbiome data sets generated by high-throughput DNA sequencing. A summary of the content of that workshop, a review of new methods of analysis, and information on the importance of careful analyses are presented herein. The workshop focussed on explaining the rationale behind the use of compositional data analysis, and a demonstration of these methods for the examination of 2 microbiome data sets. A clear understanding of bioinformatics methodologies and the type of data being analyzed is essential, given the growing number of studies uncovering the critical role of the microbiome in health and disease and the need to understand alterations to its composition and function following intervention with fecal transplant, probiotics, diet, and pharmaceutical agents.

[1]  Mihai Pop,et al.  Statistical Methods for Detecting Differentially Abundant Features in Clinical Metagenomic Samples , 2009, PLoS Comput. Biol..

[2]  S. Reardon Bacterium can reverse autism-like behaviour in mice , 2013, Nature.

[3]  V. Pawlowsky-Glahn,et al.  Modeling and Analysis of Compositional Data , 2015 .

[4]  Lawrence A. David,et al.  Diet rapidly and reproducibly alters the human gut microbiome , 2013, Nature.

[5]  R. Knight,et al.  UniFrac: a New Phylogenetic Method for Comparing Microbial Communities , 2005, Applied and Environmental Microbiology.

[6]  G. Gloor,et al.  High throughput sequencing methods and analysis for microbiome research. , 2013, Journal of microbiological methods.

[7]  Jonathan Friedman,et al.  Inferring Correlation Networks from Genomic Survey Data , 2012, PLoS Comput. Biol..

[8]  C. Huttenhower,et al.  Metagenomic biomarker discovery and explanation , 2011, Genome Biology.

[9]  J. Aitchison,et al.  Biplots of Compositional Data , 2002 .

[10]  K. Gerald van den Boogaart,et al.  Analyzing Compositional Data with R , 2013 .

[11]  Jürg Bähler,et al.  Proportionality: A Valid Alternative to Correlation for Relative Data , 2014, bioRxiv.

[12]  E. Louis,et al.  Alterations in the Intestinal Microbiome (Dysbiosis) as a Predictor of Relapse After Infliximab Withdrawal in Crohn's Disease , 2014, Inflammatory bowel diseases.

[13]  Daniel M. Saman,et al.  Recovery of the Gut Microbiome following Fecal Microbiota Transplantation , 2014, mBio.

[14]  T. Ball,et al.  Pyrosequencing of the Chaperonin-60 Universal Target as a Tool for Determining Microbial Community Composition , 2009, Applied and Environmental Microbiology.

[15]  Javier Palarea-Albaladejo,et al.  zCompositions — R package for multivariate imputation of left-censored data under a compositional approach , 2015 .

[16]  Karen P. Scott,et al.  16S rRNA gene-based profiling of the human infant gut microbiota is strongly influenced by sample processing and PCR primer choice , 2015, Microbiome.

[17]  William A. Walters,et al.  Using QIIME to Analyze 16S rRNA Gene Sequences from Microbial Communities , 2012, Current protocols in microbiology.

[18]  V. Pawlowsky-Glahn,et al.  Modelling and Analysis of Compositional Data: Pawlowsky-Glahn/Modelling and Analysis of Compositional Data , 2015 .

[19]  H. Wainer,et al.  A Statistical Guide for the Ethically Perplexed , 2012 .

[20]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[21]  K. Pearson Mathematical contributions to the theory of evolution.—On the law of reversion , 1900, Proceedings of the Royal Society of London.

[22]  Gregory B. Gloor,et al.  Displaying Variation in Large Datasets: Plotting a Visual Summary of Effect Sizes , 2016 .

[23]  Jean M. Macklaim,et al.  Changes in vaginal microbiota following antimicrobial and probiotic therapy , 2015, Microbial ecology in health and disease.

[24]  Rob Knight,et al.  Analysis of composition of microbiomes: a novel method for studying microbial composition , 2015, Microbial ecology in health and disease.

[25]  Jean M. Macklaim,et al.  Microbiota of Human Breast Tissue , 2014, Applied and Environmental Microbiology.

[26]  Jean M. Macklaim,et al.  Unifying the analysis of high-throughput sequencing datasets: characterizing RNA-seq, 16S rRNA gene sequencing and selective growth experiments by compositional data analysis , 2014, Microbiome.

[27]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[28]  D. Warton,et al.  Distance‐based multivariate analyses confound location and dispersion effects , 2012 .

[29]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[30]  P. Filzmoser,et al.  Univariate Statistical Analysis of Environmental (compositional) Data: Problems and Possibilities , 2009 .

[31]  Jean M. Macklaim,et al.  ANOVA-Like Differential Expression (ALDEx) Analysis for Mixed Population RNA-Seq , 2013, PloS one.

[32]  Jean YH Yang,et al.  Bioconductor: open software development for computational biology and bioinformatics , 2004, Genome Biology.

[33]  Christian L. Müller,et al.  Sparse and Compositionally Robust Inference of Microbial Ecological Networks , 2014, PLoS Comput. Biol..

[34]  W. Chung,et al.  Short-term probiotic therapy alleviates small intestinal bacterial overgrowth, but does not improve intestinal permeability in chronic liver disease , 2014, European journal of gastroenterology & hepatology.

[35]  D. Coomans,et al.  High-throughput 16S rRNA gene sequencing reveals alterations of intestinal microbiota in myalgic encephalomyelitis/chronic fatigue syndrome patients. , 2013, Anaerobe.

[36]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[37]  K. Pearson Mathematical contributions to the theory of evolution.—On a form of spurious correlation which may arise when indices are used in the measurement of organs , 1897, Proceedings of the Royal Society of London.

[38]  J. Petrosino,et al.  Microbiota Modulate Behavioral and Physiological Abnormalities Associated with Neurodevelopmental Disorders , 2013, Cell.

[39]  B. Paster,et al.  Microbial signature profiles of periodontally healthy and diseased patients. , 2014, Journal of clinical periodontology.