VOLARE: visual analysis of disease-associated microbiome-immune system interplay

BackgroundRelationships between specific microbes and proper immune system development, composition, and function have been reported in a number of studies. However, researchers have discovered only a fraction of the likely relationships. “Omic” methodologies such as 16S ribosomal RNA (rRNA) sequencing and time-of-flight mass cytometry (CyTOF) immunophenotyping generate data that support generation of hypotheses, with the potential to identify additional relationships at a level of granularity ripe for further experimentation. Pairwise linear regressions between microbial and host immune features provide one approach for quantifying relationships between “omes”, and the differences in these relationships across study cohorts or arms. This approach yields a top table of candidate results. However, the top table alone lacks the detail that domain experts such as microbiologists and immunologists need to vet candidate results for follow-up experiments.ResultsTo support this vetting, we developed VOLARE (Visualization Of LineAr Regression Elements), a web application that integrates a searchable top table, small in-line graphs illustrating the fitted models, a network summarizing the top table, and on-demand detailed regression plots showing full sample-level detail. We applied VOLARE to three case studies—microbiome:cytokine data from fecal samples in human immunodeficiency virus (HIV), microbiome:cytokine data in inflammatory bowel disease and spondyloarthritis, and microbiome:immune cell data from gut biopsies in HIV. We present both patient-specific phenomena and relationships that differ by disease state. We also analyzed interaction data from system logs to characterize usage scenarios. This log analysis revealed that users frequently generated detailed regression plots, suggesting that this detail aids the vetting of results.ConclusionsSystematically integrating microbe:immune cell readouts through pairwise linear regressions and presenting the top table in an interactive environment supports the vetting of results for scientific relevance. VOLARE allows domain experts to control the analysis of their results, screening dozens of candidate relationships with ease. This interactive environment transcends the limitations of a static top table.

[1]  E. Matteson,et al.  An expansion of rare lineage intestinal microbes characterizes rheumatoid arthritis , 2016, Genome Medicine.

[2]  Ash A. Alizadeh,et al.  Large-Scale and Comprehensive Immune Profiling and Functional Analysis of Normal Human Aging , 2015, PloS one.

[3]  R. Ley,et al.  Innate immunity and intestinal microbiota in the development of Type 1 diabetes , 2008, Nature.

[4]  Matthew J. Gebert,et al.  Alterations in the gut microbiota associated with HIV-1 infection. , 2013, Cell host & microbe.

[5]  M. Noval Rivas,et al.  The microbiome in asthma , 2016, Current opinion in pediatrics.

[6]  R Balfour Sartor,et al.  Microbial influences in inflammatory bowel diseases. , 2008, Gastroenterology.

[7]  André Karch,et al.  Colonic Butyrate-Producing Communities in Humans: an Overview Using Omics Data , 2017, mSystems.

[8]  Jeroen Ooms,et al.  The jsonlite Package: A Practical and Consistent Mapping Between JSON Data and R Objects , 2014, ArXiv.

[9]  Tamara Munzner,et al.  Visualization Analysis and Design , 2014, A.K. Peters visualization series.

[10]  M. McCarter,et al.  Fecal Microbiota Composition Drives Immune Activation in HIV-infected Individuals , 2018, EBioMedicine.

[11]  Riyue Bao,et al.  The commensal microbiome is associated with anti–PD-1 efficacy in metastatic melanoma patients , 2018, Science.

[12]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[13]  E. Le Chatelier,et al.  Gut microbiome modulates response to anti–PD-1 immunotherapy in melanoma patients , 2018, Science.

[14]  P. Fayers,et al.  The Visual Display of Quantitative Information , 1990 .

[15]  Martin Hartmann,et al.  Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities , 2009, Applied and Environmental Microbiology.

[16]  F. Bushman,et al.  QIIME allows integration and analysis of high-throughput community sequencing data. Nat. Meth. , 2010 .

[17]  Daniel N. Frank,et al.  Functional intraepithelial lymphocyte changes in inflammatory bowel disease and spondyloarthritis have disease specific correlations with intestinal microbiota , 2018, Arthritis Research & Therapy.

[18]  Edward Rolf Tufte,et al.  The visual display of quantitative information , 1985 .

[19]  Monther Alhamdoosh,et al.  RNA-seq analysis is easy as 1-2-3 with limma, Glimma and edgeR , 2016, F1000Research.

[20]  Susan Holmes,et al.  phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data , 2013, PloS one.

[21]  Stephen L. Hauser,et al.  Gut bacteria from multiple sclerosis patients modulate human T cells and exacerbate symptoms in mouse models , 2017, Proceedings of the National Academy of Sciences.

[22]  Anushya Muruganujan,et al.  PANTHER version 11: expanded annotation data from Gene Ontology and Reactome pathways, and data analysis tool enhancements , 2016, Nucleic Acids Res..

[23]  B. Palmer,et al.  An exploration of Prevotella-rich microbiomes in HIV and men who have sex with men , 2018, Microbiome.

[24]  Holden T Maecker,et al.  Algorithmic Tools for Mining High-Dimensional Cytometry Data , 2015, The Journal of Immunology.

[25]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[26]  Jeffrey Heer,et al.  SpanningAspectRatioBank Easing FunctionS ArrayIn ColorIn Date Interpolator MatrixInterpola NumObjecPointI Rectang ISchedu Parallel Pause Scheduler Sequen Transition Transitioner Transiti Tween Co DelimGraphMLCon IData JSONCon DataField DataSc Dat DataSource Data DataUtil DirtySprite LineS RectSprite , 2011 .

[27]  Akifumi Yamashita,et al.  Conventional culture methods with commercially available media unveil the presence of novel culturable bacteria , 2018, Gut microbes.

[28]  E. Kuh,et al.  Linear Regression Diagnostics , 1977 .

[29]  Laurence Zitvogel,et al.  Gut microbiome influences efficacy of PD-1–based immunotherapy against epithelial tumors , 2018, Science.

[30]  Rob Knight,et al.  EMPeror: a tool for visualizing high-throughput microbial community data , 2013, GigaScience.

[31]  Michael Shaffer,et al.  Diverse Intestinal Bacteria Contain Putative Zwitterionic Capsular Polysaccharides with Anti-inflammatory Properties. , 2016, Cell host & microbe.

[32]  Jeffrey S. Morris,et al.  iBAG: integrative Bayesian analysis of high-dimensional multiplatform genomics data , 2012, Bioinform..

[33]  Natalia Shulzhenko,et al.  Multi-omics Comparative Analysis Reveals Multiple Layers of Host Signaling Pathway Regulation by the Gut Microbiota , 2017, mSystems.

[34]  C. Lozupone,et al.  Complexities of Gut Microbiome Dysbiosis in the Context of HIV Infection and Antiretroviral Therapy , 2016, Clinical pharmacology and therapeutics.

[35]  J. Faith,et al.  Extensive personal human gut microbiota culture collections characterized and manipulated in gnotobiotic mice , 2011, Proceedings of the National Academy of Sciences.

[36]  Lawrence Hunter,et al.  Visual analysis of biological data-knowledge networks , 2015, BMC Bioinformatics.

[37]  Chen Peng,et al.  Improve Glioblastoma Multiforme Prognosis Prediction by Using Feature Selection and Multiple Kernel Learning , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[38]  Matthew A. Hibbs,et al.  Visualization of omics data for systems biology , 2010, Nature Methods.

[39]  Ni Zhang,et al.  Programmed cell death-1/programmed cell death ligand-1 checkpoint inhibitors: differences in mechanism of action. , 2019, Immunotherapy.

[40]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[41]  R. Tibshirani,et al.  Automated identification of stratifying signatures in cellular subpopulations , 2014, Proceedings of the National Academy of Sciences.

[42]  Kristel Steen,et al.  Integration of gene expression and methylation to unravel biological networks in glioblastoma patients , 2017, Genetic epidemiology.

[43]  Jeffrey Heer,et al.  D³ Data-Driven Documents , 2011, IEEE Transactions on Visualization and Computer Graphics.

[44]  C. Montoya,et al.  Particular activation phenotype of T cells expressing HLA-DR but not CD38 in GALT from HIV-controllers is associated with immune regulation and delayed progression to AIDS , 2016, Immunologic research.

[45]  John Hardy,et al.  Genome, transcriptome and proteome: the rise of omics data and their integration in biomedical sciences , 2016, Briefings Bioinform..

[46]  Patrick Breheny,et al.  Visualization of Regression Models Using visreg , 2017, R J..

[47]  Alioune Ngom,et al.  A review on machine learning principles for multi-view biological data integration , 2016, Briefings Bioinform..

[48]  Susan P. Holmes,et al.  Waste Not , Want Not : Why Rarefying Microbiome Data is Inadmissible . October 1 , 2013 , 2013 .

[49]  Jeffrey S. Morris,et al.  Integrative Bayesian Analysis of High-Dimensional Multi-platform Genomics Data , 2012 .

[50]  Daniel N. Frank,et al.  Enhancement of HIV-1 infection and intestinal CD4+ T cell depletion ex vivo by gut microbes altered during chronic HIV-1 infection , 2016, Retrovirology.

[51]  Mingrui Liu,et al.  IntLIM: integration using linear models of metabolomics and gene expression data , 2018, BMC Bioinformatics.

[52]  Ben Shneiderman,et al.  The eyes have it: a task by data type taxonomy for information visualizations , 1996, Proceedings 1996 IEEE Symposium on Visual Languages.

[53]  Sean C. Bendall,et al.  Extracting a Cellular Hierarchy from High-dimensional Cytometry Data with SPADE , 2011, Nature Biotechnology.