A latent allocation model for the analysis of microbial composition and disease

BackgroundEstablishing the relationship between microbiota and specific diseases is important but requires appropriate statistical methodology. A specialized feature of microbiome count data is the presence of a large number of zeros, which makes it difficult to analyze in case-control studies. Most existing approaches either add a small number called a pseudo-count or use probability models such as the multinomial and Dirichlet-multinomial distributions to explain the excess zero counts, which may produce unnecessary biases and impose a correlation structure taht is unsuitable for microbiome data.ResultsThe purpose of this article is to develop a new probabilistic model, called BERnoulli and MUltinomial Distribution-based latent Allocation (BERMUDA), to address these problems. BERMUDA enables us to describe the differences in bacteria composition and a certain disease among samples. We also provide a simple and efficient learning procedure for the proposed model using an annealing EM algorithm.ConclusionWe illustrate the performance of the proposed method both through both the simulation and real data analysis. BERMUDA is implemented with R and is available from GitHub (https://github.com/abikoushi/Bermuda).

[1]  M. Pop,et al.  Robust methods for differential abundance analysis in marker gene surveys , 2013, Nature Methods.

[2]  Jesse R. Zaneveld,et al.  Normalization and microbial differential abundance strategies depend upon data characteristics , 2017, Microbiome.

[3]  James T. Morton,et al.  Parkinson's disease and Parkinson's disease medications have distinct signatures of the gut microbiome , 2017, Movement disorders : official journal of the Movement Disorder Society.

[4]  Hongzhe Li,et al.  A two-part mixed-effects model for analyzing longitudinal microbiome compositional data , 2016, Bioinform..

[5]  E. Pekkonen,et al.  Gut microbiota are related to Parkinson's disease and clinical phenotype , 2015, Movement disorders : official journal of the Movement Disorder Society.

[6]  H. Qin,et al.  The role of gut microbiota in the pathogenesis of colorectal cancer , 2013, Tumor Biology.

[7]  Charles E. Robertson,et al.  Application of Two-Part Statistics for Comparison of Sequence Variant Counts , 2011, PloS one.

[8]  A. Heintz‐Buschart,et al.  The nasal and gut microbiome in Parkinson's disease and idiopathic rapid eye movement sleep behavior disorder , 2017, Movement disorders : official journal of the Movement Disorder Society.

[9]  Hongzhe Li,et al.  A Logistic Normal Multinomial Regression Model for Microbiome Compositional Data Analysis , 2013, Biometrics.

[10]  I. Saltykova,et al.  Analysis of Gut Microbiota in Patients with Parkinson’s Disease , 2017, Bulletin of Experimental Biology and Medicine.

[11]  G. Deuschl,et al.  Gut microbiota in Parkinson disease in a northern German cohort , 2017, Brain Research.

[12]  J Paul Brooks,et al.  Challenges for case-control studies with microbiome data. , 2016, Annals of epidemiology.

[13]  Jens Roat Kultima,et al.  Potential of fecal microbiota for early‐stage detection of colorectal cancer , 2014 .