Modeling Longitudinal Microbiome Compositional Data: A Two-Part Linear Mixed Model with Shared Random Effects

Longitudinal microbiome studies have been widely used to unveil the dynamics in the complex host-microbial ecosystems. Modeling the longitudinal microbiome compositional data, which is semi-continuous in nature, is challenging in several aspects: the overabundance of zeros, the heavy skewness of non-zero values that are bounded in (0, 1), and the dependence between the binary and non-zero parts. To deal with these challenges, we first extended the work of Chen and Li [1] and proposed a two-part zero-inflated Beta regression model with shared random effects (ZIBR-SRE), which characterize the dependence between the binary and the continuous parts. Besides, the microbiome compositional data have unit-sum constraint, indicating the existence of negative correlations among taxa. As ZIBR-SRE models each taxon separately, it does not satisfy the sum-to-one constraint. We then proposed a two-part linear mixed model (TPLMM) with shared random effects to formulate the log-transformed standardized relative abundances rather than the original ones. Such transformation is called “additive logistic transformation”, initially developed for cross-sectional compositional data. We extended it to analyze the longitudinal microbiome compositions and showed that the unit-sum constraint can be automatically satisfied under the TPLMM framework. Model performances of TPLMM and ZIBR-SRE were compared with existing methods in simulation studies. Under settings adopted from real data, TPLMM had the best performance and is recommended for practical use. An oral microbiome application further showed that TPLMM and ZIBR-SRE estimated a strong correlation structure in the binary and the continuous parts, suggesting models without accounting for this dependence would lead to biased inferences.

[1]  A. Eck,et al.  Long Term Development of Gut Microbiota Composition in Atopic Children: Impact of Probiotics , 2015, PloS one.

[2]  Lawrence A. David,et al.  Diet rapidly and reproducibly alters the human gut microbiome , 2013, Nature.

[3]  E. Vogtmann,et al.  Temporal Variability of Oral Microbiota over 10 Months and the Implications for Future Epidemiologic Studies , 2018, Cancer Epidemiology, Biomarkers & Prevention.

[4]  Hongzhe Li Microbiome, Metagenomics, and High-Dimensional Compositional Data Analysis , 2015 .

[5]  David J. Edwards,et al.  Hypothesis Testing and Power Calculations for Taxonomic-Based Human Microbiome Data , 2012, PloS one.

[6]  A. Kostic,et al.  The microbiome in inflammatory bowel disease: current status and the future ahead. , 2014, Gastroenterology.

[7]  M. Silverberg,et al.  Analyzing the Human Microbiome: A “How To” guide for Physicians , 2014, The American Journal of Gastroenterology.

[8]  Charles E. Robertson,et al.  Application of Two-Part Statistics for Comparison of Sequence Variant Counts , 2011, PloS one.

[9]  Tobias Kollmann,et al.  Early infancy microbial and metabolic alterations affect risk of childhood asthma , 2015, Science Translational Medicine.

[10]  Roberto Romero,et al.  The composition and stability of the vaginal microbiota of normal pregnant women is different from that of non-pregnant women , 2014, Microbiome.

[11]  V. Tremaroli,et al.  Dynamics and Stabilization of the Human Gut Microbiome during the First Year of Life. , 2015, Cell host & microbe.

[12]  Gang Li,et al.  Zero-Inflated Beta Regression for Differential Abundance Analysis with Metagenomics Data , 2016, J. Comput. Biol..

[13]  Yanjiao Zhou,et al.  Longitudinal Analysis of the Premature Infant Intestinal Microbiome Prior to Necrotizing Enterocolitis: A Case-Control Study , 2015, PloS one.

[14]  E. Vogtmann,et al.  Epidemiologic studies of the human microbiome and cancer , 2016, British Journal of Cancer.

[15]  Tommi Vatanen,et al.  The dynamics of the human infant gut microbiome in development and in progression toward type 1 diabetes. , 2015, Cell host & microbe.

[16]  F. Bushman,et al.  Linking Long-Term Dietary Patterns with Gut Microbial Enterotypes , 2011, Science.

[17]  P. Turnbaugh,et al.  Microbial ecology: Human gut microbes associated with obesity , 2006, Nature.

[18]  Karoline Faust,et al.  Metagenomics meets time series analysis: unraveling microbial community dynamics. , 2015, Current opinion in microbiology.

[19]  Georg K Gerber,et al.  The dynamic microbiome , 2014, FEBS letters.

[20]  Hongzhe Li,et al.  A two-part mixed-effects model for analyzing longitudinal microbiome compositional data , 2016, Bioinform..

[21]  Raydonal Ospina,et al.  A general class of zero-or-one inflated beta regression models , 2011, Comput. Stat. Data Anal..

[22]  M. C. Jones,et al.  The Statistical Analysis of Compositional Data , 1986 .

[23]  C. Huttenhower,et al.  Metagenomic microbial community profiling using unique clade-specific marker genes , 2012, Nature Methods.