Hierarchical non-negative matrix factorization using clinical information for microbial communities

Background The human microbiome forms very complex communities that consist of hundreds to thousands of different microorganisms that not only affect the host, but also participate in disease processes. Several state-of-the-art methods have been proposed for learning the structure of microbial communities and to investigate the relationship between microorganisms and host environmental factors. However, these methods were mainly designed to model and analyze single microbial communities that do not interact with or depend on other communities. Such methods therefore cannot comprehend the properties between interdependent systems in communities that affect host behavior and disease processes. Results We introduce a novel hierarchical Bayesian framework, called BALSAMICO (BAyesian Latent Semantic Analysis of MIcrobial COmmunities), which uses microbial metagenome data to discover the underlying microbial community structures and the associations between microbiota and their environmental factors. BALSAMICO models mixtures of communities in the framework of nonnegative matrix factorization, taking into account environmental factors. This method first proposes an efficient procedure for estimating parameters. A simulation then evaluates the accuracy of the estimated parameters. Finally, the method is used to analyze clinical data. In this analysis, we successfully detected bacteria related to colorectal cancer. These results show that the method not only accurately estimates the parameters needed to analyze the connections between communities of microbiota and their environments, but also allows for the effective detection of these communities in real-world circumstances.

[1]  Satoshi Matsumoto,et al.  Gut microbiota and colorectal cancer , 2016, Genes and Environment.

[2]  Jens Roat Kultima,et al.  Potential of fecal microbiota for early‐stage detection of colorectal cancer , 2014 .

[3]  T. Miller,et al.  Incidence of methanogenic bacteria in a sigmoidoscopy population: an association of methanogenic bacteria and diverticulosis. , 1986, Gut.

[4]  Jun Yu,et al.  The Composition of Colonic Commensal Bacteria According to Anatomical Localization in Colorectal Cancer , 2017 .

[5]  Hong Gu,et al.  BioMiCo: a supervised Bayesian model for inference of microbial community structure , 2015, Microbiome.

[6]  Toby Kenney,et al.  Learning Microbial Community Structures with Supervised and Unsupervised Non-negative Matrix Factorization , 2017, Microbiome.

[7]  M. Stephens Dealing with label switching in mixture models , 2000 .

[8]  S. Dusko Ehrlich,et al.  MetaHIT: The European Union Project on Metagenomics of the Human Intestinal Tract , 2011 .

[9]  V. Young,et al.  The gut microbiome in health and in disease , 2015, Current opinion in gastroenterology.

[10]  D. Relman,et al.  The Application of Ecological Theory Toward an Understanding of the Human Microbiome , 2012, Science.

[11]  H. Flint,et al.  Role of the gut microbiota in nutrition and health , 2018, British Medical Journal.

[12]  Chong Wang,et al.  Variational inference in nonconjugate models , 2012, J. Mach. Learn. Res..

[13]  Sandra Plancade,et al.  Inferring Aggregated Functional Traits from Metagenomic Data Using Constrained Non-negative Matrix Factorization: Application to Fiber Degradation in the Human Gut Microbiota , 2016, PLoS Comput. Biol..

[14]  P. Bork,et al.  Enterotypes of the human gut microbiome , 2011, Nature.

[15]  J. Dushoff,et al.  Functional Biogeography of Ocean Microbes Revealed through Non-Negative Matrix Factorization , 2012, PloS one.

[16]  M. Nieuwdorp,et al.  Role of the microbiome in energy regulation and metabolism. , 2014, Gastroenterology.

[17]  Amy M. Sheflin,et al.  Stool Microbiome and Metabolome Differences between Colorectal Cancer Patients and Healthy Adults , 2013, PloS one.

[18]  W. M. de Vos,et al.  Human Microbiota in Health and Disease , 2012 .

[19]  Y. Belkaid,et al.  Role of the Microbiota in Immunity and Inflammation , 2014, Cell.

[20]  B. Jiang,et al.  Analysis of Mucosa-Associated Microbiota in Colorectal Cancer , 2017, Medical science monitor : international medical journal of experimental and clinical research.

[21]  F. Bäckhed,et al.  Diet–microbiota interactions as moderators of human metabolism , 2016, Nature.

[22]  Katherine H. Huang,et al.  Structure, Function and Diversity of the Healthy Human Microbiome , 2012, Nature.

[23]  Chris Whidden,et al.  Interactions in the microbiome: communities of organisms and communities of genes , 2013, FEMS microbiology reviews.

[24]  Ali Taylan Cemgil,et al.  Bayesian Inference for Nonnegative Matrix Factorisation Models , 2009, Comput. Intell. Neurosci..

[25]  Jesse R. Zaneveld,et al.  Normalization and microbial differential abundance strategies depend upon data characteristics , 2017, Microbiome.