A Guide to Enterotypes across the Human Body: Meta-Analysis of Microbial Community Structures in Human Microbiome Datasets

Recent analyses of human-associated bacterial diversity have categorized individuals into ‘enterotypes’ or clusters based on the abundances of key bacterial genera in the gut microbiota. There is a lack of consensus, however, on the analytical basis for enterotypes and on the interpretation of these results. We tested how the following factors influenced the detection of enterotypes: clustering methodology, distance metrics, OTU-picking approaches, sequencing depth, data type (whole genome shotgun (WGS) vs.16S rRNA gene sequence data), and 16S rRNA region. We included 16S rRNA gene sequences from the Human Microbiome Project (HMP) and from 16 additional studies and WGS sequences from the HMP and MetaHIT. In most body sites, we observed smooth abundance gradients of key genera without discrete clustering of samples. Some body habitats displayed bimodal (e.g., gut) or multimodal (e.g., vagina) distributions of sample abundances, but not all clustering methods and workflows accurately highlight such clusters. Because identifying enterotypes in datasets depends not only on the structure of the data but is also sensitive to the methods applied to identifying clustering strength, we recommend that multiple approaches be used and compared when testing for enterotypes.

[1]  F. Bushman,et al.  Linking Long-Term Dietary Patterns with Gut Microbial Enterotypes , 2011, Science.

[2]  Ravi Jain,et al.  Cluster Validating Techniques in the Presence of Duplicates , 2008, Computational Intelligence Paradigms.

[3]  Jonathan Krakoff,et al.  Energy-balance studies reveal associations between gut microbes, caloric load, and nutrient absorption in humans. , 2011, The American journal of clinical nutrition.

[4]  William A. Walters,et al.  Experimental and analytical tools for studying the human microbiome , 2011, Nature Reviews Genetics.

[5]  P. Gajer,et al.  Vaginal microbiome of reproductive-age women , 2010, Proceedings of the National Academy of Sciences.

[6]  D. Sinderen,et al.  Gut microbiota composition correlates with diet and health in the elderly , 2012, Nature.

[7]  P. Bork,et al.  Enterotypes of the human gut microbiome , 2011, Nature.

[8]  Katherine H. Huang,et al.  A framework for human microbiome research , 2012, Nature.

[9]  D. Relman,et al.  Incomplete recovery and individualized responses of the human distal gut microbiota to repeated antibiotic perturbation , 2010, Proceedings of the National Academy of Sciences.

[10]  L. T. Angenent,et al.  Succession of microbial consortia in the developing infant gut microbiome , 2010, Proceedings of the National Academy of Sciences.

[11]  Lynn K. Carmichael,et al.  Evaluation of 16S rDNA-Based Community Profiling for Human Microbiome Research , 2012, PloS one.

[12]  Rob Knight,et al.  Regulation of myocardial ketone body metabolism by the gut microbiota during nutrient deprivation , 2009, Proceedings of the National Academy of Sciences.

[13]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[14]  P. Bork,et al.  A human gut microbial gene catalogue established by metagenomic sequencing , 2010, Nature.

[15]  R. Knight,et al.  Bacterial Community Variation in Human Body Habitats Across Space and Time , 2009, Science.

[16]  R. Knight,et al.  Moving pictures of the human microbiome , 2011, Genome Biology.

[17]  Katherine H. Huang,et al.  Structure, Function and Diversity of the Healthy Human Microbiome , 2012, Nature.

[18]  R. Knight,et al.  Postprandial remodeling of the gut microbiota in Burmese pythons , 2010, The ISME Journal.

[19]  Peer Bork,et al.  Erratum: Enterotypes of the human gut microbiome (Nature (2011) 473 (174-180)) , 2011 .

[20]  C. Huttenhower,et al.  Metagenomic microbial community profiling using unique clade-specific marker genes , 2012, Nature Methods.

[21]  Eric P. Nawrocki,et al.  An improved Greengenes taxonomy with explicit ranks for ecological and evolutionary analyses of bacteria and archaea , 2011, The ISME Journal.

[22]  R. Knight,et al.  Delivery mode shapes the acquisition and structure of the initial microbiota across multiple body habitats in newborns , 2010, Proceedings of the National Academy of Sciences.

[23]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[24]  S. Massart,et al.  Impact of diet in shaping gut microbiota revealed by a comparative study in children from Europe and rural Africa , 2010, Proceedings of the National Academy of Sciences.

[25]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[26]  Lakhmi C. Jain,et al.  Computational Intelligence Paradigms , 2008 .

[27]  P. Turnbaugh,et al.  Microbial ecology: Human gut microbes associated with obesity , 2006, Nature.

[28]  André Hardy,et al.  An examination of procedures for determining the number of clusters in a data set , 1994 .

[29]  B. Roe,et al.  A core gut microbiome in obese and lean twins , 2008, Nature.

[30]  Robert Tibshirani,et al.  Cluster Validation by Prediction Strength , 2005 .

[31]  S. Dowd,et al.  Target Region Selection Is a Critical Determinant of Community Fingerprints Generated by 16S Pyrosequencing , 2011, PloS one.

[32]  P. Legendre,et al.  vegan : Community Ecology Package. R package version 1.8-5 , 2007 .

[33]  R. Knight,et al.  UniFrac: a New Phylogenetic Method for Comparing Microbial Communities , 2005, Applied and Environmental Microbiology.

[34]  Christian Hennig,et al.  Comparing latent class and dissimilarity based clustering for mixed type variables with application to social stratification , 2010 .

[35]  G. W. Milligan,et al.  An examination of procedures for determining the number of clusters in a data set , 1985 .