Microbial Similarity between Students in a Common Dormitory Environment Reveals the Forensic Potential of Individual Microbial Signatures

Humans leave behind a microbial trail, regardless of intention. This may allow for the identification of individuals based on the “microbial signatures” they shed in built environments. In a shared living environment, these trails intersect, and through interaction with common surfaces may become homogenized, potentially confounding our ability to link individuals to their associated microbiota. We sought to understand the factors that influence the mixing of individual signatures and how best to process sequencing data to best tease apart these signatures. ABSTRACT The microbiota of the built environment is an amalgamation of both human and environmental sources. While human sources have been examined within single-family households or in public environments, it is unclear what effect a large number of cohabitating people have on the microbial communities of their shared environment. We sampled the public and private spaces of a college dormitory, disentangling individual microbial signatures and their impact on the microbiota of common spaces. We compared multiple methods for marker gene sequence clustering and found that minimum entropy decomposition (MED) was best able to distinguish between the microbial signatures of different individuals and was able to uncover more discriminative taxa across all taxonomic groups. Further, weighted UniFrac- and random forest-based graph analyses uncovered two distinct spheres of hand- or shoe-associated samples. Using graph-based clustering, we identified spheres of interaction and found that connection between these clusters was enriched for hands, implicating them as a primary means of transmission. In contrast, shoe-associated samples were found to be freely interacting, with individual shoes more connected to each other than to the floors they interact with. Individual interactions were highly dynamic, with groups of samples originating from individuals clustering freely with samples from other individuals, while all floor and shoe samples consistently clustered together. IMPORTANCE Humans leave behind a microbial trail, regardless of intention. This may allow for the identification of individuals based on the “microbial signatures” they shed in built environments. In a shared living environment, these trails intersect, and through interaction with common surfaces may become homogenized, potentially confounding our ability to link individuals to their associated microbiota. We sought to understand the factors that influence the mixing of individual signatures and how best to process sequencing data to best tease apart these signatures.

[1]  Xubo Song,et al.  Performance of Microbiome Sequence Inference Methods in Environments with Varying Biomass , 2019, mSystems.

[2]  J. Siegel,et al.  Filter forensics: microbiota recovery from residential HVAC filters , 2018, Microbiome.

[3]  Sune Lehmann,et al.  Constrained information flows in temporal networks reveal intermittent communities , 2017, Physical review. E.

[4]  August E. Woerner,et al.  Targeted sequencing of clade-specific markers from skin microbiomes for forensic human identification. , 2018, Forensic science international. Genetics.

[5]  N. Segata,et al.  Shotgun metagenomics, from sampling to analysis , 2017, Nature Biotechnology.

[6]  August E. Woerner,et al.  Forensic Human Identification Using Skin Microbiomes , 2017, Applied and Environmental Microbiology.

[7]  Paul J. McMurdie,et al.  Exact sequence variants should replace operational taxonomic units in marker-gene data analysis , 2017, The ISME Journal.

[8]  A. Doxey,et al.  The Skin Microbiome of Cohabiting Couples , 2017, mSystems.

[9]  Peter E. Larsen,et al.  Bacterial colonization and succession in a newly opened hospital , 2017, Science Translational Medicine.

[10]  Vincent J. Denef,et al.  Are Oligotypes Meaningful Ecological and Phylogenetic Units? A Case Study of Microcystis in Freshwater Lakes , 2017, Front. Microbiol..

[11]  Jose A Navas-Molina,et al.  Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns , 2017, mSystems.

[12]  N. Fierer,et al.  Microbial analyses of airborne dust collected from dormitory rooms predict the sex of occupants , 2017, Indoor air.

[13]  Andreas Ziegler,et al.  ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R , 2015, 1508.04409.

[14]  Andrew J. Hoisington,et al.  Characterizing the bacterial communities in retail stores in the United States. , 2016, Indoor air.

[15]  Ben Nichols,et al.  VSEARCH: a versatile open source tool for metagenomics , 2016, PeerJ.

[16]  S. Holmes,et al.  Bioconductor Workflow for Microbiome Data Analysis: from raw reads to community analyses , 2016, F1000Research.

[17]  Allyson L. Byrd,et al.  Temporal Stability of the Human Skin Microbiome , 2016, Cell.

[18]  Paul J. McMurdie,et al.  DADA2: High resolution sample inference from Illumina amplicon data , 2016, Nature Methods.

[19]  Mihai Pop,et al.  A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity , 2016, npj Biofilms and Microbiomes.

[20]  Paul J. McMurdie,et al.  Bioconductor Workflow for Microbiome Data Analysis: from raw reads to community analyses , 2016, F1000Research.

[21]  Ulrich Bodenhofer,et al.  msa: an R package for multiple sequence alignment , 2015, Bioinform..

[22]  Steven E. Lindow,et al.  Relative and contextual contribution of different sources to the composition and abundance of indoor air bacteria in residences , 2015, Microbiome.

[23]  Jason Stenson,et al.  Humans differ in their personal microbial cloud , 2015, PeerJ.

[24]  Orin C. Shanks,et al.  Comparison of Sewage and Animal Fecal Microbiomes by Using Oligotyping Reveals Potential Human Fecal Indicators in Multiple Taxonomic Groups , 2015, Applied and Environmental Microbiology.

[25]  J. Gilbert,et al.  Athletic equipment microbiota are shaped by interactions with human skin , 2015, Microbiome.

[26]  William W. Nazaroff,et al.  Chamber Bioaerosol Study: Outdoor Air and Human Occupants as Sources of Indoor Airborne Microbes , 2015, PloS one.

[27]  Daniel Patrick Smith,et al.  Forensic analysis of the microbiome of phones and shoes , 2015, Microbiome.

[28]  Katherine H. Huang,et al.  Identifying personal microbiomes using metagenomic codes , 2015, Proceedings of the National Academy of Sciences.

[29]  M. Sogin,et al.  Minimum entropy decomposition: Unsupervised oligotyping for sensitive partitioning of high-throughput marker gene sequences , 2014, The ISME Journal.

[30]  M. Sogin,et al.  A single genus in the gut microbiome reflects host preference and specificity , 2014, The ISME Journal.

[31]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[32]  L. Amaral-Zettler,et al.  Oligotyping reveals community level habitat selection within the genus Vibrio , 2014, Front. Microbiol..

[33]  Dietmar Wolfram,et al.  Measuring Scholarly Impact: Methods and Practice , 2014 .

[34]  Rob Knight,et al.  Longitudinal analysis of microbial interaction between humans and the indoor environment , 2014, Science.

[35]  Susan M. Huse,et al.  Oligotyping analysis of the human oral microbiome , 2014, Proceedings of the National Academy of Sciences.

[36]  T. Sharpton An introduction to the analysis of shotgun metagenomic data , 2014, Front. Plant Sci..

[37]  Jeff Kline,et al.  Architectural Design Drives the Biogeography of Indoor Bacterial Communities , 2014, PloS one.

[38]  Pelin Yilmaz,et al.  The SILVA and “All-species Living Tree Project (LTP)” taxonomic frameworks , 2013, Nucleic Acids Res..

[39]  Mason A. Porter,et al.  Multilayer networks , 2013, J. Complex Networks.

[40]  James F. Meadow,et al.  Indoor airborne bacterial communities are influenced by ventilation, occupancy, and outdoor air source , 2013, Indoor air.

[41]  Ludvig Bohlin,et al.  Community detection and visualization of networks with the map equation framework , 2014 .

[42]  Sharon L. Grim,et al.  Oligotyping: differentiating between closely related microbial taxa using 16S rRNA gene data , 2013, Methods in ecology and evolution.

[43]  Robert C. Edgar,et al.  UPARSE: highly accurate OTU sequences from microbial amplicon reads , 2013, Nature Methods.

[44]  A. Arenas,et al.  Mathematical Formulation of Multilayer Networks , 2013, 1307.4977.

[45]  M. Sogin,et al.  A Filtering Method to Generate High Quality Short Reads Using Illumina Paired-End Technology , 2013, PloS one.

[46]  Noah Fierer,et al.  Home Life: Factors Structuring the Bacterial Diversity Found within and between Homes , 2013, PloS one.

[47]  Susan Holmes,et al.  phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data , 2013, PloS one.

[48]  Se Jin Song,et al.  Cohabiting family members share microbiota with one another and with their dogs , 2013, eLife.

[49]  Rob Knight,et al.  Diversity, distribution and sources of bacteria in residential kitchens. , 2013, Environmental microbiology.

[50]  Victor Kunin,et al.  Effects of OTU Clustering and PCR Artifacts on Microbial Diversity Estimates , 2012, Microbial Ecology.

[51]  Manuel J. Gómez,et al.  Exploring Bacterial Diversity in Hospital Environments by GS-FLX Titanium Pyrosequencing , 2012, PloS one.

[52]  W W Nazaroff,et al.  Size-resolved emission rates of airborne bacteria and fungi in an occupied classroom , 2012, Indoor air.

[53]  S. Kelley,et al.  Office Space Bacterial Abundance and Diversity in Three Metropolitan Areas , 2012, PloS one.

[54]  William W. Nazaroff,et al.  Human Occupancy as a Source of Indoor Airborne Bacteria , 2012, PloS one.

[55]  William A. Walters,et al.  Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms , 2012, The ISME Journal.

[56]  Jeff Kline,et al.  Architectural design influences the diversity and structure of the built environment microbiome , 2012, The ISME Journal.

[57]  R. Knight,et al.  Supervised classification of human microbiota. , 2011, FEMS microbiology reviews.

[58]  Klaus Peter Schliep,et al.  phangorn: phylogenetic analysis in R , 2010, Bioinform..

[59]  Martin Rosvall,et al.  Multilevel Compression of Random Walks on Networks Reveals Hierarchical Organization in Large Integrated Systems , 2010, PloS one.

[60]  H. Wickham ggplot2 , 2011 .

[61]  Susan M. Huse,et al.  Ironing out the wrinkles in the rare biosphere through improved OTU clustering , 2010, Environmental microbiology.

[62]  R. Knight,et al.  Forensic identification using skin bacterial communities , 2010, Proceedings of the National Academy of Sciences.

[63]  R. Knight,et al.  Bacterial Community Variation in Human Body Habitats Across Space and Time , 2009, Science.

[64]  Carl T. Bergstrom,et al.  The map equation , 2009, 0906.1405.

[65]  C. Deming,et al.  Topographical and Temporal Diversity of the Human Skin Microbiome , 2009, Science.

[66]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[67]  Brian J. Smith,et al.  boa: An R Package for MCMC Output Convergence Assessment and Posterior Inference , 2007 .

[68]  Hadley Wickham,et al.  Reshaping Data with the reshape Package , 2007 .

[69]  J. Tiedje,et al.  Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy , 2007, Applied and Environmental Microbiology.

[70]  A. Barabasi,et al.  Quantifying social group evolution , 2007, Nature.

[71]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[72]  S. Horvath,et al.  Unsupervised Learning With Random Forest Predictors , 2006 .

[73]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[74]  K. Konstantinidis,et al.  Genomic insights that advance the species definition for prokaryotes. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[75]  Robert C. Edgar,et al.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity , 2004, BMC Bioinformatics.

[76]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[77]  Korbinian Strimmer,et al.  APE: Analyses of Phylogenetics and Evolution in R language , 2004, Bioinform..

[78]  P. Dixon VEGAN, a package of R functions for community ecology , 2003 .

[79]  A. Barabasi,et al.  Evolution of the social network of scientific collaborations , 2001, cond-mat/0104162.

[80]  L. Amaral,et al.  The web of human sexual contacts , 2001, Nature.