KatharoSeq Enables High-Throughput Microbiome Analysis from Low-Biomass Samples

Various indoor, outdoor, and host-associated environments contain small quantities of microbial biomass and represent a niche that is often understudied because of technical constraints. Many studies that attempt to evaluate these low-biomass microbiome samples are riddled with erroneous results that are typically false positive signals obtained during the sampling process. We have investigated various low-biomass kits and methods to determine the limit of detection of these pipelines. Here we present KatharoSeq, a high-throughput protocol combining laboratory and bioinformatic methods that can differentiate a true positive signal in samples with as few as 50 to 500 cells. We demonstrate the application of this method in three unique low-biomass environments, including a SAF, a hospital NICU, and an abalone-rearing facility. ABSTRACT Microbiome analyses of low-biomass samples are challenging because of contamination and inefficiencies, leading many investigators to employ low-throughput methods with minimal controls. We developed a new automated protocol, KatharoSeq (from the Greek katharos [clean]), that outperforms single-tube extractions while processing at least five times as fast. KatharoSeq incorporates positive and negative controls to reveal the whole bacterial community from inputs of as few as 50 cells and correctly identifies 90.6% (standard error, 0.013%) of the reads from 500 cells. To demonstrate the broad utility of KatharoSeq, we performed 16S rRNA amplicon and shotgun metagenome analyses of the Jet Propulsion Laboratory spacecraft assembly facility (SAF; n = 192, 96), 52 rooms of a neonatal intensive care unit (NICU; n = 388, 337), and an endangered-abalone-rearing facility (n = 192, 123), obtaining spatially resolved, unique microbiomes reproducible across hundreds of samples. The SAF, our primary focus, contains 32 sOTUs (sub-OTUs, defined as exact sequence matches) and their inferred variants identified by the deblur algorithm, with four (Acinetobacter lwoffii, Paracoccus marcusii, Mycobacterium sp., and Novosphingobium) being present in >75% of the samples. According to microbial spatial topography, the most abundant cleanroom contaminant, A. lwoffii, is related to human foot traffic exposure. In the NICU, we have been able to discriminate environmental exposure related to patient infectious disease, and in the abalone facility, we show that microbial communities reflect the marine environment rather than human input. Consequently, we demonstrate the feasibility and utility of large-scale, low-biomass metagenomic analyses using the KatharoSeq protocol. IMPORTANCE Various indoor, outdoor, and host-associated environments contain small quantities of microbial biomass and represent a niche that is often understudied because of technical constraints. Many studies that attempt to evaluate these low-biomass microbiome samples are riddled with erroneous results that are typically false positive signals obtained during the sampling process. We have investigated various low-biomass kits and methods to determine the limit of detection of these pipelines. Here we present KatharoSeq, a high-throughput protocol combining laboratory and bioinformatic methods that can differentiate a true positive signal in samples with as few as 50 to 500 cells. We demonstrate the application of this method in three unique low-biomass environments, including a SAF, a hospital NICU, and an abalone-rearing facility.

[1]  J. Merchant,et al.  Acinetobacter lwoffii infection and gastritis. , 2003, Microbes and infection.

[2]  Patrick K. H. Lee,et al.  The roles of the outdoors and occupants in contributing to a potential pan-microbiome of the built environment: a review , 2016, Microbiome.

[3]  William A. Walters,et al.  Improved Bacterial 16S rRNA Gene (V4 and V4-5) and Fungal Internal Transcribed Spacer Marker Gene Primers for Microbial Community Surveys , 2015, mSystems.

[4]  C. V. van Boven,et al.  Serratia marcescens infections in neonatal departments: description of an outbreak and review of the literature. , 1997, The Journal of hospital infection.

[5]  V. Mugoyela,et al.  Microbial contamination of nonsterile pharmaceuticals in public hospital settings , 2010, Therapeutics and clinical risk management.

[6]  William A. Walters,et al.  QIIME allows analysis of high-throughput community sequencing data , 2010, Nature Methods.

[7]  Brian C. Thomas,et al.  Microbes in the neonatal intensive care unit resemble those found in the gut of premature infants , 2014, Microbiome.

[8]  Rob Knight,et al.  Longitudinal analysis of microbial interaction between humans and the indoor environment , 2014, Science.

[9]  R. Knight,et al.  Molecular cartography of the human skin surface in 3D , 2015, Proceedings of the National Academy of Sciences.

[10]  Rob Knight,et al.  From Sample to Multi-Omics Conclusions in under 48 Hours , 2016, mSystems.

[11]  J. Romalde,et al.  New Vibrio species associated to molluscan microbiota: a review , 2014, Front. Microbiol..

[12]  Nicola K. Petty,et al.  BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons , 2011, BMC Genomics.

[13]  Sergey I. Nikolenko,et al.  SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing , 2012, J. Comput. Biol..

[14]  Michael Donley,et al.  UV irradiation and autoclave treatment for elimination of contaminating DNA from laboratory consumables. , 2010, Forensic science international. Genetics.

[15]  R. Knight,et al.  UniFrac: a New Phylogenetic Method for Comparing Microbial Communities , 2005, Applied and Environmental Microbiology.

[16]  Jose A Navas-Molina,et al.  Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns , 2017, mSystems.

[17]  G. Horneck,et al.  Microbial Existence in Controlled Habitats and Their Resistance to Space Conditions , 2014, Microbes and environments.

[18]  W W Nazaroff,et al.  Characterizing airborne fungal and bacterial concentrations and emission rates in six occupied children's classrooms. , 2015, Indoor air.

[19]  Alexey A. Gurevich,et al.  QUAST: quality assessment tool for genome assemblies , 2013, Bioinform..

[20]  Peter E. Larsen,et al.  Bacterial colonization and succession in a newly opened hospital , 2017, Science Translational Medicine.

[21]  E. W. Beals,et al.  Bray-curtis ordination: an effective strategy for analysis of multivariate ecological data , 1984 .

[22]  R. Knight,et al.  Forensic identification using skin bacterial communities , 2010, Proceedings of the National Academy of Sciences.

[23]  Rob Knight,et al.  EMPeror: a tool for visualizing high-throughput microbial community data , 2013, GigaScience.

[24]  Jeremiah J Minich,et al.  Improved Multiple Displacement Amplification (iMDA) and Ultraclean Reagents , 2014, BMC Genomics.

[25]  Jeff Kline,et al.  Architectural design influences the diversity and structure of the built environment microbiome , 2012, The ISME Journal.

[26]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[27]  Paul Turner,et al.  Reagent and laboratory contamination can critically impact sequence-based microbiome analyses , 2014, BMC Biology.

[28]  D. Brandon,et al.  Potential NICU Environmental Influences on the Neonate's Microbiome: A Systematic Review , 2015, Advances in neonatal care : official journal of the National Association of Neonatal Nurses.

[29]  William A. Walters,et al.  Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms , 2012, The ISME Journal.

[30]  M. Figueras,et al.  Diversity and pathogenecity of Vibrio species in cultured bivalve molluscs. , 2010, Environmental microbiology reports.

[31]  Brian J. Raney,et al.  Ragout—a reference-assisted assembly tool for bacterial genomes , 2014, Bioinform..

[32]  William B. Langdon,et al.  Performance of genetic programming optimised Bowtie2 on genome comparison and analytic testing (GCAT) benchmarks , 2015, BioData Mining.

[33]  Kasthuri Venkateswaran,et al.  Molecular bacterial community analysis of clean rooms where spacecraft are assembled. , 2007, FEMS microbiology ecology.

[34]  Derrick E. Wood,et al.  Kraken: ultrafast metagenomic sequence classification using exact alignments , 2014, Genome Biology.

[35]  K. Andree,et al.  'Candidatus Xenohaliotis californiensis', a newly described pathogen of abalone, Haliotis spp., along the west coast of North America. , 2000, International journal of systematic and evolutionary microbiology.

[36]  Alexander Mahnert,et al.  Cleanroom Maintenance Significantly Reduces Abundance but Not Diversity of Indoor Microbiomes , 2015, PloS one.

[37]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..