New CRISPR–Cas systems from uncultivated microbes

CRISPR–Cas systems provide microbes with adaptive immunity by employing short DNA sequences, termed spacers, that guide Cas proteins to cleave foreign DNA. Class 2 CRISPR–Cas systems are streamlined versions, in which a single RNA-bound Cas protein recognizes and cleaves target sequences. The programmable nature of these minimal systems has enabled researchers to repurpose them into a versatile technology that is broadly revolutionizing biological and clinical research. However, current CRISPR–Cas technologies are based solely on systems from isolated bacteria, leaving the vast majority of enzymes from organisms that have not been cultured untapped. Metagenomics, the sequencing of DNA extracted directly from natural microbial communities, provides access to the genetic material of a huge array of uncultivated organisms. Here, using genome-resolved metagenomics, we identify a number of CRISPR–Cas systems, including the first reported Cas9 in the archaeal domain of life, to our knowledge. This divergent Cas9 protein was found in little-studied nanoarchaea as part of an active CRISPR–Cas system. In bacteria, we discovered two previously unknown systems, CRISPR–CasX and CRISPR–CasY, which are among the most compact systems yet discovered. Notably, all required functional components were identified by metagenomics, enabling validation of robust in vivo RNA-guided DNA interference activity in Escherichia coli. Interrogation of environmental microbial communities combined with in vivo experiments allows us to access an unprecedented diversity of genomes, the content of which will expand the repertoire of microbe-based biotechnologies.
