Nature and Intensity of Selection Pressure on CRISPR-Associated Genes

ABSTRACT The recently discovered CRISPR-Cas adaptive immune system is present in almost all archaea and many bacteria. It consists of cassettes of CRISPR repeats that incorporate spacers homologous to fragments of viral or plasmid genomes that are employed as guide RNAs in the immune response, along with numerous CRISPR-associated (cas) genes that encode proteins possessing diverse, only partially characterized activities required for the action of the system. Here, we investigate the evolution of the cas genes and show that they evolve under purifying selection that is typically much weaker than the median strength of purifying selection affecting genes in the respective genomes. The exceptions are the cas1 and cas2 genes that typically evolve at levels of purifying selection close to the genomic median. Thus, although these genes are implicated in the acquisition of spacers from alien genomes, they do not appear to be directly involved in an arms race between bacterial and archaeal hosts and infectious agents. These genes might possess functions distinct from and additional to their role in the CRISPR-Cas-mediated immune response. Taken together with evidence of the frequent horizontal transfer of cas genes reported previously and with the wide-spread microscale recombination within these genes detected in this work, these findings reveal the highly dynamic evolution of cas genes. This conclusion is in line with the involvement of CRISPR-Cas in antiviral immunity that is likely to entail a coevolutionary arms race with rapidly evolving viruses. However, we failed to detect evidence of strong positive selection in any of the cas genes.

[1]  Mark J. Gibbs,et al.  Sister-Scanning: a Monte Carlo procedure for assessing signals in recombinant sequences , 2000, Bioinform..

[2]  S. Yokoyama,et al.  X‐ray crystal structure of a CRISPR‐associated RAMP superfamily protein, Cmr5, from Thermus thermophilus HB8 , 2009, Proteins.

[3]  David Posada,et al.  An Exact Nonparametric Method for Inferring Mosaic Structure in Sequence Triplets , 2007, Genetics.

[4]  R. Terns,et al.  Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes. , 2008, Genes & development.

[5]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[6]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[7]  Stan J. J. Brouns,et al.  Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes , 2011, Biological chemistry.

[8]  Ziheng Yang,et al.  Statistical methods for detecting molecular adaptation , 2000, Trends in Ecology & Evolution.

[9]  J. Vogel,et al.  CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III , 2011, Nature.

[10]  Anders F. Andersson,et al.  Virus Population Dynamics and Acquired Virus Resistance in Natural Microbial Communities , 2008, Science.

[11]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[12]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[13]  J. García-Martínez,et al.  Short motif sequences determine the targets of the prokaryotic CRISPR defence system. , 2009, Microbiology.

[14]  Stan J. J. Brouns,et al.  Small CRISPR RNAs Guide Antiviral Defense in Prokaryotes , 2008, Science.

[15]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[16]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[17]  Stan J. J. Brouns,et al.  Evolution and classification of the CRISPR–Cas systems , 2011, Nature Reviews Microbiology.

[18]  R. Nielsen,et al.  Effect of recombination on the accuracy of the likelihood method for detecting positive selection at amino acid sites. , 2003, Genetics.

[19]  M. Touchon,et al.  The Small, Slow and Specialized CRISPR and Anti-CRISPR of Escherichia and Salmonella , 2010, PloS one.

[20]  Stephen H. Bryant,et al.  CD-Search: protein domain annotations on the fly , 2004, Nucleic Acids Res..

[21]  R. Terns,et al.  Interaction of the Cas6 riboendonuclease with CRISPR RNAs: recognition and cleavage. , 2011, Structure.

[22]  K. Zhou,et al.  Structural basis for DNase activity of a conserved protein implicated in CRISPR-mediated genome defense. , 2009, Structure.

[23]  Albert J R Heck,et al.  Structural basis for CRISPR RNA-guided DNA recognition by Cascade , 2011, Nature Structural &Molecular Biology.

[24]  Daniel H. Haft,et al.  A Guild of 45 CRISPR-Associated (Cas) Protein Families and Multiple CRISPR/Cas Subtypes Exist in Prokaryotic Genomes , 2005, PLoS Comput. Biol..

[25]  Darren Martin,et al.  RDP: detection of recombination amongst aligned sequences , 2000, Bioinform..

[26]  Jennifer A. Doudna,et al.  Sequence- and Structure-Specific RNA Processing by a CRISPR Endonuclease , 2010, Science.

[27]  Michelle G. Giglio,et al.  TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes , 2006, Nucleic Acids Res..

[28]  Adi Stern,et al.  Self-targeting by CRISPR: gene regulation or autoimmunity? , 2010, Trends in genetics : TIG.

[29]  A. Buckling,et al.  Bacteria-Phage Antagonistic Coevolution in Soil , 2011, Science.

[30]  James C. Hu Faculty Opinions recommendation of Self versus non-self discrimination during CRISPR RNA-directed immunity. , 2010 .

[31]  N. Grishin,et al.  A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action , 2006, Biology Direct.

[32]  J. S. Godde,et al.  The Repetitive DNA Elements Called CRISPRs and Their Associated Genes: Evidence of Horizontal Transfer Among Prokaryotes , 2006, Journal of Molecular Evolution.

[33]  L. Marraffini,et al.  CRISPR Interference Limits Horizontal Gene Transfer in Staphylococci by Targeting DNA , 2008, Science.

[34]  L. Marraffini,et al.  CRISPR interference: RNA-directed adaptive immunity in bacteria and archaea , 2010, Nature Reviews Genetics.

[35]  B. Graveley,et al.  RNA-Guided RNA Cleavage by a CRISPR RNA-Cas Protein Complex , 2009, Cell.

[36]  C. Dunnett Pairwise Multiple Comparisons in the Unequal Variance Case , 1980 .

[37]  J. Banfield,et al.  Rapidly evolving CRISPRs implicated in acquired resistance of microorganisms to viruses. , 2007, Environmental microbiology.

[38]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[39]  Eugene V Koonin,et al.  Unification of Cas protein families and a simple scenario for the origin and evolution of CRISPR-Cas systems , 2011, Biology Direct.

[40]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[41]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[42]  Andrew Emili,et al.  A dual function of the CRISPR–Cas system in bacterial antivirus immunity and DNA repair , 2011, Molecular microbiology.

[43]  James I Mullins,et al.  Potential impact of recombination on sitewise approaches for detecting positive natural selection. , 2003, Genetical research.

[44]  Inna Dubchak,et al.  Trends in Prokaryotic Evolution Revealed by Comparison of Closely Related Bacterial and Archaeal Genomes , 2008, Journal of bacteriology.

[45]  M. F. White,et al.  Structural and Functional Characterization of an Archaeal Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR)-associated Complex for Antiviral Defense (CASCADE)* , 2011, The Journal of Biological Chemistry.

[46]  Jennifer A. Doudna,et al.  Structures of the RNA-guided surveillance complex from a bacterial immune system , 2011, Nature.

[47]  R. Barrangou,et al.  CRISPR/Cas, the Immune System of Bacteria and Archaea , 2010, Science.

[48]  K. Crandall,et al.  Evaluation of methods for detecting recombination from DNA sequences: Computer simulations , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Philippe Horvath,et al.  Cas3 is a single‐stranded DNA nuclease and ATP‐dependent helicase in the CRISPR/Cas immune system , 2011, The EMBO journal.

[50]  John Maynard Smith,et al.  Analyzing the mosaic structure of genes , 1992, Journal of Molecular Evolution.

[51]  R. Barrangou,et al.  CRISPR Provides Acquired Resistance Against Viruses in Prokaryotes , 2007, Science.

[52]  Arcady R. Mushegian,et al.  Computational methods for Gene Orthology inference , 2011, Briefings Bioinform..

[53]  E. Koonin,et al.  A Novel Family of Sequence-specific Endoribonucleases Associated with the Clustered Regularly Interspaced Short Palindromic Repeats* , 2008, Journal of Biological Chemistry.

[54]  Neil Hall,et al.  Antagonistic coevolution accelerates molecular evolution , 2010, Nature.

[55]  D. M. Krylov,et al.  Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. , 2003, Genome research.

[56]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[57]  Inna Dubchak,et al.  ATGC: a database of orthologous genes from closely related prokaryotic genomes and a research platform for microevolution of prokaryotes , 2008, Nucleic Acids Res..

[58]  E V Koonin,et al.  Evolution of aminoacyl-tRNA synthetases--analysis of unique domain architectures and phylogenetic trees reveals a complex history of horizontal gene transfer events. , 1999, Genome research.

[59]  K. Crandall,et al.  A modified bootscan algorithm for automated identification of recombinant sequences and recombination breakpoints. , 2005, AIDS research and human retroviruses.

[60]  Philippe Horvath,et al.  Phage Response to CRISPR-Encoded Resistance in Streptococcus thermophilus , 2007, Journal of bacteriology.

[61]  O. Gascuel,et al.  New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. , 2010, Systematic biology.

[62]  A. Snijders,et al.  Comparative network clustering of direct repeats (DRs) and cas genes confirms the possibility of the horizontal transfer of CRISPR locus among bacteria. , 2010, Molecular phylogenetics and evolution.

[63]  Vincent Moulton,et al.  RDP3: a flexible and fast computer program for analyzing recombination , 2010, Bioinform..

[64]  E. Koonin,et al.  Is evolution Darwinian or/and Lamarckian? , 2009, Biology Direct.

[65]  M. Touchon,et al.  CRISPR Distribution within the Escherichia coli Species Is Not Suggestive of Immunity-Associated Diversifying Selection , 2011, Journal of bacteriology.

[66]  L. Marraffini,et al.  Invasive DNA, chopped and in the CRISPR. , 2009, Structure.

[67]  S. Sawyer,et al.  Possible emergence of new geminiviruses by frequent recombination. , 1999, Virology.

[68]  Sergei L. Kosakovsky Pond,et al.  Datamonkey: rapid detection of selective pressure on individual sites of codon alignments , 2005, Bioinform..