Overlapping Genes and Size Constraints in Viruses - An Evolutionary Perspective

Viruses are the simplest replicating units, characterized by a limited number of coding genes and an exceptionally high rate of overlapping genes. We sought a unified explanation for the evolutionary constraints that govern genome sizes, gene overlapping and capsid properties. We performed an unbiased statistical analysis over the ∼100 known viral families, and came to refute widespread assumptions regarding viral evolution. We found that the volume utilization of viral capsids is often low, and greatly varies among families. Most notably, we show that the total amount of gene overlapping is tightly bounded. Although viruses expand three orders of magnitude in genome length, their absolute amount of gene overlapping almost never exceeds 1500 nucleotides, and mostly confined to <4 significant overlapping instances. Our results argue against the common theory by which gene overlapping is driven by a necessity of viruses to compress their genome. Instead, we support the notion that overlapping has a role in gene novelty and evolution exploration.

[1]  R. Taylor,et al.  RNA replication errors and the evolution of virus pathogenicity and virulence. , 2014, Current opinion in virology.

[2]  Riccardo Bernasconi,et al.  How Viruses Hijack the ERAD Tuning Machinery , 2014, Journal of Virology.

[3]  E. Domingo,et al.  Exploration of sequence space as the basis of viral RNA genome segmentation , 2014, Proceedings of the National Academy of Sciences.

[4]  Jie Cui,et al.  An Allometric Relationship between the Genome Length and Virion Volume of Viruses , 2014, Journal of Virology.

[5]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[6]  J. F. Wright,et al.  AAV empty capsids: for better or for worse? , 2014, Molecular therapy : the journal of the American Society of Gene Therapy.

[7]  Angelo Pavesi,et al.  Viral Proteins Originated De Novo by Overprinting Can Be Identified by Codon Usage: Application to the “Gene Nursery” of Deltaretroviruses , 2013, PLoS Comput. Biol..

[8]  M. Daugherty,et al.  Identification of an overprinting gene in Merkel cell polyomavirus provides evolutionary insight into the birth of viral genes , 2013, Proceedings of the National Academy of Sciences.

[9]  W. Roos,et al.  Probing the biophysical interplay between a viral genome and its capsid. , 2013, Nature chemistry.

[10]  Hans Bitter,et al.  ViralZone: recent updates to the virus knowledge resource , 2012, Nucleic Acids Res..

[11]  Andreas Wagner,et al.  Evolution of Viral Proteins Originated De Novo by Overprinting , 2012, Molecular biology and evolution.

[12]  D. Raoult,et al.  Reclassification of Giant Viruses Composing a Fourth Domain of Life in the New Order Megavirales , 2012, Intervirology.

[13]  Marco Punta,et al.  AntiFam: a tool to help identify spurious ORFs in protein annotation , 2012, Database J. Biol. Databases Curation.

[14]  Michal Linial,et al.  Viral Proteins Acquired from a Host Converge to Simplified Domain Architectures , 2012, PLoS Comput. Biol..

[15]  Tatiana A. Tatusova,et al.  NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy , 2011, Nucleic Acids Res..

[16]  D. Rowlands,et al.  Formation of Higher-Order Foot-and-Mouth Disease Virus 3Dpol Complexes Is Dependent on Elongation Activity , 2011, Journal of Virology.

[17]  E. Koonin,et al.  Viruses with More Than 1,000 Genes: Mamavirus, a New Acanthamoeba polyphaga mimivirus Strain, and Reannotation of Mimivirus Genes , 2011, Genome biology and evolution.

[18]  E. Holmes What Does Virus Evolution Tell Us about Virus Origins? , 2011, Journal of Virology.

[19]  Norman E. Davey,et al.  How viruses hijack cell regulation. , 2011, Trends in biochemical sciences.

[20]  M. Lynch Evolution of the mutation rate. , 2010, Trends in genetics : TIG.

[21]  Robert Belshaw,et al.  Why genes overlap in viruses , 2010, Proceedings of the Royal Society B: Biological Sciences.

[22]  Patrick Forterre,et al.  Giant Viruses: Conflicts in Revisiting the Virus Concept , 2010, Intervirology.

[23]  A. Keith Dunker,et al.  Overlapping Genes Produce Proteins with Unusual Sequence Properties and Offer Insight into De Novo Protein Creation , 2009, Journal of Virology.

[24]  M. Bouvier,et al.  MHC class I antigen presentation: learning from viral evasion strategies , 2009, Nature Reviews Immunology.

[25]  Gavin J. D. Smith,et al.  Origins and evolutionary genomics of the 2009 swine-origin H1N1 influenza A epidemic , 2009, Nature.

[26]  D. Moreira,et al.  Ten reasons to exclude viruses from the tree of life , 2009, Nature Reviews Microbiology.

[27]  Kazuho Ikeo,et al.  Constrained evolution with respect to gene overlap of hepatitis B virus , 2009, Journal of Molecular Evolution.

[28]  S. Harvey,et al.  Packaging double-helical DNA into viral capsids: structures, forces, and energetics. , 2008, Biophysical journal.

[29]  E. Holmes,et al.  Rates of evolutionary change in viruses: patterns and determinants , 2008, Nature Reviews Genetics.

[30]  Andrew Rambaut,et al.  Pacing a small cage: mutation and RNA viruses , 2008, Trends in Ecology & Evolution.

[31]  W. Gelbart,et al.  Packaging of a polymer by a viral capsid: the interplay between polymer length and capsid size. , 2008, Biophysical journal.

[32]  Klaus Schulten,et al.  Stability and dynamics of virus capsids described by coarse-grained modeling. , 2006, Structure.

[33]  Edward C Holmes,et al.  Avian influenza virus exhibits rapid evolutionary dynamics. , 2006, Molecular biology and evolution.

[34]  V. Belyĭ,et al.  Electrostatic origin of the genome packing in viruses , 2006, Proceedings of the National Academy of Sciences.

[35]  Patrick Forterre,et al.  The origin of viruses and their possible roles in major evolutionary transitions. , 2006, Virus research.

[36]  Eugene V Koonin,et al.  Evolutionary genomics of nucleo-cytoplasmic large DNA viruses. , 2006, Virus research.

[37]  David Reguera,et al.  Classical nucleation theory of virus capsids. , 2006, Biophysical journal.

[38]  M. Vignuzzi,et al.  Quasispecies diversity determines pathogenesis through cooperative interactions in a viral population , 2006, Nature.

[39]  W. Miller,et al.  Translational control in positive strand RNA plant viruses. , 2006, Virology.

[40]  R. Fujinami,et al.  Molecular Mimicry, Bystander Activation, or Viral Persistence: Infections and Autoimmune Disease , 2006, Clinical Microbiology Reviews.

[41]  Chandrajit L. Bajaj,et al.  VIPERdb: a relational database for structural virology , 2005, Nucleic Acids Res..

[42]  Jean-Michel Claverie,et al.  Mimivirus and the emerging concept of "giant" virus. , 2005, Virus research.

[43]  Chris M. Brown,et al.  Detecting overlapping coding sequences in virus genomes , 2006, BMC Bioinformatics.

[44]  S. Salzberg,et al.  Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution , 2005, Nature.

[45]  Santiago F. Elena,et al.  Adaptive Value of High Mutation Rates of RNA Viruses: Separating Causes from Consequences , 2005, Journal of Virology.

[46]  Michal Galdzicki,et al.  Mammalian overlapping genes: the comparative perspective. , 2004, Genome research.

[47]  J. V. Etten,et al.  Unusual Life Style of Giant Chlorella Viruses , 2003 .

[48]  Eugene V. Koonin,et al.  Comparative genomics, minimal gene-sets and the last universal common ancestor , 2003, Nature Reviews Microbiology.

[49]  David R Nelson,et al.  Virus shapes and buckling transitions in spherical shells. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[50]  J. V. Van Etten,et al.  Unusual life style of giant chlorella viruses. , 2003, Annual review of genetics.

[51]  S. Harvey,et al.  Investigation of viral DNA packaging using molecular mechanics models. , 2002, Biophysical chemistry.

[52]  Eugene V Koonin,et al.  Purifying and directional selection in overlapping prokaryotic genes. , 2002, Trends in genetics : TIG.

[53]  D. Krakauer,et al.  Redundancy, antiredundancy, and the robustness of genomes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[54]  S. Frank Multiplicity of infection and the evolution of hybrid incompatibility in segmented viruses , 2001, Heredity.

[55]  Luigi Naldini,et al.  Viral vectors for gene therapy: the art of turning infectious agents into vehicles of therapeutics , 2001, Nature Medicine.

[56]  David C. Krakauer,et al.  STABILITY AND EVOLUTION OF OVERLAPPING GENES , 2000, Evolution; international journal of organic evolution.

[57]  E. Holmes,et al.  Evolutionary aspects of recombination in RNA viruses. , 1999, The Journal of general virology.

[58]  H. Ploegh Viral strategies of immune evasion. , 1998, Science.

[59]  A. Pavesi,et al.  On the Informational Content of Overlapping Genes in Prokaryotic and Eukaryotic Viruses , 1997, Journal of Molecular Evolution.

[60]  E. Domingo,et al.  Rapid evolution of viral RNA genomes. , 1997, The Journal of nutrition.

[61]  J. Kappes,et al.  The Vif protein of human and simian immunodeficiency viruses is packaged into virions and associates with viral core structures , 1995, Journal of virology.

[62]  R. Lamb,et al.  The remarkable coding strategy of borna disease virus: a new member of the nonsegmented negative strand RNA viruses. , 1995, Virology.

[63]  P. Keese,et al.  Origins of genes: "big bang" or continuous creation? , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Katherine Spindler,et al.  Rapid evolution of RNA genomes. , 1982, Science.

[65]  C. A. Hutchison,et al.  Overlapping genes in bacteriophage φX174 , 1976, Nature.

[66]  D. Baltimore Expression of animal virus genomes. , 1971, Bacteriological reviews.

[67]  A. Klug,et al.  Physical principles in the construction of regular viruses. , 1962, Cold Spring Harbor symposia on quantitative biology.