Marked Variability in the Extent of Protein Disorder within and between Viral Families

Intrinsically disordered regions in eukaryotic proteomes contain key signaling and regulatory modules and mediate interactions with many proteins. Many viral proteomes encode disordered proteins and modulate host factors through the use of short linear motifs (SLiMs) embedded within disordered regions. However, the degree of viral protein disorder across different viruses is not well understood, so we set out to establish the constraints acting on viruses, in terms of their use of disordered protein regions. We surveyed predicted disorder across 2,278 available viral genomes in 41 families, and correlated the extent of disorder with genome size and other factors. Protein disorder varies strikingly between viral families (from 2.9% to 23.1% of residues), and also within families. However, this substantial variation did not follow the established trend among their hosts, with increasing disorder seen across eubacterial, archaebacterial, protists, and multicellular eukaryotes. For example, among large mammalian viruses, poxviruses and herpesviruses showed markedly differing disorder (5.6% and 17.9%, respectively). Viral families with smaller genome sizes have more disorder within each of five main viral types (ssDNA, dsDNA, ssRNA+, dsRNA, retroviruses), except for negative single-stranded RNA viruses, where disorder increased with genome size. However, surveying over all viruses, which compares tiny and enormous viruses over a much bigger range of genome sizes, there is no strong association of genome size with protein disorder. We conclude that there is extensive variation in the disorder content of viral proteomes. While a proportion of this may relate to base composition, to extent of gene overlap, and to genome size within viral types, there remain important additional family and virus-specific effects. Differing disorder strategies are likely to impact on how different viruses modulate host factors, and on how rapidly viruses can evolve novel instances of SLiMs subverting host functions, such as innate and acquired immunity.

[1]  A. Dunker,et al.  Understanding protein non-folding. , 2010, Biochimica et biophysica acta.

[2]  P. Tompa,et al.  Dual coding in alternative reading frames correlates with intrinsic protein disorder , 2010, Proceedings of the National Academy of Sciences.

[3]  Dan S. Tawfik,et al.  Conformational diversity and protein evolution--a 60-year-old hypothesis revisited. , 2003, Trends in biochemical sciences.

[4]  Philip M. Murphy,et al.  Molecular mimicry and the generation of host defense protein diversity , 1993, Cell.

[5]  A. Mankertz,et al.  Gene expression of the human Torque Teno Virus isolate P/1C1. , 2008, Virology.

[6]  A. Keith Dunker,et al.  Overlapping Genes Produce Proteins with Unusual Sequence Properties and Offer Insight into De Novo Protein Creation , 2009, Journal of Virology.

[7]  J. Claverie,et al.  Horizontal gene transfer and nucleotide compositional anomaly in large DNA viruses , 2007, BMC Genomics.

[8]  Marc S. Cortese,et al.  Rational drug design via intrinsically disordered protein. , 2006, Trends in biotechnology.

[9]  Woei-Chyn Chu,et al.  Categorizing Host-Dependent RNA Viruses by Principal Component Analysis of Their Codon Usage Preferences , 2009, J. Comput. Biol..

[10]  A Keith Dunker,et al.  Conservation of intrinsic disorder in protein domains and families: II. functions of conserved disorder. , 2006, Journal of proteome research.

[11]  Richard J. Edwards,et al.  ELM—the database of eukaryotic linear motifs , 2011, Nucleic Acids Res..

[12]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[13]  Lukasz A. Kurgan,et al.  MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins , 2012, Bioinform..

[14]  Christine A. Orengo,et al.  Inferring Function Using Patterns of Native Disorder in Proteins , 2007, PLoS Comput. Biol..

[15]  A. E. Yeo,et al.  Genomic and molecular evolutionary analysis of a newly identified infectious agent (SEN virus) and its relationship to the TT virus family. , 2001, The Journal of infectious diseases.

[16]  Y. Bigot,et al.  Proteomic analysis of the Spodoptera frugiperda ascovirus 1a virion reveals 21 proteins. , 2009, The Journal of general virology.

[17]  Karlene H. Lynch,et al.  Genomic analysis and relatedness of P2-like phages of the Burkholderia cepacia complex , 2010, BMC Genomics.

[18]  J. S. Sodhi,et al.  Prediction and functional analysis of native disorder in proteins from the three kingdoms of life. , 2004, Journal of molecular biology.

[19]  P. Auewarakul Composition bias and genome polarity of RNA viruses , 2004, Virus Research.

[20]  P. Tompa,et al.  The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins. , 2005, Journal of molecular biology.

[21]  Christopher J. Oldfield,et al.  Do viral proteins possess unique biophysical features? , 2009, Trends in biochemical sciences.

[22]  Norman E. Davey,et al.  How viruses hijack cell regulation. , 2011, Trends in biochemical sciences.

[23]  S. Teichmann,et al.  Tight Regulation of Unstructured Proteins: From Transcript Synthesis to Protein Degradation , 2008, Science.

[24]  Albert H. Mao,et al.  Role of backbone-solvent interactions in determining conformational equilibria of intrinsically disordered proteins. , 2008, Journal of the American Chemical Society.

[25]  Lukasz Kurgan,et al.  Protein intrinsic disorder as a flexible armor and a weapon of HIV-1 , 2011, Cellular and Molecular Life Sciences.

[26]  L. Torrance,et al.  Role of plant virus movement proteins. , 2008, Methods in molecular biology.

[27]  M. Bolognesi,et al.  Function and Structure of Inherently Disordered Proteins This Review Comes from a Themed Issue on Proteins Edited Prediction of Non-folding Proteins and Regions Frequency of Disordered Regions Protein Evolution Partitioning Unstructured Proteins and Regions into Groups Involvement of Inherently Diso , 2022 .

[28]  Antonio Alcami,et al.  Viral mimicry of cytokines, chemokines and their receptors , 2003, Nature Reviews Immunology.

[29]  Zoran Obradovic,et al.  DisProt: the Database of Disordered Proteins , 2006, Nucleic Acids Res..

[30]  R. Kiss,et al.  Calcium‐induced tripartite binding of intrinsically disordered calpastatin to its cognate enzyme, calpain , 2008, FEBS letters.

[31]  D. Baltimore Expression of animal virus genomes. , 1971, Bacteriological reviews.

[32]  V. Uversky Natively unfolded proteins: A point where biology waits for physics , 2002, Protein science : a publication of the Protein Society.

[33]  Amos Bairoch,et al.  ViralZone: a knowledge resource to understand virus diversity , 2010, Nucleic Acids Res..

[34]  P. Rivailler,et al.  Complete Genomic Sequence of an Epstein-Barr Virus-Related Herpesvirus Naturally Infecting a New World Primate: a Defining Point in the Evolution of Oncogenic Lymphocryptoviruses , 2002, Journal of Virology.

[35]  Christian Schaefer,et al.  Protein secondary structure appears to be robust under in silico evolution while protein disorder appears not to be , 2010, Bioinform..

[36]  N. Sueoka,et al.  CORRELATION BETWEEN BASE COMPOSITION OF DEOXYRIBONUCLEIC ACID AND AMINO ACID COMPOSITION OF PROTEIN. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[37]  F. Guerlesquin,et al.  Protein–protein interaction inhibition (2P2I) combining high throughput and virtual screening: Application to the HIV-1 Nef protein , 2007, Proceedings of the National Academy of Sciences.

[38]  J. Roach,et al.  Paramecium bursaria Chlorella Virus 1 Proteome Reveals Novel Architectural and Regulatory Features of a Giant Virus , 2012, Journal of Virology.

[39]  Silvio C. E. Tosatto,et al.  ESpritz: accurate and fast prediction of protein disorder , 2012, Bioinform..

[40]  Jessica W. Chen Conversation of Intrinsic Disorder in Protein Domains and Families , 2005 .

[41]  M. Babu,et al.  The rules of disorder or why disorder rules. , 2009, Progress in biophysics and molecular biology.

[42]  J. Gill,et al.  Efficacy of bacteriophage therapy in a model of Burkholderia cenocepacia pulmonary infection. , 2010, The Journal of infectious diseases.

[43]  A Keith Dunker,et al.  Protein intrinsic disorder and human papillomaviruses: increased amount of disorder in E6 and E7 oncoproteins from high risk HPVs. , 2006, Journal of proteome research.

[44]  Monika Fuxreiter,et al.  Close encounters of the third kind: disordered domains and the interactions of proteins , 2009, BioEssays : news and reviews in molecular, cellular and developmental biology.

[45]  Christopher J. Oldfield,et al.  The unfoldomics decade: an update on intrinsically disordered proteins , 2008, BMC Genomics.

[46]  Stephanie Irausquin,et al.  The evolutionary biology of poxviruses. , 2010, Infection, genetics and evolution : journal of molecular epidemiology and evolutionary genetics in infectious diseases.

[47]  A. Dunker,et al.  Orderly order in protein intrinsic disorder distribution: disorder in 3500 proteomes from viruses and the three domains of life , 2012, Journal of biomolecular structure & dynamics.

[48]  J. Qiu,et al.  Molecular Characterization of Infectious Clones of the Minute Virus of Canines Reveals Unique Features of Bocaviruses , 2009, Journal of Virology.

[49]  Sonia Longhi,et al.  The C-terminal domain of measles virus nucleoprotein belongs to the class of intrinsically disordered proteins that fold upon binding to their physiological partner. , 2004, Virus research.

[50]  P. Tompa,et al.  Reduction in Structural Disorder and Functional Complexity in the Thermal Adaptation of Prokaryotes , 2010, PloS one.

[51]  R. Nussinov,et al.  Extended disordered proteins: targeting function with less scaffold. , 2003, Trends in biochemical sciences.

[52]  P. Tompa Intrinsically unstructured proteins. , 2002, Trends in biochemical sciences.

[53]  U. Höfle,et al.  Characterization of Two Novel Polyomaviruses of Birds by Using Multiply Primed Rolling-Circle Amplification of Their Genomes , 2006, Journal of Virology.

[54]  Peter Tompa,et al.  The relationship between proteome size, structural disorder and organism complexity , 2011, Genome Biology.

[55]  B. Murphy,et al.  Mutations in the C, D, and V open reading frames of human parainfluenza virus type 3 attenuate replication in rodents and primates. , 1999, Virology.

[56]  P. Romero,et al.  Intrinsic disorder in Viral Proteins Genome-Linked: experimental and predictive analyses , 2009, Virology Journal.

[57]  S. Roberts,et al.  A cyclin-binding motif in human papillomavirus type 18 (HPV18) E1^E4 is necessary for association with CDK-cyclin complexes and G2/M cell cycle arrest of keratinocytes, but is not required for differentiation-dependent viral genome amplification or L1 capsid protein expression. , 2011, Virology.

[58]  K. Saksela,et al.  Versatile retargeting of SH3 domain binding by modification of non‐conserved loop residues , 2007, FEBS letters.

[59]  A Keith Dunker,et al.  A comparative analysis of viral matrix proteins using disorder predictors , 2008, Virology Journal.

[60]  M. A. McClure,et al.  A Bioinformatics Approach to the Structure, Function, and Evolution of the Nucleoprotein of the Order Mononegavirales , 2011, PloS one.

[61]  István Simon,et al.  Molecular principles of the interactions of disordered proteins. , 2007, Journal of molecular biology.

[62]  A Keith Dunker,et al.  Protein intrinsic disorder toolbox for comparative analysis of viral proteins , 2008, BMC Genomics.

[63]  Ben Lehner,et al.  Intrinsic Protein Disorder and Interaction Promiscuity Are Widely Associated with Dosage Sensitivity , 2009, Cell.

[64]  Zsuzsanna Dosztányi,et al.  IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content , 2005, Bioinform..

[65]  Gerard Kian-Meng Goh,et al.  Protein intrinsic disorder and influenza virulence: the 1918 H1N1 and H5N1 viruses , 2009, Virology Journal.

[66]  Zsuzsanna Dosztányi,et al.  Prediction of Protein Binding Regions in Disordered Proteins , 2009, PLoS Comput. Biol..

[67]  Christopher J Oldfield,et al.  Viral disorder or disordered viruses: do viral proteins possess unique features? , 2010, Protein and peptide letters.

[68]  H. Dyson,et al.  Linking folding and binding. , 2009, Current opinion in structural biology.

[69]  P. Tompa,et al.  Structural Disorder in Eukaryotes , 2012, PloS one.