Intrinsic disorder in pathogenic and non-pathogenic microbes: discovering and analyzing the unfoldomes of early-branching eukaryotes.

Parasitic protozoal infections have long been known to cause profound degrees of sickness and death in humans as well as animal populations. Despite the increase in the number of annotated genomes available for a large variety of protozoa, a great deal more has yet to be learned about them, from their fundamental physiology to mechanisms invoked during host-pathogen interactions. Most of these genomes share a common feature, namely a high prevalence of low complexity regions in their predicted proteins, which is believed to contribute to the uniqueness of the individual species within this diverse group of early-branching eukaryotes. In the case of Plasmodium species, which cause malaria, such regions have also been reported to hamper the identification of homologues, thus making functional genomics exceptionally challenging. One of the better accepted theories accounting for the high number of low complexity regions is the presence of intrinsic disorder in these microbes. In this study we compare the degree of disordered proteins that are predicted to be expressed in many such ancient eukaryotic cells. Our findings indicate an unusual bias in the amino acids comprising protozoal proteomes, and show that intrinsic disorder is remarkably abundant among their predicted proteins. Additionally, the intrinsically disordered regions tend to be considerably longer in the early-branching eukaryotes. An analysis of a Plasmodium falciparum interactome indicates that protein-protein interactions may be at least one function of the intrinsic disorder. This study provides a bioinfomatics basis for the discovery and analysis of unfoldomes (the complement of intrinsically disordered proteins in a given proteome) of early-branching eukaryotes. It also provides new insights into the evolution of intrinsic disorder in the context of adapting to a parasitic lifestyle and lays the foundation for further work on the subject.

[1]  A Keith Dunker,et al.  Characterization of molecular recognition features, MoRFs, and their binding partners. , 2007, Journal of proteome research.

[2]  Li Li,et al.  PlasmoDB: the Plasmodium genome resource. A database integrating experimental and computational data , 2003, Nucleic Acids Res..

[3]  V. Uversky,et al.  Disorder in the nuclear pore complex: The FG repeat regions of nucleoporins are natively unfolded , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[4]  A. Dunker,et al.  Disorder and sequence repeats in hub proteins and their implications for network evolution. , 2006, Journal of proteome research.

[5]  V. Uversky,et al.  Why are “natively unfolded” proteins unstructured under physiologic conditions? , 2000, Proteins.

[6]  E. Pizzi,et al.  Low-complexity regions in Plasmodium falciparum proteins. , 2001, Genome research.

[7]  L. Aravind,et al.  Plasmodium Biology Genomic Gleanings , 2003, Cell.

[8]  R. Yolken,et al.  Antibodies to Toxoplasma gondii in individuals with first-episode schizophrenia. , 2001, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[9]  J. Flegr Effects of toxoplasma on human behavior. , 2007, Schizophrenia bulletin.

[10]  南 宏和 水素濃度低下に光合成細菌や硫酸還元菌を利用したメタン発酵(Appl.Microbiol.Biotechnol Vol.27,1988) , 1988 .

[11]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. , 2007, Journal of proteome research.

[12]  BMC Bioinformatics , 2005 .

[13]  J. Remington,et al.  Biology of Toxoplasma gondii. , 1993, AIDS.

[14]  Lilia M. Iakoucheva,et al.  Intrinsic Disorder Is a Common Feature of Hub Proteins from Four Eukaryotic Interactomes , 2006, PLoS Comput. Biol..

[15]  L. Iakoucheva,et al.  Intrinsic Disorder and Protein Function , 2002 .

[16]  Zoran Obradovic,et al.  DisProt: the Database of Disordered Proteins , 2006, Nucleic Acids Res..

[17]  P. Romero,et al.  Sequence complexity of disordered protein , 2001, Proteins.

[18]  John B. Anderson,et al.  CDD: a Conserved Domain Database for protein classification , 2004, Nucleic Acids Res..

[19]  Vladimir N Uversky,et al.  Multiple aromatic side chains within a disordered structure are critical for transcription and transforming activity of EWS family oncoproteins , 2007, Proceedings of the National Academy of Sciences.

[20]  Haruki Nakamura,et al.  Disordered domains and high surface charge confer hubs with the ability to interact with multiple proteins in interaction networks , 2006, FEBS letters.

[21]  Gustavo Camps-Valls,et al.  Bioinformatics and Computational Biology , 2015, Encyclopedia of Data Warehousing and Mining.

[22]  Zoran Obradovic,et al.  DisProt: a database of protein disorder , 2005, Bioinform..

[23]  Wim G J Hol,et al.  Heterologous expression of proteins from Plasmodium falciparum: results from 1000 genes. , 2006, Molecular and biochemical parasitology.

[24]  M. Meissner,et al.  Molecular tools for analysis of gene function in parasitic microorganisms , 2007, Applied Microbiology and Biotechnology.

[25]  C. Reich,et al.  The Giardia genome project database. , 2000, FEMS microbiology letters.

[26]  Johannes Buchner,et al.  Protein folding handbook , 2005 .

[27]  Zoran Obradovic,et al.  The protein trinity—linking function and disorder , 2001, Nature Biotechnology.

[28]  Christopher J. Oldfield,et al.  Intrinsic disorder and functional proteomics. , 2007, Biophysical journal.

[29]  Marc S. Cortese,et al.  Coupled folding and binding with α-helix-forming molecular recognition elements , 2005 .

[30]  Li Li,et al.  ToxoDB: accessing the Toxoplasma gondii genome , 2003, Nucleic Acids Res..

[31]  Z. Obradovic,et al.  Identification and functions of usefully disordered proteins. , 2002, Advances in protein chemistry.

[32]  Marc S. Cortese,et al.  Analysis of molecular recognition features (MoRFs). , 2006, Journal of molecular biology.

[33]  V. Uversky,et al.  Protein folding revisited. A polypeptide chain at the folding – misfolding – nonfolding cross-roads: which way to go? , 2003, Cellular and Molecular Life Sciences CMLS.

[34]  Gregory A. Buck,et al.  The genome of Cryptosporidium hominis , 2004, Nature.

[35]  Marc S. Cortese,et al.  Comparing and combining predictors of mostly disordered proteins. , 2005, Biochemistry.

[36]  J. Rose,et al.  Emerging parasite zoonoses associated with water and food. , 2000, International journal for parasitology.

[37]  V. Uversky Intrinsically Disordered Proteins , 2000 .

[38]  R. Snow,et al.  A preliminary continental risk map for malaria mortality among African children. , 1999, Parasitology today.

[39]  V. Uversky,et al.  Lipid-binding activity of intrinsically unstructured cytoplasmic domains of multichain immune recognition receptor signaling subunits. , 2006, Biochemistry.

[40]  H. Dyson,et al.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm. , 1999, Journal of molecular biology.

[41]  Christopher J. Oldfield,et al.  Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins. , 2007, Journal of proteome research.

[42]  A. Elofsson,et al.  What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? , 2006, Genome Biology.

[43]  Christopher J. Oldfield,et al.  Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling , 2005, Journal of molecular recognition : JMR.

[44]  A. Dunker,et al.  Abundance of intrinsic disorder in protein associated with cardiovascular disease. , 2006, Biochemistry.

[45]  V. Uversky,et al.  The Saccharomyces cerevisiae Nucleoporin Nup2p Is a Natively Unfolded Protein* , 2002, The Journal of Biological Chemistry.

[46]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[47]  Vladimir N Uversky,et al.  What does it mean to be natively unfolded? , 2002, European journal of biochemistry.

[48]  H. Dyson,et al.  Intrinsically unstructured proteins and their functions , 2005, Nature Reviews Molecular Cell Biology.

[49]  A. Iglesias,et al.  Intrinsic disorder is a key characteristic in partners that bind 14‐3‐3 proteins , 2006, Proteins.

[50]  A Keith Dunker,et al.  Calmodulin signaling: Analysis and prediction of a disorder‐dependent molecular recognition , 2006, Proteins.

[51]  L. Iakoucheva,et al.  Intrinsic disorder in cell-signaling and cancer-associated proteins. , 2002, Journal of molecular biology.

[52]  Zoran Obradovic,et al.  Length-dependent prediction of protein intrinsic disorder , 2006, BMC Bioinformatics.

[53]  Stefan Wuchty,et al.  A draft of protein interactions in the malaria parasite P. falciparum. , 2007, Journal of proteome research.

[54]  Jonathan E. Allen,et al.  Genome sequence of the human malaria parasite Plasmodium falciparum , 2002, Nature.

[55]  V. Uversky Natively unfolded proteins: A point where biology waits for physics , 2002, Protein science : a publication of the Protein Society.

[56]  P. Tompa,et al.  Fuzzy complexes: polymorphism and structural disorder in protein-protein interactions. , 2008, Trends in biochemical sciences.

[57]  Renata C. Geer,et al.  Tutorial Section: Entrez: Making Use of Its Power , 2003, Briefings Bioinform..

[58]  L. Brocchieri,et al.  Low-complexity regions in Plasmodium proteins: in search of a function. , 2001, Genome research.

[59]  M Vihinen,et al.  Relationship of protein flexibility to thermostability. , 1987, Protein engineering.

[60]  A Keith Dunker,et al.  Functional anthology of intrinsic disorder. 2. Cellular components, domains, technical terms, developmental processes, and coding sequence diversities correlated with long disordered regions. , 2007, Journal of proteome research.

[61]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[62]  Isma'il ibn Ali al-Sadiq AIDS , 1986, The Lancet.

[63]  Jonathan E. Allen,et al.  Genome Sequence of Theileria parva, a Bovine Pathogen That Transforms Lymphocytes , 2005, Science.

[64]  S. Vucetic,et al.  Flavors of protein disorder , 2003, Proteins.

[65]  H. Dyson,et al.  Coupling of folding and binding for unstructured proteins. , 2002, Current opinion in structural biology.

[66]  Vladimir Vacic,et al.  Composition Profiler: a tool for discovery and visualization of amino acid composition differences , 2007, BMC Bioinformatics.

[67]  S. Hay,et al.  The global distribution of clinical episodes of Plasmodium falciparum malaria , 2005, Nature.

[68]  Amos Bairoch,et al.  ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins , 2006, Nucleic Acids Res..

[69]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[70]  Marc S. Cortese,et al.  Flexible nets , 2005, The FEBS journal.

[71]  Xiuzhen Zhang,et al.  Abundance of intrinsically unstructured proteins in P. falciparum and other apicomplexan parasite proteomes. , 2006, Molecular and biochemical parasitology.

[72]  P. Tompa Intrinsically unstructured proteins. , 2002, Trends in biochemical sciences.

[73]  A. Sigalov,et al.  Homooligomerization of the cytoplasmic domain of the T cell receptor zeta chain and of other proteins containing the immunoreceptor tyrosine-based activation motif. , 2004, Biochemistry.

[74]  N. Hoxie,et al.  Cryptosporidiosis-associated mortality following a massive waterborne outbreak in Milwaukee, Wisconsin. , 1997, American journal of public health.

[75]  P. Lansbury,et al.  NACP, a protein implicated in Alzheimer's disease and learning, is natively unfolded. , 1996, Biochemistry.

[76]  Ping Xu,et al.  Complete Genome Sequence of the Apicomplexan, Cryptosporidium parvum , 2004, Science.

[77]  A. Sigalov Immune cell signaling: a novel mechanistic model reveals new therapeutic targets. , 2006, Trends in pharmacological sciences.

[78]  Lindsay Sawyer,et al.  Caseins as rheomorphic proteins: interpretation of primary and secondary structures of the αS1-, β- and κ-caseins , 1993 .

[79]  E. Rorman,et al.  Congenital toxoplasmosis--prenatal aspects of Toxoplasma gondii infection. , 2006, Reproductive toxicology.

[80]  Vladislav Yu Orekhov,et al.  Binding of intrinsically disordered proteins is not necessarily accompanied by a structural transition to a folded form. , 2007, Biochimie.

[81]  Debasis Dash,et al.  Intrinsic disorder in yeast transcriptional regulatory network , 2007, Proteins.

[82]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[83]  E. Pfefferkorn,et al.  Interferon-gamma suppresses the growth of Toxoplasma gondii in human fibroblasts through starvation for tryptophan. , 1986, Molecular and biochemical parasitology.