Eukaryotic Protein Domains as Functional Units of Cellular Evolution

Clustering proteins into groups on the basis of their domain compositions provides insight into protein evolution. Domains for Change Protein domains endow proteins with specific activities and the ability to interact with specific partners. Most protein domains occur in many proteins and most proteins have multiple domains, but the combinations of domains are far fewer than would be predicted, suggesting that there is evolutionary pressure that preserves certain domain combinations. Jin et al. use a proteome-wide clustering method to identify eukaryotic protein domain combinations that correlate with evolutionary change. Their analysis suggests that reciprocal interactions between a protein and its microenvironment constrain the repertoire of domains that control specific cellular functions. They analyzed the proteins in seven eukaryotic species and organized the domains into 1245 “domain clubs,” with the majority of clubs containing proteins with multiple distinct domains and proteins with rich interrelationships among members of different clubs. They grouped proteins on the basis of their domain clubs into functional trees and were able to place domains of unknown function into functional groups, as well as make predictions about the role domain evolution contributes to the evolution of protein function within a molecular environment, as well as to the evolution of molecular environments. Modular protein domains are functional units that can be modified through the acquisition of new intrinsic activities or by the formation of novel domain combinations, thereby contributing to the evolution of proteins with new biological properties. Here, we assign proteins to groups with related domain compositions and functional properties, termed “domain clubs,” which we use to compare multiple eukaryotic proteomes. This analysis shows that different domain types can take distinct evolutionary trajectories, which correlate with the conservation, gain, expansion, or decay of particular biological processes. Evolutionary jumps are associated with a domain that coordinately acquires a new intrinsic function and enters new domain clubs, thereby providing the modified domain with access to a new cellular microenvironment. We also coordinately analyzed the covalent and noncovalent interactions of different domain types to assess the molecular compartment occupied by each domain. This reveals that specific subsets of domains demarcate particular cellular processes, such as growth factor signaling, chromatin remodeling, apoptotic and inflammatory responses, or vesicular trafficking. We suggest that domains, and the proteins in which they reside, are selected during evolution through reciprocal interactions with protein domains in their local microenvironment. Based on this scheme, we propose a mechanism by which Tudor domains may have evolved to support different modes of epigenetic regulation and suggest a role for the germline group of mammalian Tudor domains in Piwi-regulated RNA biology.

[1]  Alfred Wittinghofer,et al.  GEFs and GAPs: Critical Elements in the Control of Small G Proteins , 2007, Cell.

[2]  Andrew D. Moore,et al.  Arrangements in the modular evolution of proteins. , 2008, Trends in biochemical sciences.

[3]  G. Hannon,et al.  The Piwi-piRNA Pathway Provides an Adaptive Defense in the Transposon Arms Race , 2007, Science.

[4]  Tony Pawson,et al.  Redirecting tyrosine kinase signaling to an apoptotic caspase pathway through chimeric adaptor proteins , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[5]  B. Peter,et al.  BAR Domains as Sensors of Membrane Curvature: The Amphiphysin BAR Structure , 2004, Science.

[6]  David S. Wishart,et al.  Improving the accuracy of protein secondary structure prediction using structural alignment , 2006, BMC Bioinformatics.

[7]  C. Orengo,et al.  Protein families and their evolution-a structural perspective. , 2005, Annual review of biochemistry.

[8]  Karen E. Pilcher,et al.  The Dictyostelium Kinome—Analysis of the Protein Kinases from a Simple Model Organism , 2006, PLoS genetics.

[9]  R. Sachidanandam,et al.  Proteomic analysis of murine Piwi proteins reveals a role for arginine methylation in specifying interaction with Tudor family members. , 2009, Genes & development.

[10]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[11]  A. Ridley Rho GTPases and actin dynamics in membrane protrusions and vesicle trafficking. , 2006, Trends in cell biology.

[12]  J. Tschopp,et al.  The PIDDosome, a Protein Complex Implicated in Activation of Caspase-2 in Response to Genotoxic Stress , 2004, Science.

[13]  John Kuriyan,et al.  The origin of protein interactions and allostery in colocalization , 2007, Nature.

[14]  C. Chothia,et al.  Evolution of the Protein Repertoire , 2003, Science.

[15]  Tony Pawson,et al.  Protein modules and signalling networks , 1995, Nature.

[16]  Elisabeth Bock,et al.  Structural basis for a direct interaction between FGFR1 and NCAM and evidence for a regulatory role of ATP. , 2003, Structure.

[17]  A. Caudy,et al.  A micrococcal nuclease homologue in RNAi effector complexes , 2003, Nature.

[18]  S. Wuchty Scale-free behavior in protein domain networks. , 2001, Molecular biology and evolution.

[19]  Adam Godzik,et al.  Comparative analysis of protein domain organization. , 2004, Genome research.

[20]  S. Teichmann,et al.  Multi-domain protein families and domain pairs: comparison with known structures and a random model of domain recombination , 2004, Journal of Structural and Functional Genomics.

[21]  A. Force,et al.  Preservation of duplicate genes by complementary, degenerative mutations. , 1999, Genetics.

[22]  R. Evans,et al.  The Spt6 SH2 domain binds Ser2-P RNAPII to direct Iws1-dependent mRNA splicing and export. , 2007, Genes & development.

[23]  J. Lis,et al.  The RNA processing exosome is linked to elongating RNA polymerase II in Drosophila , 2002, Nature.

[24]  C. Chothia,et al.  Structure, function and evolution of multidomain proteins. , 2004, Current opinion in structural biology.

[25]  A. Elofsson,et al.  Multi-domain proteins in the three kingdoms of life: orphan domains and other unassigned regions. , 2005, Journal of molecular biology.

[26]  Kristiina Vuori,et al.  GEF what? Dock180 and related proteins help Rac to polarize cells in new ways. , 2007, Trends in cell biology.

[27]  Kumaran Kandasamy,et al.  An evaluation of human protein-protein interaction data in the public domain , 2006, BMC Bioinformatics.

[28]  Frances M. G. Pearl,et al.  Protein folds, functions and evolution. , 1999, Journal of molecular biology.

[29]  I. Kerr,et al.  Jaks and Stats in signaling by the cytokine receptor superfamily. , 1995, Trends in genetics : TIG.

[30]  E. Davidson The Regulatory Genome: Gene Regulatory Networks In Development And Evolution , 2006 .

[31]  Gerard Manning,et al.  The protist, Monosiga brevicollis, has a tyrosine kinase signaling network more elaborate and diverse than found in any known metazoan , 2008, Proceedings of the National Academy of Sciences.

[32]  Yi Zhang,et al.  Recognition of Histone H3 Lysine-4 Methylation by the Double Tudor Domain of JMJD2A , 2006, Science.

[33]  Thomas Franz,et al.  Loss of the Mili-interacting Tudor domain–containing protein-1 activates transposons and alters the Mili-associated small RNA profile , 2009, Nature Structural &Molecular Biology.

[34]  I. Campbell,et al.  Building proteins with fibronectin type III modules. , 1994, Structure.

[35]  S. Carroll Evo-Devo and an Expanding Evolutionary Synthesis: A Genetic Theory of Morphological Evolution , 2008, Cell.

[36]  S. Teichmann,et al.  The relationship between domain duplication and recombination. , 2005, Journal of molecular biology.

[37]  Peer Bork,et al.  SMART 5: domains in the context of genomes and networks , 2005, Nucleic Acids Res..

[38]  David P. Bartel,et al.  Early origins and evolution of microRNAs and Piwi-interacting RNAs in animals , 2008, Nature.

[39]  I D Campbell,et al.  Protein modules. , 1991, Trends in biochemical sciences.

[40]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[41]  Jodie J. Yin,et al.  A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes , 2004, Genome Biology.

[42]  Tom J. Petty,et al.  Methylated lysine 79 of histone H3 targets 53BP1 to DNA double-strand breaks , 2004, Nature.

[43]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[44]  G. Dreyfuss,et al.  The Spinal Muscular Atrophy Disease Gene Product, SMN, and Its Associated Protein SIP1 Are in a Complex with Spliceosomal snRNP Proteins , 1997, Cell.

[45]  T. Jongens,et al.  Arginine methylation of Piwi proteins, catalyzed by dPRMT5, is required for Ago3 and Aub stability , 2009, Nature Cell Biology.

[46]  P. Bork,et al.  Evolution of the phospho-tyrosine signaling machinery in premetazoan lineages , 2008, Proceedings of the National Academy of Sciences.

[47]  M. Ptashne,et al.  Separation of DNA binding from the transcription-activating function of a eukaryotic regulatory protein. , 1986, Science.

[48]  Haifan Lin,et al.  Mili Interacts with Tudor Domain-Containing Protein 1 in Regulating Spermatogenesis , 2009, Current Biology.

[49]  T. Pawson,et al.  Mouse Piwi interactome identifies binding mechanism of Tdrkh Tudor domain to arginine methylated Miwi , 2009, Proceedings of the National Academy of Sciences.

[50]  M. Mann,et al.  SH2 Signaling in a Lower Eukaryote: A STAT Protein That Regulates Stalk Cell Differentiation in Dictyostelium , 1997, Cell.

[51]  T. Pawson,et al.  Assembly of Cell Regulatory Systems Through Protein Interaction Domains , 2003, Science.

[52]  C. Ponting,et al.  The natural history of protein domains. , 2002, Annual review of biophysics and biomolecular structure.

[53]  David Botstein,et al.  Homology of a yeast actin-binding protein to signal transduction proteins and myosin-I , 1990, Nature.

[54]  Sebastian Maurer-Stroh,et al.  The Tudor domain 'Royal Family': Tudor, plant Agenet, Chromo, PWWP and MBT domains. , 2003, Trends in biochemical sciences.

[55]  O. Silvennoinen,et al.  The multifunctional human p100 protein 'hooks' methylated ligands , 2007, Nature Structural &Molecular Biology.

[56]  H. H. Park,et al.  The death domain superfamily in intracellular signaling of apoptosis and inflammation. , 2007, Annual review of immunology.

[57]  Yi Zhang,et al.  Tudor, MBT and chromo domains gauge the degree of lysine methylation , 2006, EMBO reports.

[58]  Pierre Brézellec,et al.  Gene fusion/fission is a major contributor to evolution of multi-domain bacterial proteins , 2006, Bioinform..

[59]  E. Koonin,et al.  Evolution of protein domain promiscuity in eukaryotes. , 2008, Genome research.

[60]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[61]  G. Dreyfuss,et al.  The SMN–SIP1 Complex Has an Essential Role in Spliceosomal snRNP Biogenesis , 1997, Cell.

[62]  E. E. Schmidt,et al.  The origins of polypeptide domains , 2007, BioEssays : news and reviews in molecular, cellular and developmental biology.

[63]  T. Pawson,et al.  Proteomic, Functional, and Domain-Based Analysis of In Vivo 14-3-3 Binding Proteins Involved in Cytoskeletal Regulation and Cellular Organization , 2004, Current Biology.

[64]  B. Mayer,et al.  SH3 domains: complexity in moderation. , 2001, Journal of cell science.

[65]  D. Eisenberg,et al.  Detecting protein function and protein-protein interactions from genome sequences. , 1999, Science.

[66]  Eugene Berezikov,et al.  A Role for Piwi and piRNAs in Germ Cell Maintenance and Transposon Silencing in Zebrafish , 2007, Cell.

[67]  M. Snyder,et al.  Targeting of Chitin Synthase 3 to Polarized Growth Sites in Yeast Requires Chs5p and Myo2p , 1997, The Journal of cell biology.

[68]  T. Pawson,et al.  The human and mouse complement of SH2 domain proteins-establishing the boundaries of phosphotyrosine signaling. , 2006, Molecular cell.

[69]  Andrew F Neuwald,et al.  Computational analysis of protein tyrosine phosphatases: practical guide to bioinformatics and data resources. , 2005, Methods.

[70]  P. Bork,et al.  Protein domain analysis in the era of complete genomes , 2002, FEBS letters.

[71]  N. Nakatsuji,et al.  Tudor-related proteins TDRD1/MTR-1, TDRD6 and TDRD7/TRAP: domain composition, intracellular localization, and function in male germ cells in mice. , 2007, Developmental biology.