Systematic Discovery of New Recognition Peptides Mediating Protein Interaction Networks

Many aspects of cell signalling, trafficking, and targeting are governed by interactions between globular protein domains and short peptide segments. These domains often bind multiple peptides that share a common sequence pattern, or “linear motif” (e.g., SH3 binding to PxxP). Many domains are known, though comparatively few linear motifs have been discovered. Their short length (three to eight residues), and the fact that they often reside in disordered regions in proteins makes them difficult to detect through sequence comparison or experiment. Nevertheless, each new motif provides critical molecular details of how interaction networks are constructed, and can explain how one protein is able to bind to very different partners. Here we show that binding motifs can be detected using data from genome-scale interaction studies, and thus avoid the normally slow discovery process. Our approach based on motif over-representation in non-homologous sequences, rediscovers known motifs and predicts dozens of others. Direct binding experiments reveal that two predicted motifs are indeed protein-binding modules: a DxxDxxxD protein phosphatase 1 binding motif with a K D of 22 μM and a VxxxRxYS motif that binds Translin with a K D of 43 μM. We estimate that there are dozens or even hundreds of linear motifs yet to be discovered that will give molecular insight into protein networks and greatly illuminate cellular processes.

[1]  B. Stillman,et al.  Coordinated leading and lagging strand synthesis during SV40 DNA replication in vitro requires PCNA , 1988, Cell.

[2]  D. E. Anderson,et al.  Expression and nitrogen-15 labeling of proteins for proton and nitrogen-15 nuclear magnetic resonance. , 1989, Methods in enzymology.

[3]  D. Barford,et al.  Purification and crystallization of the catalytic domain of human protein tyrosine phosphatase 1B expressed in Escherichia coli. , 1994, Journal of molecular biology.

[4]  M. Kretzschmar,et al.  A novel mediator of class II gene transcription with homology to viral immediate-early transcriptional regulators , 1994, Cell.

[5]  Roger Brent,et al.  Groucho is required for Drosophila neurogenesis, segmentation, and sex determination and interacts directly with hairy-related bHLH proteins , 1994, Cell.

[6]  R. Roeder,et al.  Purification, cloning, and characterization of a human coactivator, PC4, that mediates transcriptional activation of class II genes , 1994, Cell.

[7]  A. Omori,et al.  A novel gene, Translin, encodes a recombination hotspot binding protein associated with chromosomal translocations , 1995, Nature Genetics.

[8]  J. M. Boyd,et al.  Molecular cloning and characterization of a cellular phosphoprotein that interacts with a conserved C-terminal domain of adenovirus E1A involved in negative modulation of oncogenic transformation. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Kornelia Polyak,et al.  Mechanism of CDK activation revealed by the structure of a cyclinA-CDK2 complex , 1995, Nature.

[10]  W. Gu,et al.  Testis-brain RNA-binding protein, a testicular translational regulatory RNA-binding protein, is present in the brain and binds to the 3' untranslated regions of transported brain mRNAs. , 1995, Biology of reproduction.

[11]  L Serrano,et al.  Rational design of specific high-affinity peptide ligands for the Abl-SH3 domain. , 1996, Biochemistry.

[12]  Philip R. Cohen,et al.  Structural basis for the recognition of regulatory subunits by the catalytic subunit of protein phosphatase 1 , 1997, The EMBO journal.

[13]  P. Gros,et al.  C-terminal domain of transcription cofactor PC4 reveals dimeric ssDNA binding site , 1997, Nature Structural Biology.

[14]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[15]  T. Pawson,et al.  Signaling through scaffold, anchoring, and adaptor proteins. , 1997, Science.

[16]  Aris Floratos,et al.  Combinatorial pattern discovery in biological sequences: The TEIRESIAS algorithm [published erratum appears in Bioinformatics 1998;14(2): 229] , 1998, Bioinform..

[17]  Marius Sudol,et al.  From Src Homology domains to other signaling modules: proposal of the `protein recognition code' , 1998, Oncogene.

[18]  L. Serrano,et al.  Crystal structure of the abl-SH3 domain complexed with a designed high-affinity peptide ligand: implications for SH3-ligand interactions. , 1998, Journal of molecular biology.

[19]  W. Kaelin,et al.  The Elongin BC complex interacts with the conserved SOCS-box motif present in members of the SOCS, ras, WD-40 repeat, and ankyrin repeat families. , 1998, Genes & development.

[20]  G. Chinnadurai,et al.  Structural Determinants Present in the C-terminal Binding Protein Binding Site of Adenovirus Early Region 1A Proteins* , 1998, The Journal of Biological Chemistry.

[21]  M. Dottore,et al.  Identification of a 14-3-3 Binding Sequence in the Common β Chain of the Granulocyte-Macrophage Colony-Stimulating Factor (GM-CSF), Interleukin-3 (IL-3), and IL-5 Receptors That Is Serine-Phosphorylated by GM-CSF , 1999 .

[22]  M. Dottore,et al.  Identification of a 14-3-3 binding sequence in the common beta chain of the granulocyte-macrophage colony-stimulating factor (GM-CSF), interleukin-3 (IL-3), and IL-5 receptors that is serine-phosphorylated by GM-CSF. , 1999, Blood.

[23]  J. G. Zhang,et al.  The conserved SOCS box motif in suppressors of cytokine signaling binds to elongins B and C and may couple bound proteins to proteasomal degradation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Hideyuki Okano,et al.  WRM-1 Activates the LIT-1 Protein Kinase to Transduce Anterior/Posterior Polarity Signals in C. elegans , 1999, Cell.

[25]  C. Mello,et al.  MOM-4, a MAP kinase kinase kinase-related protein, activates WRM-1/LIT-1 kinase to transduce anterior/posterior polarity signals in C. elegans. , 1999, Molecular cell.

[26]  G. Jensen,et al.  Electron Crystal Structure of an RNA Polymerase II Transcription Elongation Complex , 1999, Cell.

[27]  James R. Knight,et al.  A comprehensive analysis of protein–protein interactions in Saccharomyces cerevisiae , 2000, Nature.

[28]  C. Nüsslein-Volhard,et al.  The molecular motor dynein is involved in targeting Swallow and bicoid RNA to the anterior pole of Drosophila oocytes , 2000, Nature Cell Biology.

[29]  E. Warbrick The puzzle of PCNA's many partners. , 2000, BioEssays : news and reviews in molecular, cellular and developmental biology.

[30]  H. Paudel,et al.  Neuronal Cdc2-like Protein Kinase (Cdk5/p25) Is Associated with Protein Phosphatase 1 and Phosphorylates Inhibitor-2* , 2001, The Journal of Biological Chemistry.

[31]  M. Bollen,et al.  Combinatorial control of protein phosphatase-1. , 2001, Trends in biochemical sciences.

[32]  R. Ozawa,et al.  A comprehensive two-hybrid analysis to explore the yeast protein interactome , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[33]  M. Yaffe,et al.  A motif-based profile scanning approach for genome-wide prediction of signaling pathways , 2001, Nature Biotechnology.

[34]  K. W. Lo,et al.  The 8-kDa Dynein Light Chain Binds to Its Targets via a Conserved (K/R)XTQT Motif* , 2001, The Journal of Biological Chemistry.

[35]  Chi-Hung Lin,et al.  Identification of a Mouse Thiamine Transporter Gene as a Direct Transcriptional Target for p53* , 2001, The Journal of Biological Chemistry.

[36]  Jeong Ho Chang,et al.  Structural basis for the recognition of the E2F transactivation domain by the retinoblastoma tumor suppressor. , 2002, Genes & development.

[37]  Kenneth R. Henry,et al.  Protein Phosphatase-1 Binding to Scd5p Is Important for Regulation of Actin Organization and Endocytosis in Yeast* , 2002, The Journal of Biological Chemistry.

[38]  A. Krainer,et al.  Nuclear Export and Retention Signals in the RS Domain of SR Proteins , 2002, Molecular and Cellular Biology.

[39]  Patrick Aloy,et al.  The third dimension for protein interactions and complexes. , 2002, Trends in biochemical sciences.

[40]  Peer Bork,et al.  Recent improvements to the SMART domain-based sequence annotation resource , 2002, Nucleic Acids Res..

[41]  B. Snel,et al.  Comparative assessment of large-scale data sets of protein–protein interactions , 2002, Nature.

[42]  G. Chinnadurai,et al.  CtBP, an unconventional transcriptional corepressor in development and oncogenesis. , 2002, Molecular cell.

[43]  S. Shenolikar,et al.  Neurabins Recruit Protein Phosphatase-1 and Inhibitor-2 to the Actin Cytoskeleton* , 2002, The Journal of Biological Chemistry.

[44]  Andrea Musacchio,et al.  How SH3 domains recognize proline. , 2002, Advances in protein chemistry.

[45]  Leszek Rychlewski,et al.  ELM server: a new resource for investigating short functional sites in modular eukaryotic proteins , 2003, Nucleic Acids Res..

[46]  Michael Sattler,et al.  High-resolution X-ray and NMR structures of the SMN Tudor domain: conformational variation in the binding site for symmetrically dimethylated arginine residues. , 2003, Journal of molecular biology.

[47]  James R. Knight,et al.  A Protein Interaction Map of Drosophila melanogaster , 2003, Science.

[48]  T. Pawson,et al.  Assembly of Cell Regulatory Systems Through Protein Interaction Domains , 2003, Science.

[49]  Ewan Birney,et al.  Discovering novel cis-regulatory motifs using functional networks. , 2003, Genome research.

[50]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[51]  Hanno Steen,et al.  Development of human protein reference database as an initial platform for approaching systems biology in humans. , 2003, Genome research.

[52]  Julius Brennecke,et al.  Identification of Drosophila MicroRNA Targets , 2003, PLoS biology.

[53]  Christian von Mering,et al.  STRING: a database of predicted functional associations between proteins , 2003, Nucleic Acids Res..

[54]  Michael R. Green,et al.  Arginine-serine-rich domains bound at splicing enhancers contact the branchpoint to promote prespliceosome assembly. , 2004, Molecular cell.

[55]  S. L. Wong,et al.  A Map of the Interactome Network of the Metazoan C. elegans , 2004, Science.

[56]  Roberto Dominguez,et al.  Structural basis of protein phosphatase 1 regulation , 2004, Nature.

[57]  L. Castagnoli,et al.  Protein Interaction Networks by Proteome Peptide Scanning , 2004, PLoS biology.

[58]  R. Lin,et al.  Phosphorylation by the β-Catenin/MAPK Complex Promotes 14-3-3-Mediated Nuclear Export of TCF/POP-1 in Signal-Responsive Cells in C. elegans , 2004, Cell.

[59]  Y. Shamoo,et al.  Structural and thermodynamic analysis of human PCNA with peptides derived from DNA polymerase-delta p66 subunit and flap endonuclease-1. , 2004, Structure.

[60]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[61]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[62]  R. Russell,et al.  Linear motifs: Evolutionary interaction switches , 2005, FEBS letters.

[63]  K. Lindblad-Toh,et al.  Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammals , 2005, Nature.

[64]  C. Benes,et al.  The C2 Domain of PKCδ Is a Phosphotyrosine Binding Domain , 2005, Cell.

[65]  C. Benes,et al.  The C2 domain of PKCdelta is a phosphotyrosine binding domain. , 2005, Cell.

[66]  John M. Walker,et al.  C. elegans , 2006, Methods in Molecular Biology.