Computational Framework for Prediction of Peptide Sequences That May Mediate Multiple Protein Interactions in Cancer-Associated Hub Proteins

A considerable proportion of protein-protein interactions (PPIs) in the cell are estimated to be mediated by very short peptide segments that approximately conform to specific sequence patterns known as linear motifs (LMs), which are often found in the disordered regions of eukaryotic proteins. These peptides have been found to interact with low affinity and are able to bind multiple interactors, and thus play an important role in PPI networks involving date hubs. In this work, PPI data and a de novo motif identification method (MEME) were used to identify such peptides in three cancer-associated hub proteins: MYC, APC and MDM2. The peptides corresponding to the significant LMs identified for each hub protein were aligned, and the regions shared across these peptides were termed overlapping linear peptides (OLPs). These OLPs were predicted to be responsible for multiple PPIs of the corresponding hub proteins, and a scoring system was developed to rank them. We predicted six OLPs in MYC and five OLPs in MDM2 that scored higher than OLP predictions from randomly generated protein sets. Two OLP sequences from the C-terminal region of MYC were predicted to bind FBXW7, a component of an E3 ubiquitin-protein ligase complex involved in proteasomal degradation of MYC. Similarly, we identified peptides in the C-terminal region of MDM2 that interact with FKBP3, which has a specific role in auto-ubiquitinylation of MDM2. The peptide sequences predicted in MYC and MDM2 are promising starting points for designing orthosteric inhibitors against possible disease-associated PPIs. Because these OLPs can also interact with other proteins, such inhibitors should be specific to the targeted interactor to prevent undesired side effects. This computational framework was designed to predict and rank the peptide regions that may mediate multiple PPIs, and it can be applied to other disease-associated date hub proteins to predict novel therapeutic targets for small-molecule PPI modulators.
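The overlap step described above can be illustrated with a minimal sketch. It assumes that each significant motif occurrence on the hub protein is available as a 0-based, half-open `(start, end)` interval, and it reports maximal regions covered by at least two such sites. The function name `overlapping_regions`, the `min_support` parameter and the example coordinates are hypothetical; the support count returned here is only a stand-in and is not the scoring system developed in this work.

```python
from typing import List, Tuple

Interval = Tuple[int, int]  # 0-based, half-open [start, end) on the hub sequence


def overlapping_regions(
    sites: List[Interval], min_support: int = 2
) -> List[Tuple[int, int, int]]:
    """Return maximal regions covered by at least `min_support` motif sites.

    Each result is (start, end, peak_support), where peak_support is the
    largest number of sites stacked anywhere inside the region.
    Illustrative only: this is an assumed interval sweep, not the paper's code.
    """
    events = []
    for start, end in sites:
        events.append((start, 1))   # a motif site opens
        events.append((end, -1))    # a motif site closes
    # At equal positions, process closings before openings so that sites
    # that merely touch end-to-start are not counted as overlapping.
    events.sort(key=lambda e: (e[0], e[1]))

    regions = []
    depth = 0            # number of sites covering the current position
    region_start = None
    peak = 0
    for pos, delta in events:
        prev = depth
        depth += delta
        if prev < min_support <= depth:
            # Coverage has just reached the threshold: a region begins.
            region_start = pos
            peak = depth
        elif depth >= min_support:
            # Still inside a region: track the deepest stacking seen.
            peak = max(peak, depth)
        elif prev >= min_support > depth:
            # Coverage has dropped below the threshold: the region ends.
            if pos > region_start:
                regions.append((region_start, pos, peak))
            region_start = None
    return regions


# Hypothetical motif sites on a hub protein (coordinates are made up).
example_sites = [(10, 20), (15, 25), (18, 30), (40, 50)]
example_regions = overlapping_regions(example_sites)
```

With these made-up coordinates, the only region shared by two or more sites spans positions 15 to 25, and slicing the hub sequence with `sequence[start:end]` would give the candidate peptide. A real pipeline would take the site coordinates from the motif discovery output and rank the resulting regions with the scoring system developed in this work, which this sketch does not attempt to reproduce.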