Investigation and identification of functional post-translational modification sites associated with drug binding and protein-protein interactions

BackgroundProtein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM’s function is critical to our ability to manipulate the biological mechanisms of protein.ResultsIn this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking tools for exploring the structural characteristics of PTMs, is presented. In addition, all tertiary structures of PTM sites on proteins can be visualized using the JSmol program.ConclusionResolving the function of PTM sites is important for understanding the role that proteins play in biological mechanisms. Our work attempted to delineate the structural correlation between PTM sites and PPI or drug-target binding. CurxPTM could help scientists narrow the scope of their PTM research and enhance the efficiency of PTM identification in the face of big proteome data. CruxPTM is now available at http://csb.cse.yzu.edu.tw/CruxPTM/.

[1]  Hsien-Da Huang,et al.  SNOSite: Exploiting Maximal Dependence Decomposition to Identify Cysteine S-Nitrosylation with Substrate Site Specificity , 2011, PloS one.

[2]  David S. Goodsell,et al.  The RCSB protein data bank: integrative view of protein, gene and 3D structural information , 2016, Nucleic Acids Res..

[3]  G. Rocklin,et al.  Phosphorylation of RhoGDI by Src regulates Rho GTPase binding and cytosol-membrane cycling. , 2006, Molecular biology of the cell.

[4]  Jonathan D. G. Jones,et al.  Evidence for Network Evolution in an Arabidopsis Interactome Map , 2011, Science.

[5]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[6]  Kai-Cheng Hsu,et al.  iGEMDOCK: a graphical environment of enhancing GEMDOCK using pharmacological interactions and post-screening analysis , 2011, BMC Bioinformatics.

[7]  Stefanie Dimmeler,et al.  Akt-Dependent Phosphorylation of p21Cip1 Regulates PCNA Binding and Proliferation of Endothelial Cells , 2001, Molecular and Cellular Biology.

[8]  Arnaud Céol,et al.  3did: a catalog of domain-based interactions of known three-dimensional structure , 2013, Nucleic Acids Res..

[9]  K. Robert Lai,et al.  UbiNet: an online resource for exploring the functional associations and regulatory networks of protein ubiquitylation , 2016, Database J. Biol. Databases Curation.

[10]  J. Kornhauser,et al.  PhosphoSite: A bioinformatics resource dedicated to physiological protein phosphorylation , 2004, Proteomics.

[11]  Hsien-Da Huang,et al.  KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns , 2007, Nucleic Acids Res..

[12]  Hsien-Da Huang,et al.  dbPTM: an information repository of protein post-translational modification , 2005, Nucleic Acids Res..

[13]  Ann M Stock,et al.  Phosphorylation-dependent conformational changes and domain rearrangements in Staphylococcus aureus VraR activation , 2013, Proceedings of the National Academy of Sciences.

[14]  Mehran Yazdanian,et al.  The “High Solubility” Definition of the Current FDA Guidance on Biopharmaceutical Classification System May Be Too Strict for Acidic Drugs , 2004, Pharmaceutical Research.

[15]  Stephen L. Abrams,et al.  Roles of the Raf/MEK/ERK pathway in cell growth, malignant transformation and drug resistance. , 2007, Biochimica et biophysica acta.

[16]  Yu-Ju Chen,et al.  dbSNO 2.0: a resource for exploring structural environment, functional and disease association and regulatory network of protein S-nitrosylation , 2014, Nucleic Acids Res..

[17]  Hsien-Da Huang,et al.  RegPhos: a system to explore the protein kinase–substrate phosphorylation network in humans , 2010, Nucleic Acids Res..

[18]  Tzong-Yi Lee,et al.  topPTM: a new module of dbPTM for identifying functional post-translational modifications in transmembrane proteins , 2013, Nucleic Acids Res..

[19]  Angel Herráez,et al.  Biomolecules in the computer: Jmol to the rescue , 2006, Biochemistry and molecular biology education : a bimonthly publication of the International Union of Biochemistry and Molecular Biology.

[20]  Brian Raught,et al.  Eukaryotic Translation Initiation Factor 4EAvailability Controls the Switch between Cap-Dependent andInternal Ribosomal Entry Site-MediatedTranslation , 2005, Molecular and Cellular Biology.

[21]  Mehdi Mollapour,et al.  Impact of Posttranslational Modifications on the Anticancer Activity of Hsp90 Inhibitors. , 2016, Advances in cancer research.

[22]  Dmitrij Frishman,et al.  Tissue-specific sequence and structural environments of lysine acetylation sites. , 2015, Journal of structural biology.

[23]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2009 update , 2009, Nucleic Acids Res..

[24]  Daniel C. Anthony,et al.  Expanding the diversity of chemical protein modification allows post-translational mimicry , 2007, Nature.

[25]  Alejandro Garcia,et al.  UbiProt: a database of ubiquitylated proteins , 2007, BMC Bioinformatics.

[26]  Wei Wang,et al.  A novel multi-alignment pipeline for high-throughput sequencing data , 2014, Database J. Biol. Databases Curation.

[27]  Joshua S Waitzman,et al.  Survey of phosphorylation near drug binding sites in the Protein Data Bank (PDB) and their effects , 2015, Proteins.

[28]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[29]  Hsien-Da Huang,et al.  dbSNO: a database of cysteine S-nitrosylation , 2012, Bioinform..

[30]  Tzong-Yi Lee,et al.  PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity , 2011, BMC Bioinformatics.

[31]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[32]  K. Bhalla,et al.  Targeting HSP90 for cancer therapy , 2009, British Journal of Cancer.

[33]  Katrin Stierand,et al.  PoseView -- molecular interaction patterns at a glance , 2010, J. Cheminformatics.

[34]  A. Panchenko,et al.  Phosphorylation in protein-protein binding: effect on stability and function. , 2011, Structure.

[35]  Shao-Wei Huang,et al.  Accurate Prediction of Protein Catalytic Residues by Side Chain Orientation and Residue Contact Density , 2012, PloS one.

[36]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[37]  T. Pawson,et al.  Reading protein modifications with interaction domains , 2006, Nature Reviews Molecular Cell Biology.

[38]  Evelyn Jabri,et al.  Urease activity in the crystalline state , 1995, Protein science : a publication of the Protein Society.

[39]  Javier De Las Rivas,et al.  Protein–Protein Interactions Essentials: Key Concepts to Building and Analyzing Interactome Networks , 2010, PLoS Comput. Biol..

[40]  Joachim Selbig,et al.  Detection and characterization of 3D-signature phosphorylation site motifs and their contribution towards improved phosphorylation site prediction in proteins , 2009, BMC Bioinformatics.

[41]  Jorng-Tzong Horng,et al.  Incorporating structural characteristics for identification of protein methylation sites , 2009, J. Comput. Chem..

[42]  Keith Burridge,et al.  The 'invisible hand': regulation of RHO GTPases by RHOGDIs , 2011, Nature Reviews Molecular Cell Biology.

[43]  Patrick Aloy,et al.  Interrogating protein interaction networks through structural biology , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Pierrick Craveur,et al.  PTM-SD: a database of structurally resolved and annotated posttranslational modifications in proteins , 2014, Database J. Biol. Databases Curation.

[45]  S. Gygi,et al.  Regulation of 4E-BP1 phosphorylation: a novel two-step mechanism. , 1999, Genes & development.

[46]  François Schiettecatte,et al.  OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders , 2014, Nucleic Acids Res..

[47]  Hsien-Da Huang,et al.  RegPhos 2.0: an updated resource to explore protein kinase–substrate phosphorylation networks in mammals , 2014, Database J. Biol. Databases Curation.

[48]  A. Vinayagam,et al.  A Directed Protein Interaction Network for Investigating Intracellular Signal Transduction , 2011, Science Signaling.

[49]  Hsien-Da Huang,et al.  dbPTM 3.0: an informative resource for investigating substrate site specificity and functional association of protein post-translational modifications , 2012, Nucleic Acids Res..

[50]  François Schiettecatte,et al.  OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders , 2014, Nucleic Acids Res..

[51]  J. De las Rivas,et al.  Protein-protein interaction networks: unraveling the wiring of molecular machines within the cell. , 2012, Briefings in functional genomics.

[52]  P. Bork,et al.  Functional organization of the yeast proteome by systematic analysis of protein complexes , 2002, Nature.

[53]  Hsien-Da Huang,et al.  dbPTM 2016: 10-year anniversary of a resource for post-translational modification of proteins , 2015, Nucleic Acids Res..

[54]  S. Grizot,et al.  Crystal structure of the Rac1-RhoGDI complex involved in nadph oxidase activation. , 2001, Biochemistry.

[55]  A. Gartel,et al.  The Role of the Cyclin-dependent Kinase Inhibitor p 21 in Apoptosis 1 , 2002 .

[56]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[57]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[58]  Anna Tramontano,et al.  Phospho3D 2.0: an enhanced database of three-dimensional structures of phosphorylation sites , 2010, Nucleic Acids Res..

[59]  Allegra Via,et al.  Phospho3D: a database of three-dimensional structures of protein phosphorylation sites , 2006, Nucleic Acids Res..

[60]  Irina Neganova,et al.  An Important Role for CDK2 in G1 to S Checkpoint Activation and DNA Damage Response in Human Embryonic Stem Cells , 2011, Stem cells.

[61]  Tien Dung Nguyen,et al.  LymPHOS 2.0: an update of a phosphosite database of primary human T cells , 2015, Database J. Biol. Databases Curation.

[62]  Jorng-Tzong Horng,et al.  KinasePhos: a web tool for identifying protein kinase-specific phosphorylation sites , 2005, Nucleic Acids Res..

[63]  Tzong-Yi Lee,et al.  Incorporating substrate sequence motifs and spatial amino acid composition to identify kinase-specific phosphorylation sites on protein three-dimensional structures , 2013, BMC Bioinformatics.