3D proteochemometrics: using three-dimensional information of proteins and ligands to address aspects of the selectivity of serine proteases.

The high similarity between certain sub-pockets of serine proteases may lead to low selectivity of protease inhibitors. Therefore the application of proteochemometrics (PCM), which quantifies the relationship between protein/ligand descriptors and affinity for multiple ligands and targets simultaneously, is useful to understand and improve the selectivity profiles of potential inhibitors. In this study, protein field-based PCM that uses knowledge-based and WaterMap derived fields to describe proteins in combination with 2D (RDKit and MOE fingerprints) and 3D (4 point pharmacophoric fingerprints and GRIND) ligand descriptors was used to model the bioactivities of 24 homologous serine proteases and 5863 inhibitors in an integrated fashion. Of the multiple field-based PCM models generated based on different ligand descriptors, RDKit fingerprints showed the best performance in terms of external prediction with Rtest2 of 0.72 and RMSEP of 0.81. Further, visual interpretation of the models highlights sub-pocket specific regions that influence affinity and selectivity of serine protease inhibitors.

[1]  S. Pickett,et al.  GRid-INdependent descriptors (GRIND): a novel class of alignment-independent three-dimensional molecular descriptors. , 2000, Journal of medicinal chemistry.

[2]  Paul Geladi,et al.  Principal Component Analysis , 1987, Comprehensive Chemometrics.

[3]  Andreas Bender,et al.  Modelling ligand selectivity of serine proteases using integrative proteochemometric approaches improves model performance and allows the multi-target dependent interpretation of features. , 2014, Integrative biology : quantitative biosciences from nano to macro.

[4]  O. Spjuth,et al.  Origin of aromatase inhibitory activity via proteochemometric modeling , 2016, PeerJ.

[5]  Sung-Hou Kim,et al.  Preparation, characterization, and the crystal structure of the inhibitor ZK-807834 (CI-1031) complexed with factor Xa. , 2000, Biochemistry.

[6]  G. Caron,et al.  Influence of conformation on GRIND-based three-dimensional quantitative structure-activity relationship (3D-QSAR). , 2007, Journal of medicinal chemistry.

[7]  A. Bender,et al.  Prediction of PARP Inhibition with Proteochemometric Modelling and Conformal Prediction , 2015, Molecular informatics.

[8]  J. Janc,et al.  Engineering inhibitors highly selective for the S1 sites of Ser190 trypsin-like serine protease drug targets. , 2002, Chemistry & biology.

[9]  Russell V. Lenth Design and Analysis of Experiments: Classical and Regression Approaches with SAS , 2009 .

[10]  Nina Nikolova-Jeliazkova,et al.  QSAR Applicability Domain Estimation by Projection of the Training Set in Descriptor Space: A Review , 2005, Alternatives to laboratory animals : ATLA.

[11]  T. Lundstedt,et al.  PLS modeling of chimeric MS04/MSH-peptide and MC1/MC3-receptor interactions reveals a novel method for the analysis of ligand-receptor interactions. , 2001, Biochimica et biophysica acta.

[12]  Jahan B. Ghasemi,et al.  Quantitative Characterization of the Interaction Space of the Mammalian Carbonic Anhydrase Isoforms I, II, VII, IX, XII, and XIV and their Inhibitors, Using the Proteochemometric Approach , 2016, Chemical biology & drug design.

[13]  X. Puente,et al.  Human and mouse proteases: a comparative genomic approach , 2003, Nature Reviews Genetics.

[14]  P. Prusis,et al.  Polypharmacology modelling using proteochemometrics (PCM): recent methodological developments, applications to target families, and future prospects , 2015 .

[15]  P. Prusis,et al.  Predictive proteochemometric models for kinases derived from 3D protein field-based descriptors , 2016 .

[16]  Aaron Shipway,et al.  Active site conformational changes of prostasin provide a new mechanism of protease regulation by divalent cations , 2009, Protein science : a publication of the Protein Society.

[17]  Julian E. Fuchs,et al.  Quantitative Correlation of Conformational Binding Enthalpy with Substrate Specificity of Serine Proteases , 2015, The journal of physical chemistry. B.

[18]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[19]  Gerard J. P. van Westen,et al.  Proteochemometric modeling as a tool to design selective compounds and for extrapolating to novel targets , 2011 .

[20]  P. Achermann,et al.  Impaired Postural Control in Healthy Men at Moderate Altitude (1630 M and 2590 M): Data from a Randomized Trial , 2015, PloS one.

[21]  Christopher M. Overall,et al.  Factor Xa subsite mapping by proteome-derived peptide libraries improved using WebPICS, a resource for proteomic identification of cleavage sites , 2011, Biological chemistry.

[22]  P. Prusis,et al.  Visually Interpretable Models of Kinase Selectivity Related Features Derived from Field-Based Proteochemometrics , 2013, J. Chem. Inf. Model..

[23]  Matthias Bethge,et al.  Beyond GLMs: A Generative Mixture Modeling Approach to Neural System Identification , 2012, PLoS Comput. Biol..

[24]  Hans Matter,et al.  Probing the subpockets of factor Xa reveals two binding modes for inhibitors based on a 2-carboxyindole scaffold: a study combining structure-activity relationship and X-ray crystallography. , 2005, Journal of medicinal chemistry.

[25]  Christian Kramer,et al.  Substrate-Driven Mapping of the Degradome by Comparison of Sequence Logos , 2013, PLoS Comput. Biol..

[26]  H. Nar,et al.  Structural basis for inhibition promiscuity of dual specific thrombin and factor Xa blood coagulation inhibitors. , 2001, Structure.