Predictive cartography of metal binders using generative topographic mapping

Generative topographic mapping (GTM) approach is used to visualize the chemical space of organic molecules (L) with respect to binding a wide range of 41 different metal cations (M) and also to build predictive models for stability constants (logK) of 1:1 (M:L) complexes using “density maps,” “activity landscapes,” and “selectivity landscapes” techniques. A two-dimensional map describing the entire set of 2962 metal binders reveals the selectivity and promiscuity zones with respect to individual metals or groups of metals with similar chemical properties (lanthanides, transition metals, etc). The GTM-based global (for entire set) and local (for selected subsets) models demonstrate a good predictive performance in the cross-validation procedure. It is also shown that the data likelihood could be used as a definition of the applicability domain of GTM-based models. Thus, the GTM approach represents an efficient tool for the predictive cartography of metal binders, which can both visualize their chemical space and predict the affinity profile of metals for new ligands.

[1]  Igor I. Baskin,et al.  Continuous indicator fields: a novel universal type of molecular fields , 2015, Journal of Computer-Aided Molecular Design.

[2]  Alexandre Varnek,et al.  Substructural fragments: an universal language to encode reactions, molecular and supramolecular structures , 2005, J. Comput. Aided Mol. Des..

[3]  A. Varnek,et al.  QSPR ensemble modelling of alkaline-earth metal complexation , 2013, Journal of Inclusion Phenomena and Macrocyclic Chemistry.

[4]  Alexandre Varnek,et al.  New Approach for Accurate QSPR Modeling of Metal Complexation: Application to Stability Constants of Complexes of Lanthanide Ions Ln 3+ , Ag + , Zn 2+ , Cd 2+ and Hg 2+ with Organic Ligands in Water , 2012 .

[5]  Dragos Horvath,et al.  Chemical Data Visualization and Analysis with Incremental Generative Topographic Mapping: Big Data Challenge , 2015, J. Chem. Inf. Model..

[6]  Igor I Baskin,et al.  Chemoinformatics as a Theoretical Chemistry Discipline , 2011, Molecular informatics.

[7]  Christopher M. Bishop,et al.  Developments of the generative topographic mapping , 1998, Neurocomputing.

[8]  Igor V. Tetko,et al.  Benchmarking of Linear and Nonlinear Approaches for Quantitative Structure-Property Relationship Studies of Metal Complexation with Ionophores , 2006, J. Chem. Inf. Model..

[9]  Igor I. Baskin,et al.  Fragment Descriptors in Virtual Screening , 2013 .

[10]  J. Dearden,et al.  QSAR modeling: where have you been? Where are you going to? , 2014, Journal of medicinal chemistry.

[11]  José L. Medina-Franco,et al.  Visualization of Molecular Fingerprints , 2011, J. Chem. Inf. Model..

[12]  Gilles Marcou,et al.  An Evolutionary Optimizer of libsvm Models , 2014 .

[13]  Corey Oses,et al.  Materials Cartography: Representing and Mining Material Space Using Structural and Electronic Fingerprints , 2014, 1412.4096.

[14]  D. Fourches,et al.  Successful “In Silico” Design of New Efficient Uranyl Binders , 2007 .

[15]  I. Pletnev,et al.  Classification of metal ions according to their complexing properties: a data-driven approach , 2002 .

[16]  Igor I. Baskin,et al.  Chapter 1:Fragment Descriptors in SAR/QSAR/QSPR Studies, Molecular Similarity Analysis and in Virtual Screening , 2008 .

[17]  Héléna A. Gaspar,et al.  Generative Topographic Mapping (GTM): Universal Tool for Data Visualization, Structure‐Activity Modeling and Dataset Comparison , 2012, Molecular informatics.

[18]  Alexandre Varnek,et al.  Modeling of Ion Complexation and Extraction Using Substructural Molecular Fragments , 2000, J. Chem. Inf. Comput. Sci..

[19]  O. E. Polansky,et al.  Introduction to Similarity Searching in Chemistry , 2004 .

[20]  A. Varnek,et al.  Stability constants of complexes of Zn2+, Cd2+, and Hg2+ with organic ligands: QSPR consensus modeling and design of new metal binders , 2012, Journal of Inclusion Phenomena and Macrocyclic Chemistry.

[21]  Tudor I. Oprea,et al.  Chemography: the Art of Navigating in Chemical Space , 2000 .

[22]  Alban Arrault,et al.  Generative Topographic Mapping-Based Classification Models and Their Applicability Domain: Application to the Biopharmaceutics Drug Disposition Classification System (BDDCS) , 2013, J. Chem. Inf. Model..

[23]  I. Tetko,et al.  ISIDA - Platform for Virtual Screening Based on Fragment and Pharmacophoric Descriptors , 2008 .

[24]  Alexandre Varnek,et al.  QSPR Modeling of the AmIII/EuIII Separation Factor: How Far Can we Predict ? , 2007 .

[25]  Alexandre Varnek,et al.  QSPR ensemble modelling of the 1:1 and 1:2 complexation of Co2+, Ni2+, and Cu2+ with organic ligands: relationships between stability constants , 2014, Journal of Computer-Aided Molecular Design.

[26]  A. Varnek,et al.  Computer-aided design of new metal binders , 2008 .

[27]  Igor I. Baskin,et al.  Machine Learning Methods for Property Prediction in Chemoinformatics: Quo Vadis? , 2012, J. Chem. Inf. Model..

[28]  A. Varnek,et al.  Quantitative Structure—Property Relationships in Solvent Extraction and Complexation of Metals , 2010 .

[29]  Hugo Kubinyi,et al.  Chemogenomics in Drug Discovery: A Medicinal Chemistry Perspective , 2004 .

[30]  Igor I. Baskin,et al.  Stargate GTM: Bridging Descriptor and Activity Spaces , 2015, J. Chem. Inf. Model..

[31]  A. Varnek,et al.  Complexation of Mn2+, Fe2+, Y3+, La3+, Pb2+, and UO22+ with Organic Ligands: QSPR Ensemble Modeling of Stability Constants , 2012 .

[32]  Dragos Horvath,et al.  Generative Topographic Mapping Approach to Chemical Space Analysis , 2017 .

[33]  Héléna A. Gaspar,et al.  GTM‐Based QSAR Models and Their Applicability Domains , 2015, Molecular informatics.

[34]  Alexandre Varnek,et al.  Assessment of the Macrocyclic Effect for the Complexation of Crown-Ethers with Alkali Cations Using the Substructural Molecular Fragments Method. , 2002 .

[35]  H Kubinyi,et al.  Chemogenomics in drug discovery. , 2006, Ernst Schering Research Foundation workshop.

[36]  Satoru Kawai,et al.  An Algorithm for Drawing General Undirected Graphs , 1989, Inf. Process. Lett..

[37]  Dragos Horvath,et al.  Mappability of drug-like space: towards a polypharmacologically competent map of drug-relevant compounds , 2015, Journal of Computer-Aided Molecular Design.

[38]  Christopher M. Bishop,et al.  GTM: The Generative Topographic Mapping , 1998, Neural Computation.

[39]  Igor I Baskin,et al.  The One‐Class Classification Approach to Data Description and to Models Applicability Domain , 2010, Molecular informatics.

[40]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[41]  Ian T. Nabney,et al.  Data Visualization during the Early Stages of Drug Discovery , 2006, J. Chem. Inf. Model..