Chiral Cliffs: Investigating the Influence of Chirality on Binding Affinity

Chirality is understood by many as a binary concept: a molecule is either chiral or it is not. In terms of the action of a structure on polarized light, this is indeed true. When examined through the prism of molecular recognition, the answer becomes more nuanced. In this work, we investigated chiral behavior on protein–ligand binding: when does chirality make a difference in binding activity? Chirality is a property of the 3D structure, so recognition also requires an appreciation of the conformation. In many situations, the bioactive conformation is undefined. We set out to address this by defining and using several novel 2D descriptors to capture general characteristic features of the chiral center. Using machine‐learning methods, we built different predictive models to estimate if a chiral pair (a set of two enantiomers) might exhibit a chiral cliff in a binding assay. A set of about 3800 chiral pairs extracted from the ChEMBL23 database was used to train and test our models. By achieving an accuracy of up to 75 %, our models provide good performance in discriminating chiral cliffs from non‐cliffs. More importantly, we were able to derive some simple guidelines for when one can reasonably use a racemate and when an enantiopure compound is needed in an assay. We critically discuss our results and show detailed examples of using our guidelines. Along with this publication we provide our dataset, our novel descriptors, and the Python code to rebuild the predictive models.

[1]  Alexander Golbraikh,et al.  Novel Chirality Descriptors Derived from Molecular Topology , 2001, J. Chem. Inf. Comput. Sci..

[2]  J. Ellingboe,et al.  Discovery of 5-arylsulfonamido-3-(pyrrolidin-2-ylmethyl)-1H-indole derivatives as potent, selective 5-HT6 receptor agonists and antagonists. , 2005, Journal of medicinal chemistry.

[3]  Chem. , 2020, Catalysis from A to Z.

[4]  Gordon M. Crippen,et al.  Chirality Descriptors in QSAR , 2008 .

[5]  J. Bajorath,et al.  Chemical Substitutions That Introduce Activity Cliffs Across Different Compound Classes and Biological Targets , 2010, J. Chem. Inf. Model..

[6]  Gilles Moreau,et al.  Atomic Chirality, a Quantitative Measure of the Chirality of the Environment of an Atom , 1997, J. Chem. Inf. Comput. Sci..

[7]  Alberto Del Rio,et al.  Exploring enantioselective molecular recognition mechanisms with chemoinformatic techniques. , 2009 .

[8]  J. A. Grant,et al.  A shape-based 3-D scaffold hopping method and its application to a bacterial protein-protein interaction. , 2005, Journal of medicinal chemistry.

[9]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[10]  George Papadatos,et al.  The ChEMBL database in 2017 , 2016, Nucleic Acids Res..

[11]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  Jürgen Bajorath,et al.  Recent progress in understanding activity cliffs and their utility in medicinal chemistry. , 2014, Journal of medicinal chemistry.

[14]  S. Basak,et al.  Numerical descriptors for the characterization of chiral compounds and their applications in modeling biological and toxicological activities. , 2011, Current topics in medicinal chemistry.

[15]  E. Martin,et al.  Effect of Chirality on Common in Vitro Experiments: An Enantiomeric Pair Analysis. , 2015, Journal of medicinal chemistry.

[16]  Jürgen Bajorath,et al.  Exploring activity cliffs in medicinal chemistry. , 2012, Journal of medicinal chemistry.

[17]  R. Venkataraghavan,et al.  Atom pairs as molecular features in structure-activity studies: definition and applications , 1985, J. Chem. Inf. Comput. Sci..

[18]  Jens Sadowski,et al.  Structure Modification in Chemical Databases , 2005 .

[19]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[20]  Jürgen Bajorath,et al.  Extending the Activity Cliff Concept: Structural Categorization of Activity Cliffs and Systematic Identification of Different Types of Cliffs in the ChEMBL Database , 2012, J. Chem. Inf. Model..

[21]  A. Vulpetti,et al.  The experimental uncertainty of heterogeneous public K(i) data. , 2012, Journal of medicinal chemistry.

[22]  Andy Liaw,et al.  Extreme Gradient Boosting as a Method for Quantitative Structure-Activity Relationships , 2016, J. Chem. Inf. Model..

[23]  Jürgen Bajorath,et al.  MMP-Cliffs: Systematic Identification of Activity Cliffs on the Basis of Matched Molecular Pairs , 2012, J. Chem. Inf. Model..

[24]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[25]  C. Wermuth,et al.  Glossary of terms used in medicinal chemistry (IUPAC Recommendations 1998) , 1998 .

[26]  Jameed Hussain,et al.  Computationally Efficient Algorithm to Identify Matched Molecular Pairs (MMPs) in Large Data Sets , 2010, J. Chem. Inf. Model..

[27]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[28]  Subhash C. Basak,et al.  Novel Approach for the Numerical Characterization of Molecular Chirality , 2007, J. Chem. Inf. Model..

[29]  Jürgen Bajorath,et al.  Systematic Identification and Classification of Three-Dimensional Activity Cliffs , 2012, J. Chem. Inf. Model..

[30]  H. J. Mclaughlin,et al.  Learn , 2002 .

[31]  Alexander Golbraikh,et al.  QSAR Modeling Using Chirality Descriptors Derived from Molecular Topology , 2003, J. Chem. Inf. Comput. Sci..

[32]  Gerald M. Maggiora,et al.  On Outliers and Activity Cliffs-Why QSAR Often Disappoints , 2006, J. Chem. Inf. Model..