QSAR Modeling of Datasets with Enantioselective Compounds using Chirality Sensitive Molecular Descriptors

Shape descriptors used in 3D QSAR studies naturally take into account chirality; however, for flexible and structurally diverse molecules such studies require extensive conformational searching and alignment. QSAR modeling studies of two datasets of fragrance compounds with complex stereochemistry using simple alignment-free chirality sensitive descriptors developed in our laboratories are presented. In the first investigation, 44 α-campholenic derivatives with sandalwood odor were represented as derivatives of several common structural templates  with substituents numbered according to their relative spatial positions in the molecules. Both molecular and substituent descriptors were used as independent variables in MLR calculations, and the best model was characterized by the training set q 2 of 0.79 and external test set r 2 of 0.95. In the second study, several types of chirality descriptors were employed in combinatorial QSAR modeling of 98 ambergris fragrance compounds. Among 28 possible combinations of seven types of descriptors and four statistical modeling techniques, k nearest neighbor classification with CoMFA descriptors was initially found to generate the best models with the internal and external accuracies of 76 and 89%, respectively. The same dataset was then studied using novel atom pair chirality descriptors (cAP). The cAP are based on a modified definition of the atomic chirality, in which the seniority of the substituents is defined by their relative partial charge values: higher values correspond to higher seniorities. The resulting models were found to have higher predictive power than those developed with CoMFA descriptors; the best model was characterized by the internal and external accuracies of 82 and 94%, respectively. The success of modeling studies using simple alignment free chirality descriptors discussed in this paper suggests that they should be applied broadly to QSAR studies of many datasets when compound stereochemistry plays an important role in defining their activity.

[1]  M Chastrette,et al.  Trends in structure-odor relationships. , 1997, SAR and QSAR in environmental research.

[2]  Peter Wolschann,et al.  Conformational Parameters of the Sandalwood‐Odor Activity: Conformational calculations on sandalwood odor , 1994 .

[3]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[4]  Karen J. Rossiter,et al.  Structure−Odor Relationships , 1996 .

[5]  R. Cramer,et al.  Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. , 1988, Journal of the American Chemical Society.

[6]  Alexander Golbraikh,et al.  Quantitative Structure−Activity Relationship Analysis of Functionalized Amino Acid Anticonvulsant Agents Using k Nearest Neighbor and Simulated Annealing PLS Methods , 2002 .

[7]  Alexander Golbraikh,et al.  QSAR Modeling of alpha-Campholenic Derivatives with Sandalwood Odor , 2003, J. Chem. Inf. Comput. Sci..

[8]  Paola Gramatica,et al.  The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models , 2003 .

[9]  Peter Wolschann,et al.  Synthesis and olfactoric activity of keto-β-santalol and methoxy-β-santalol , 1998 .

[10]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[11]  Alexander Golbraikh,et al.  Novel ZE-Isomerism Descriptors Derived from Molecular Topology and Their Application to QSAR Analysis , 2002, J. Chem. Inf. Comput. Sci..

[12]  H. Kubinyi Comparative Molecular Field Analysis (CoMFA) , 2002 .

[13]  Alexander Golbraikh,et al.  Combinatorial QSAR of Ambergris Fragrance Compounds , 2004, J. Chem. Inf. Model..

[14]  Alexander Golbraikh,et al.  Predictive QSAR modeling based on diversity sampling of experimental datasets for the training and test set selection , 2004, Molecular Diversity.

[15]  Alexander Golbraikh,et al.  QSAR Modeling of α-Campholenic Derivatives with Sandalwood Odor. , 2003 .

[16]  P. Carrupt,et al.  Molecular fields in quantitative structure–permeation relationships: the VolSurf approach , 2000 .

[17]  R. Venkataraghavan,et al.  Atom pairs as molecular features in structure-activity studies: definition and applications , 1985, J. Chem. Inf. Comput. Sci..

[18]  B D Silverman,et al.  Comparative molecular moment analysis (CoMMA): 3D-QSAR without molecular superposition. , 1996, Journal of medicinal chemistry.

[19]  Alexander Golbraikh,et al.  QSAR Modeling Using Chirality Descriptors Derived from Molecular Topology , 2003, J. Chem. Inf. Comput. Sci..

[20]  Alexander Golbraikh,et al.  Novel Chirality Descriptors Derived from Molecular Topology. , 2001 .

[21]  G. Ohloff,et al.  Structure-Odor Relationships , 1994 .