Chemical pattern recognition and multivariate analysis for QSAR studies

Abstract Chemical pattern recognition (CPR) and quantitative structure-activity relationships (QSAR) studies based on multivariate analysis and chemometric techniques are reviewed. In particular, applications of the SIMCA classification method to structure-taste problems are discussed. Cluster significance analysis (CSA) is compared with modelling powers for feature selection of asymmetric data sets. A concentric hypersphere model is used to predict candidate new sweeteners. Partial least squares (PLS) modelling methods are employed to antiarrhythmic data of phenylpyridines and fungicidal and herbicidal data of thiocarbamates, respectively. The CoMFA approach to 3-dimensional QSAR using PLS modelling is described as well. In practice, QSAR is an important branch of chemometrics and enhances rational drug design and new agent development. The chemometric techniques described in the article not only work well for QSAR but also are very helpful for solving the problems related to analytical characteristics-chemical structure relationships.

[1]  D. E. Patterson,et al.  Crossvalidation, Bootstrapping, and Partial Least Squares Compared with Multiple Regression in Conventional QSAR Studies , 1988 .

[2]  B. Skagerberg,et al.  PLS response surface optimization: The CARSO procedure , 1989 .

[3]  C. Hansch Quantitative approach to biochemical structure-activity relationships , 1969 .

[4]  Donald R. Scott,et al.  Preprocessing, variable selection, and classification rules in the application of SIMCA pattern recognition to mass-spectral data , 1989 .

[5]  William J. Dunn,et al.  Quantitative structure—activity relationships (QSAR) , 1989 .

[6]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[7]  D J Gans,et al.  On the significance of clusters in the graphical display of structure-activity data. , 1986, Journal of medicinal chemistry.

[8]  R. Cramer,et al.  Comparative molecular field analysis (CoMFA). 1. Effect of shape on binding of steroids to carrier proteins. , 1988, Journal of the American Chemical Society.

[9]  Louis Hodes,et al.  Selection of molecular fragment features for structure-activity studies in antitumor screening , 1981, J. Chem. Inf. Comput. Sci..

[10]  S. Wold,et al.  Application of simca multivariate data analysis to the classification of gas chromatographic profiles of human brain tissues , 1981 .

[11]  André Michel,et al.  A proposal for the molecular basis of μ and δ opiate receptor differentiation based on modeling of two types of cyclic enkephalins and a narcotic alkaloid , 1991, J. Comput. Aided Mol. Des..

[12]  S. Wold,et al.  Nonlinear PLS modeling , 1989 .

[13]  Matthew Clark,et al.  Comparative molecular field analysis (CoMFA). 2. Toward its use with 3D-structural databases , 1990 .

[14]  S. Wold,et al.  Use of chemometrics in environmental toxicology and structure-activity relationships , 1986 .

[15]  S. Sasaki,et al.  Structure-taste correlation of L-aspartyl dipeptides using SIMCA method. , 1986, Journal of medicinal chemistry.

[16]  Kimito Funatsu,et al.  Computer-assisted structure/taste studies on sulfamates by pattern recognition methods , 1986 .

[17]  R D Cramer,et al.  Three-dimensional structure-activity relationships. , 1988, Trends in pharmacological sciences.

[18]  L. Hodes,et al.  A statistical-heuristic methods for automated selection of drugs for screening. , 1977, Journal of medicinal chemistry.

[19]  Shin-ichi Sasaki,et al.  Quantitative structure—activity relationship study of antiarrhythmic phenylpyridines using multivariate partial least squares modelling , 1992 .

[20]  C. Hansch,et al.  p-σ-π Analysis. A Method for the Correlation of Biological Activity and Chemical Structure , 1964 .

[21]  James W. McFarland,et al.  Cluster Significance Analysis: A New Qsar Tool for Asymmetric Data Sets , 1990 .

[22]  D J Gans,et al.  Cluster significance analysis contrasted with three other quantitative structure-activity relationship methods. , 1987, Journal of medicinal chemistry.

[23]  S Wold,et al.  A structure-carcinogenicity study of 4-nitroquinoline 1-oxides using the SIMCA method of pattern recognition. , 1978, Journal of medicinal chemistry.