Non‐linear mapping for structure‐activity and structure‐property modelling

From a review of the theoretical aspects of non‐linear mapping and the different algorithms available in the literature, the methodological and practical problems linked to the use of this multivariate method in structure‐activity and structure‐property relationship studies are underlined. Useful tools for the graphical display of the outputs and the interpretation of the obtained clusters are presented. Statistical parameters estimating the quality of the graphical representation of each individual are also introduced. An example of application on a data matrix of 37 acaricides described by four physicochemical descriptors (π, F, R, MR) is presented.

[1]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[2]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[3]  J. Kruskal Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis , 1964 .

[4]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[5]  Keinosuke Fukunaga,et al.  An Algorithm for Finding Intrinsic Dimensionality of Data , 1971, IEEE Transactions on Computers.

[6]  Keinosuke Fukunaga,et al.  A Two-Dimensional Display for the Classification of Multivariate Data , 1971, IEEE Transactions on Computers.

[7]  B. Kowalski,et al.  Pattern recognition. Powerful approach to interpreting chemical data , 1972 .

[8]  I. White Comment on "A Nonlinear Mapping for Data Structure Analysis" , 1972, IEEE Trans. Computers.

[9]  Keinosuke Fukunaga,et al.  Introduction to Statistical Pattern Recognition , 1972 .

[10]  Richard J. Howarth,et al.  Preliminary assessment of a nonlinear mapping algorithm in a geological context , 1973 .

[11]  B. Kowalski,et al.  Pattern recognition. II. Linear and nonlinear methods for displaying chemical data , 1973 .

[12]  Harry C. Andrews,et al.  Nonlinear Intrinsic Dimensionality Computations , 1974, IEEE Transactions on Computers.

[13]  H Abe,et al.  Applications of computerized pattern recognition: a survey of correlations between pharmacological activities and mass spectra. , 1976, Biomedical mass spectrometry.

[14]  Richard C. T. Lee,et al.  A Triangulation Method for the Sequential Mapping of Points from N-Space to Two-Space , 1977, IEEE Transactions on Computers.

[15]  Ramanathan Gnanadesikan,et al.  Methods for statistical data analysis of multivariate observations , 1977, A Wiley publication in applied statistics.

[16]  Chi-Hsiung. Lin,et al.  Representation-space transformation for the display of multivariate chemical information , 1977 .

[17]  Peter C. Wang,et al.  Graphical Representation of Multivariate Data , 1978 .

[18]  Brian Everitt,et al.  Graphical Techniques for Multivariate Data. , 1978 .

[19]  Ability of the representation space transformation to preserve data structure. Comment , 1978 .

[20]  S. Perone,et al.  Computerized Pattern Recognition Applied to Gas Chromatography/Mass Spectrometry Identification of Pentafluoropropionyl Dipeptide Methyl Esters , 1979 .

[21]  F. W. Pijpers,et al.  Qualitative classification of dithiocarbamate compounds from 13c-n.m.r. and i.r. spectroscopic data by pattern recognition techniques , 1979 .

[22]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[23]  Yoshimasa Takahashi,et al.  A structure-biological activity study based on cluster analysis and the nonlinear mapping method of pattern recognition , 1980 .

[24]  K. Varmuza Pattern recognition in analytical chemistry , 1980 .

[25]  B. Kowalski,et al.  A combined linear and nonlinear factor analysis program package for chemical data evaluation , 1981 .

[26]  H. Meuzelaar,et al.  Pyrolysis mass spectrometry: a new method to differentiate between the mycobacteria of the 'tuberculosis complex' and other mycobacteria. , 1981, Journal of general microbiology.

[27]  A. D. Gordon,et al.  Interpreting multivariate data , 1982 .

[28]  Desire L. Massart,et al.  The Interpretation of Analytical Chemical Data by the Use of Cluster Analysis , 1983 .

[29]  Jildau Bouwman,et al.  Evaluation of field-desorption and fast atom-bombardment mass spectrometric profiles by pattern recognition techniques , 1983 .

[30]  Effect of layer and eluent characteristics on the reversed-phase thin-layer chromatographic behaviour of some barbituric acid derivatives , 1984 .

[31]  D. Bawden Disclose: an integrated set of multivariate display procedures for chemical and pharmaceutical data , 1984 .

[32]  T. Cserháti,et al.  Phospholipid - Amino Acid Interactions , 1987 .

[33]  David J. Livingstone,et al.  Perspectives in QSAR: Computer chemistry and pattern recognition , 1988, J. Comput. Aided Mol. Des..

[34]  E. Rahr,et al.  Physicochemical characteristics of non-electrolytes and their uptake by Brugia pahangi and Dipetalonema viteae. , 1988, Molecular and biochemical parasitology.

[35]  Jack Sklansky,et al.  An overview of mapping techniques for exploratory pattern analysis , 1988, Pattern Recognit..

[36]  Roser Rubio,et al.  Cluster analysis as a tool in the study of groundwater quality , 1988 .

[37]  Brian D. Hudson,et al.  Pattern recognition display methods for the analysis of computed molecular properties , 1989, J. Comput. Aided Mol. Des..

[38]  D. J. Livingstone,et al.  Multivariate quantitative structure‐activity relationship (QSAR) methods which may be applied to pesticide research , 1989 .

[39]  Ian D. Wilson,et al.  HIGH RESOLUTION PROTON MAGNETIC RESONANCE SPECTROSCOPY OF BIOLOGICAL FLUIDS , 1989 .

[40]  Richard C. Dubes,et al.  Experiments in projection and clustering by simulated annealing , 1989, Pattern Recognit..

[41]  René Henrion,et al.  Three-way principal components analysis for multivariate evaluation of round robin tests , 1990 .

[42]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[43]  K. Valko,et al.  Application of chromatographic retention data in an investigation of a quantitative structure-nucleotide incorporation rate relationship , 1990 .

[44]  R. M. Hyde,et al.  U. K. usage of chemometrics and artificial intelligence in QSAR analysis , 1990 .

[45]  Martyn G. Ford,et al.  Multivariate Techniques for Parameter Selection and Data Analysis Exemplified by a Study of Pyrethroid Neurotoxicity , 1990 .

[46]  B R Xiang,et al.  Application of pyrolysis-high-resolution gas chromatography-pattern recognition to the identification of the Chinese traditional medicine mai dong. , 1990, Journal of chromatography.

[47]  M. Chastrette,et al.  Analysis of a system of description of odors by means of four different multivariate statistical methods , 1991 .

[48]  J. Devillers,et al.  Graphical display of the fugacity model level I , 1991 .

[49]  T. Cserháti,et al.  Differential scanning calorimetry to study the possible ternary complex formation between chlorhexidine, phosphatidylcholine and some nonionic tenzides. , 1991, Journal of biochemical and biophysical methods.

[50]  Richard C. Moore,et al.  Quantitative structure-activity relationships in acaricidal 4H-1,3,4-oxadiazin-5(6H)-ones , 1991 .

[51]  Jean Thioulouse,et al.  Graphical Techniques for Multidimensional Data Analysis , 1991 .

[52]  J. Devillers,et al.  Multivariate Analysis of the Input and Output Data in the Fugacity Model Level I , 1991 .

[53]  M. Chastrette,et al.  Multilayer Neural Networks Applied to Structure-Activity Relationships , 1991 .

[54]  D J Livingstone,et al.  Novel method for the display of multivariate data using neural networks. , 1991, Journal of molecular graphics.

[55]  H. Macfie,et al.  An application of unsupervised neural network methodology Kohonen topology-Preserving mapping) to QSAR analysis , 1991 .

[56]  J C Lindon,et al.  Application of pattern recognition methods to the analysis and classification of toxicological data derived from proton nuclear magnetic resonance spectroscopy of urine. , 1991, Molecular pharmacology.

[57]  T. Cserháti,et al.  Comparison of two principal component analysis methods to evaluate reversed-phase retention data. , 1991, Journal of pharmaceutical and biomedical analysis.

[58]  Mayer Aladjem Parametric and nonparametric linear mappings of multidimensional data , 1991, Pattern Recognit..

[59]  J. Devillers,et al.  Applied multivariate analysis in SAR and environmental studies , 1991 .

[60]  David J. Livingstone,et al.  Investigation of a charge-transfer substituent constant using computational chemistry and pattern recognition techniques , 1992 .

[61]  J. Devillers,et al.  Multivariate structure‐property relationships (MSPR) of pesticides , 1992 .

[62]  J. Devillers,et al.  Multivariate structure-environmental fate relationships for chlorinated chemicals , 1992 .