Molecular Similarity and Risk Assessment: Analog Selection and Property Estimation Using Graph Invariants

Abstract Four molecular similarity measures have been used to select the nearest neighbor of chemicals in two data sets of 139 hydrocarbons and 15 nitrosamines, respectively. The similarity methods are based on calculated graph invariants which include atom pairs, connectivity indices and information theoretic topological indices. The property of the selected nearest neighbor by each method was taken as the estimate of the property under investigation. The results show that for these data sets, all four methods give reasonable estimates of the properties studied.

[1]  Gerald J. Niemi,et al.  Predicting properties of molecules using graph invariants , 1991 .

[2]  Frank Harary,et al.  Graph Theory , 2016 .

[3]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[4]  H S Rosenkranz,et al.  Structural basis of the genotoxicity of nitrosatable phenols and derivatives present in smoked food products. , 1990, Mutation research.

[5]  S C Basak,et al.  Comparative study of lipophilicity versus topological molecular descriptors in biological correlations. , 1984, Journal of pharmaceutical sciences.

[6]  Gerald J. Niemi,et al.  Prediction of octanol/water partition coefficient ( K OW ) with algorithmically derived variables , 1992 .

[7]  C. Hansch,et al.  On the structure of medicinal chemistry. , 1976, Journal of medicinal chemistry.

[8]  Subhash C. Basak,et al.  NEIGHBORHOOD COMPLEXITIES AND SYMMETRY OF CHEMICAL GRAPHS AND THEIR BIOLOGICAL APPLICATIONS , 1984 .

[9]  J C Arcos ES Views: Structure-activity relationships. , 1987, Environmental science & technology.

[10]  B. Ames,et al.  Detection of carcinogens as mutagens in the Salmonella/microsome test: assay of 300 chemicals. , 1975, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Subhash C. Basak Binding of barbiturates to cytochrome P450: A QSAR study using log P and topological indices , 1988 .

[12]  Subhash C. Basak,et al.  A characterization of molecular similarity methods for property prediction , 1988 .

[13]  E. C. Pielou The Interpretation of Ecological Data: A Primer on Classification and Ordination , 1984 .

[14]  N. Rashevsky Life, information theory, and topology , 1955 .

[15]  Vijay K. Gombar,et al.  Quantitative Structure‐Activity Relationship (QSAR) Studies Using Electronic Descriptors Calculated from Topological and Molecular Orbital (MO) Methods , 1990 .

[16]  Gerald J. Niemi,et al.  Optimal characterization of structure for prediction of properties , 1990 .

[17]  P. Seybold,et al.  Molecular modeling of the physical properties of the alkanes , 1988 .

[18]  N. Trinajstic,et al.  Information theory, distance matrix, and molecular branching , 1977 .

[19]  R. Venkataraghavan,et al.  Atom pairs as molecular features in structure-activity studies: definition and applications , 1985, J. Chem. Inf. Comput. Sci..

[20]  P. Willett,et al.  A Comparison of Some Measures for the Determination of Inter‐Molecular Structural Similarity Measures of Inter‐Molecular Structural Similarity , 1986 .

[21]  Nenad Trinajstić,et al.  Chemical graph theory: Modeling the thermodynamic properties of molecules , 1980 .

[22]  C. Raychaudhury,et al.  Discrimination of isomeric structures using information theoretic topological indices , 1984 .

[23]  J V Nabholz,et al.  Mode of action and the assessment of chemical hazards in the presence of limited data: use of structure-activity relationships (SAR) under TSCA, Section 5. , 1990, Environmental health perspectives.

[24]  R. Taft,et al.  Linear solvation energy relationships: 36. Molecular properties governing solubilities of organic nonelectrolytes in water. , 1986, Journal of pharmaceutical sciences.

[25]  M. Randic Characterization of molecular branching , 1975 .

[26]  S. Unger Molecular Connectivity in Structure–activity Analysis , 1987 .

[27]  Milan Randić,et al.  A graph theoretical approach to structure-property and structure-activity correlations , 1980 .

[28]  N. Trinajstic Chemical Graph Theory , 1992 .

[29]  H. Wiener Structural determination of paraffin boiling points. , 1947, Journal of the American Chemical Society.

[30]  M. S. Lajiness,et al.  Molecular similarity-based methods for selecting compounds for screening , 1990 .

[31]  Subhash C. Basak,et al.  Determining structural similarity of chemicals using graph-theoretic indices , 1988, Discret. Appl. Math..

[32]  Subhash C. Basak,et al.  A quantitative correlation of the LC50 values of esters in pimephales promelas using physicochemical and topological parameters , 1984 .

[33]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[34]  A graph-theoretic approach to predicting molecular properties , 1990 .

[35]  William Fisanick,et al.  Similarity searching on CAS Registry substances. 1. Global molecular property and generic atom triangle geometric searching , 1992, J. Chem. Inf. Comput. Sci..

[36]  A. B. Roy,et al.  Topological information content of genetic molecules—I. , 1978 .