Implementation and use of an atom-mapping procedure for similarity searching in databases of 3-D chemical structures

Abstract This paper discusses a procedure for calculating the degree of similarity between an input target molecule and each of the molecules in a database of 3-D chemical structures. A mapping algorithm is used to identify pairs of atoms, one from each of the molecules that are being compared, that lie at the centre of geometrically-related volumes of 3-D space. Techniques are described to increase the run-time efficiency of the basic procedure, so that it can be used for similarity searching in large 3-D databases, and to allow a wide range of types of inter-molecular similarity to be calculated.

[1]  Catherine Burt,et al.  The application of molecular similarity calculations , 1990 .

[2]  Peter Willett,et al.  Development of an atom mapping procedure for similarity searching in databases of three‐dimensional chemical structures , 1991 .

[3]  Peter Willett,et al.  Implementation of nearest-neighbor searching in an online chemical structure search system , 1986, J. Chem. Inf. Comput. Sci..

[4]  Johnz Willett Similarity and Clustering in Chemical Information Systems , 1987 .

[5]  Peter Willett,et al.  Three-dimensional chemical structure handling , 1991 .

[6]  Yvonne C. Martin,et al.  ALADDIN: An integrated tool for computer-assisted molecular design and pharmacophore recognition from geometric, steric, and substructure searching of three-dimensional molecular structures , 1989, J. Comput. Aided Mol. Des..

[7]  A. Ghose,et al.  Atomic Physicochemical Parameters for Three‐Dimensional Structure‐Directed Quantitative Structure‐Activity Relationships I. Partition Coefficients as a Measure of Hydrophobicity , 1986 .

[8]  N. W. Murrall,et al.  Conformational freedom in 3-D databases. 1. Techniques , 1990, J. Chem. Inf. Comput. Sci..

[9]  Dennis H. Smith,et al.  Computer-assisted examination of compounds for common three-dimensional substructures , 1983, Journal of chemical information and computer sciences.

[10]  Peter Willett,et al.  Processing of chemical structure databases using parallel computer hardware , 1989 .

[11]  James J. P. Stewart,et al.  MOPAC: A semiempirical molecular orbital program , 1990, J. Comput. Aided Mol. Des..

[12]  Osman F. Güner,et al.  An integrated approach to three-dimensional information management with MACCS-3D , 1991, J. Chem. Inf. Comput. Sci..

[13]  James G. Nourse,et al.  Similarity searching in the Organic reaction domain , 1988 .

[14]  Peter Willett,et al.  Techniques for the calculation of three-dimensional structural similarity using inter-atomic distances , 1991, J. Comput. Aided Mol. Des..

[15]  Peter Willett,et al.  Upperbound procedures for the identification of similar three-dimensional chemical structures , 1989, J. Comput. Aided Mol. Des..