Algorithms for the identification of three-dimensional maximal common substructures

Two algorithms are described for the identification of the maximal substructures common to two (or more) three-dimensional chemical structures, where a substructure consists of a set of atoms and the associated interatomic distances. The algorithm of Crandell and Smith involves a breadth-first tree search procedure in which substructures are expanded as they are shown to be common to all of the molecules under consideration. The clique-detection algorithm involves the identification of cliques in the correspondence graph linking matching atoms and interatomic distances in pairs of structures that are being compared. This second algorithm is shown to be substantially faster in operation than the Crandell and Smith algorithm when applied to structures taken from the Cambridge Crystallographic Data Bank, and an extension to the algorithm is described that allows it to be used for the identification of the maximal substructure common to arbitrary numbers of molecules.

[1]  G. Levi A note on the derivation of maximal common subgraphs of two directed or undirected graphs , 1973 .

[2]  P. Willett,et al.  Pharmacophoric pattern matching in files of 3d chemical structures: comparison of geometric searching algorithms , 1987 .

[3]  Dennis H. Smith,et al.  Computer-assisted examination of compounds for common three-dimensional substructures , 1983, Journal of chemical information and computer sciences.

[4]  Peter Willett,et al.  Pharmacophoric pattern matching in files of 3-D chemical structures: election of interatomic distance screens , 1986 .

[5]  A. Ghose,et al.  Geometrically feasible binding modes of a flexible ligand molecule at the receptor site , 1985 .

[6]  E. Loukakis A new backtracking algorithm for generating the family of maximal independent sets of a graph , 1983 .

[7]  T. Esaki,et al.  Quantitative drug design studies. V. Approach to lead generation by pharmacophoric pattern searching. , 1982, Chemical & pharmaceutical bulletin.

[8]  Valeriĭ Efimovich Golender,et al.  Logical and combinatorial algorithms for drug design , 1983 .

[9]  Michael M. Cone,et al.  Molecular structure comparison program for the identification of maximal common substructures , 1977 .

[10]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[11]  Harry G. Barrow,et al.  Subgraph Isomorphism, Matching Relational Structures and Maximal Cliques , 1976, Inf. Process. Lett..

[12]  Peter Willett,et al.  Use of a maximum common subgraph algorithm in the automatic identification of ostensible bond changes occurring in chemical reactions , 1981, J. Chem. Inf. Comput. Sci..

[13]  P. Gund Three-Dimensional Pharmacophoric Pattern Searching , 1977 .

[14]  Andrew Timothy Brint Matching algorithms for handling three dimensional molecular co-ordinate data. , 1988 .

[15]  David Bawden,et al.  Pharmacophoric pattern matching in files of 3d chemical structures: evaluation of search performance , 1987 .

[16]  Yuzuru Fujiwara,et al.  A method for the machine detection of near equivalence of major substructures in a molecule , 1986, Journal of computational chemistry.

[17]  Arthur M. Lesk,et al.  Detection of three-dimensional patterns of atoms in chemical structures , 1979, CACM.

[18]  Rainer Franke,et al.  Topological Pharmacophores New Methods and Their Application to a Set of Antimalarials Part 1: The Methods LOGANA and LOCON , 1985 .

[19]  D. K. Friesen,et al.  A combinatorial algorithm for calculating ligand binding , 1984 .

[20]  A. Cammarata,et al.  Pattern recognition. Classification of therapeutic agents according to pharmacophores. , 1976, Journal of medicinal chemistry.

[21]  Peter Gund,et al.  Chapter 29 Pharmacophoric Pattern Searching and Receptor Mapping , 1979 .

[22]  Johnz Willett Similarity and Clustering in Chemical Information Systems , 1987 .

[23]  H. Barrow,et al.  Computational vision , 1981, Proceedings of the IEEE.

[24]  J. J. McGregor,et al.  Backtrack search algorithms and the maximal common subgraph problem , 1982, Softw. Pract. Exp..