MCSS: a new algorithm for perception of maximal common substructures and its application to NMR spectral studies. 1. The algorithm

A new algorithm has been developed for structure isomorphism, substructure searching, and maximal common substructure (MCSS) detection. It can perceive topological or colored maximal common substructures of two given structures, including structures that consist of two or more disconnected parts. The new strategies used in the algorithm are discussed in detail. The program was tested on a set of carefully selected structure pairs and then on a database containing about 80 000 structures. These tests showed the algorithm to be efficient and flexible. The MCSS algorithm has been implemented in the CSEARCH-NMR database system, allowing automatic increment analysis and error detection.
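To make the distinction between topological and colored common substructures concrete, the following is a minimal sketch of an exhaustive backtracking search on tiny graphs. It is not the algorithm of the paper, whose strategies are not given in the abstract. The names `make_graph` and `max_common_substructure` are hypothetical. The sketch also assumes that a common substructure means a common induced subgraph, that size is counted in atoms, and that "colored" means atom labels must agree. Disconnected results are allowed.

```python
from typing import Dict, List, Set, Tuple

# A graph is (atom labels, bonds), each bond an unordered pair of atom indices.
Graph = Tuple[List[str], Set[frozenset]]


def make_graph(labels, bonds) -> Graph:
    """Build a graph from a label sequence and an iterable of index pairs."""
    return list(labels), {frozenset(b) for b in bonds}


def max_common_substructure(g1: Graph, g2: Graph, colored: bool = True) -> Dict[int, int]:
    """Return a largest atom mapping g1 -> g2 that preserves bonding.

    Illustrative brute-force search (exponential time); small inputs only.
    With colored=False, atom labels are ignored (topology only).
    """
    labels1, bonds1 = g1
    labels2, bonds2 = g2
    best: Dict[int, int] = {}

    def compatible(a: int, b: int, mapping: Dict[int, int]) -> bool:
        if colored and labels1[a] != labels2[b]:
            return False
        # Bonded/non-bonded status must agree with every pair already mapped.
        for x, y in mapping.items():
            if (frozenset((a, x)) in bonds1) != (frozenset((b, y)) in bonds2):
                return False
        return True

    def extend(i: int, mapping: Dict[int, int], used: Set[int]) -> None:
        nonlocal best
        # Prune: mapping every remaining atom still could not beat the best.
        if len(mapping) + (len(labels1) - i) <= len(best):
            return
        if i == len(labels1):
            best = dict(mapping)  # strictly larger, thanks to the prune above
            return
        for b in range(len(labels2)):
            if b not in used and compatible(i, b, mapping):
                mapping[i] = b
                used.add(b)
                extend(i + 1, mapping, used)
                del mapping[i]
                used.remove(b)
        extend(i + 1, mapping, used)  # also try leaving atom i unmatched

    extend(0, {}, set())
    return best


# Heavy-atom skeletons: ethanol C-C-O and dimethyl ether C-O-C.
ethanol = make_graph(["C", "C", "O"], [(0, 1), (1, 2)])
ether = make_graph(["C", "O", "C"], [(0, 1), (1, 2)])

colored_match = max_common_substructure(ethanol, ether, colored=True)
topological_match = max_common_substructure(ethanol, ether, colored=False)
print(len(colored_match), len(topological_match))
```

On this pair the colored search can match only a C-O fragment, while the topological search matches all three atoms, since both skeletons are three-atom chains. A search of this kind does not scale to a database of tens of thousands of structures, which is the setting where a more efficient algorithm matters.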
