Database Mining Using Soft Computing Techniques. An Integrated Neural Network-Fuzzy Logic-Genetic Algorithm Approach

Two different soft computing (SC) techniques (a competitive learning neural network and an integrated neural network-fuzzy logic-genetic algorithm approach) are employed in the analysis of a database subset obtained from the Cambridge Structural Database. The chemical problem chosen for study is relevant to the relationship between various metric parameters in transition metal imido (LnMdNZ, Z = carbon-based substituent) complexes and the chemical consequences of such relationships. The SC techniques confirmed and quantified the suspected relationship between the metal-nitrogen bond length and the metal-nitrogen-substituent bond angle for transition metal imidos: increased metal-nitrogen-carbon angles correlate with shortened metal-nitrogen distances. The mining effort also yielded an unexpected correlation between the NC distance and the MNC angle-shorter NC correlate with larger MNC. A fuzzy inference system is used to construct an MNred-NC-MNC hypersurface. This hypersurface suggests a complicated interdependence among NC, MNred, and the angle subtended by these two bonds. Also, major portions of the hypersurface are very flat, in regions where MNC is approaching linearity. The relationships are also seen to be influenced by whether the imido substituent is an alkyl or aryl group. Computationally, the present results are of particular interest in two respects. First, SC classification was able to isolate an "outlier" cluster. Identification of outliers is important as they may correspond to unreported experimental errors in the database or novel chemical entities, both of which warrant further investigation. Second, the SC database mining not only confirmed and quantified a suspected relationship (MNred versus MNC) within the data but also yielded a trend that was not suspected (NC versus MNC).

[1]  W. M. Davis,et al.  Osmium imido complexes: synthesis, reactivity, and SCF-X.alpha.-SW electronic structure , 1991 .

[2]  Jack D. Dunitz,et al.  From crystal statics to chemical dynamics , 1983 .

[3]  Olga Kennard,et al.  Tables of bond lengths determined by X-ray and neutron diffraction. Part 1. Bond lengths in organic compounds , 1987 .

[4]  P. Dyer,et al.  Novel bis(imido) complexes of molybdenum(VI): precursors to new alkene metathesis catalysts , 1994 .

[5]  D. C. Bradley,et al.  Nitrogen nuclear magnetic resonance spectroscopy as a probe of bonding, bending and fluxionality of the imido ligand , 1992 .

[6]  C. Redshaw,et al.  Bent versus linear imido ligands in five-co-ordinate molybdenum complexes , 1996 .

[7]  G. Parkin,et al.  A survey of terminal chalcogenido complexes of the transition metals: trends in their distribution and the variation of their ME bond lengths , 1997 .

[8]  Malcolm L. H. Green,et al.  Interconversion of oxo and imido ligands at a dimolybdenum centre: molecular and electronic structure of [{Mo(η-C5H4Me)(NPh)(µ-NPh)}2] , 1990 .

[9]  A. Sella,et al.  A steric preference for linear versus bent imido ligation? Synthesis and x-ray crystal structure of [Mo(NAr)2(edtc)2] (Ar = 2,6-iPr2C6H3; edtc = S2CNEt2) containing two linear imido moieties , 1993 .

[10]  Bobby G. Sumpter,et al.  Theory and Applications of Neural Computing in Chemical Science , 1994 .

[11]  D. Braga,et al.  Agostic interactions in organometallic compounds. A Cambridge Structural Database study , 1996 .

[12]  Marco Russo,et al.  FuGeNeSys-a fuzzy genetic neural system for fuzzy modeling , 1998, IEEE Trans. Fuzzy Syst..

[13]  W. Schaefer,et al.  Oxo-hydrido and imido-hydrido derivatives of permethyltantalocene. Structures of (.eta.5-C5Me5)2Ta(:O)H and (.eta.5-C5Me5)2Ta(:NC6H5)H: doubly or triply bonded tantalum oxo and imido ligands? , 1992 .

[14]  D. Rouvray Fuzzy Logic in Chemistry , 1997 .

[15]  Thomas R. Cundari,et al.  Transition Metal Imido Complexes , 1992 .

[16]  A G Orpen,et al.  Structural Systematics. 6.1 Apparent Flexibility of Metal Complexes in Crystals , 1996 .

[17]  Jun Deng,et al.  Structural Analysis of Transition Metal -X Substituent Interactions. Toward the Use of Soft Computing Methods for Catalyst Modeling , 2000, J. Chem. Inf. Comput. Sci..

[18]  B. Haymore,et al.  A BISPHENYLNITRENE COMPLEX OF MOLYBDENUM WITH A BENT NITRENE LIGAND. PREPARATION AND STRUCTURE OF CIS-BIS(PHENYLNITRENE)BIS(DIETHYLDITHIOCARBAMATO)MOLYBDENUM , 1979 .

[19]  Laszlo Zsolnai,et al.  Conformation of tripod Metal Templates in CH3C(CH2PPh2)3MLn (n = 2, 3): Neural Networks in Conformational Analysis† , 1996 .

[20]  H. Robertson,et al.  Molecular structure of tetrakis(tert-butylimido)osmium(VIII), determined in the gas phase by electron diffraction , 1994 .

[21]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences. , 1957 .

[22]  Armando Rocha,et al.  Comparison of Yager's level set method for fuzzy logic control with Mamdani's and Larsen's methods , 1993, IEEE Trans. Fuzzy Syst..

[23]  A. Guy Orpen Structural systematics in molecular inorganic chemistry , 1993 .

[24]  K. A. Joergensen MO explanation of the "unexpected" structure of (.eta.5-C5Me5)2Ta(:NC6H5)H , 1993 .

[25]  Marco Russo,et al.  Genetic fuzzy learning , 2000, IEEE Trans. Evol. Comput..

[26]  Eric N. Jacobsen,et al.  SCHIFF BASE CATALYSTS FOR THE ASYMMETRIC STRECKER REACTION IDENTIFIED AND OPTIMIZED FROM PARALLEL SYNTHETIC LIBRARIES , 1998 .