Predicting Toxicity against the fathead Minnow by Adaptive Fuzzy Partition

Recent progress in the development of powerful tools suitable to design and to classify large chemical libraries can be fruitfully extended also to the ecotoxicity domain. Amidst these methods, Fuzzy Logic concepts, basedon the possibility to handle the "concept of partial truth", constitute interesting approaches to developing general predictive models. In this work, a global strategy of Database Mining was applied on a data set of 568 chemicals, extracted from a toxicity database concerning the fathead minnow and divided into four classes, according to the toxicity ranges defined by the European Community legislation. Two large sets of molecular descriptors were tested on the 2D and 3D structures, and the best ones were selected with help of a procedure combining Genetic Algorithm concepts and stepwise method. After selecting the training set with a rational selection based on the Self Organizing Maps (SOM), structural-activity models were built by Adaptive Fuzzy Partition (AFP). This method consists in modeling relations between molecular descriptors and biological activities, by dynamically dividing the molecular descriptor hyperspace into a set of fuzzy subspaces. The best model was selected by a validation set, and its robustness was confirmed by predicting a test set of 80 chemicals never used to define the AFP models. An encouraging validation ratio of about 72% was obtained in the prediction of the experimental toxicity class. Furthermore, very similar results were obtained by using molecular descriptors computed on 2D or 3D structures.

[1]  César Hervás-Martínez,et al.  Computational Neural Networks in Conjunction with Principal Component Analysis for Resolving Highly Nonlinear Kinetics , 1997, J. Chem. Inf. Comput. Sci..

[2]  Lemont B. Kier,et al.  Molecular structure description , 1999 .

[3]  J Devillers,et al.  QSAR Modeling of Large Heterogeneous Sets of Molecules , 2001, SAR and QSAR in environmental research.

[4]  R. Leardi,et al.  Genetic algorithms applied to feature selection in PLS regression: how and when to use them , 1998 .

[5]  Willie J.G.M. Peijnenburg,et al.  QSARs for oxidation of phenols in the aqueous environment, suitable for risk assessment , 1996 .

[6]  M. Gupta,et al.  Theory of T -norms and fuzzy inference methods , 1991 .

[7]  J Devillers A General QSAR Model for Predicting the Acute Toxicity of Pesticides to Lepomis Macrochirus , 2001, SAR and QSAR in environmental research.

[8]  Marco Pintore,et al.  Adaptive fuzzy partition in database mining: application to olfaction , 2002, Data Sci. J..

[9]  G. Rabatel,et al.  Recognition of overlapping particles in granular product images using statistics and neural networks , 1995 .

[10]  M. Pintore,et al.  Molecular descriptor selection combining genetic algorithms and fuzzy logic: application to database mining procedures , 2002 .

[11]  F Ros,et al.  Database mining applied to central nervous system (CNS) activity. , 2001, European journal of medicinal chemistry.

[12]  Yinghua Lin,et al.  Building a Fuzzy System from Input-Output Data , 1994, J. Intell. Fuzzy Syst..

[13]  J. Hermens,et al.  Classifying environmental pollutants. 2: Separation of class 1 (baseline toxicity) and class 2 (‘polar narcosis’) type compounds based on chemical descriptors , 1996 .

[14]  Giuseppina C. Gini,et al.  Predictive Carcinogenicity: A Model for Aromatic Compounds, with Nitrogen-Containing Substituents, Based on Molecular Descriptors Using an Artificial Neural Network , 1999, J. Chem. Inf. Comput. Sci..

[15]  C. Russom,et al.  Predicting modes of toxic action from chemical structure: Acute toxicity in the fathead minnow (Pimephales promelas) , 1997 .

[16]  E Benfenati,et al.  Factors Influencing Predictive Models for Toxicology , 2001, SAR and QSAR in environmental research.

[17]  Michio Sugeno,et al.  A fuzzy-logic-based approach to qualitative modeling , 1993, IEEE Trans. Fuzzy Syst..