Progress and perspectives of quantitative structure-activity relationships used for ecological risk assessment of toxic organic compounds

Structure-activity relationships (SARs) and quantitative structure-activity relationships (QSARs), collectively referred to as (Q)SARs, play an important role in the ecological risk assessment (ERA) of organic chemicals. (Q)SARs can fill data gaps for the physicochemical, environmental-behavior and ecotoxicological parameters of organic compounds; they can lower experimental costs and reduce the amount of experimental testing, especially animal testing; and they can be used to assess the uncertainty of experimental data. After several decades of development, (Q)SARs in the environmental sciences show three features: application orientation, multidisciplinary integration, and intelligence. This review covers progress in (Q)SAR technology for the ERA of toxic organic compounds, including endpoint selection and mathematical methods for establishing simple, transparent, easily interpretable and portable (Q)SAR models. Recent developments in defining applicability domains and diagnosing outliers are summarized. Model characterization with respect to goodness-of-fit, stability and predictive power receives particular attention. The purpose of the review is to promote the development of (Q)SARs oriented toward the ERA of organic chemicals.
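As an illustration of the three model-characterization criteria named above, the following minimal sketch computes one statistic commonly used for each: R² for goodness-of-fit, leave-one-out Q² for stability, and an external Q² for predictive power. It assumes a one-descriptor linear model; the function names and the descriptor and endpoint values are hypothetical and are not taken from the review, which may use different statistics.

```python
from statistics import mean


def fit_line(x, y):
    """Ordinary least-squares fit of y = a + b*x for a single descriptor."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = my - b * mx
    return a, b


def r_squared(x, y):
    """Goodness-of-fit: 1 - SS_res / SS_tot on the training data."""
    a, b = fit_line(x, y)
    my = mean(y)
    ss_res = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - my) ** 2 for yi in y)
    return 1 - ss_res / ss_tot


def q_squared_loo(x, y):
    """Stability: leave-one-out cross-validated Q2 = 1 - PRESS / SS_tot."""
    my = mean(y)
    press = 0.0
    for i in range(len(x)):
        # Refit the model without compound i, then predict compound i.
        a, b = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        press += (y[i] - (a + b * x[i])) ** 2
    ss_tot = sum((yi - my) ** 2 for yi in y)
    return 1 - press / ss_tot


def q_squared_ext(x_train, y_train, x_test, y_test):
    """Predictive power: Q2 on compounds never used for fitting.

    The reference sum of squares uses the training-set mean, which is
    one of several conventions in use.
    """
    a, b = fit_line(x_train, y_train)
    my_train = mean(y_train)
    press = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x_test, y_test))
    ss_ref = sum((yi - my_train) ** 2 for yi in y_test)
    return 1 - press / ss_ref


# Hypothetical data: a hydrophobicity descriptor (x) and a toxicity endpoint (y).
X_TRAIN = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
Y_TRAIN = [0.2, 0.9, 2.1, 2.8, 4.1, 4.9]
X_TEST = [2.5, 4.5]
Y_TEST = [1.4, 3.6]

if __name__ == "__main__":
    print("R2      =", round(r_squared(X_TRAIN, Y_TRAIN), 3))
    print("Q2(LOO) =", round(q_squared_loo(X_TRAIN, Y_TRAIN), 3))
    print("Q2(ext) =", round(q_squared_ext(X_TRAIN, Y_TRAIN, X_TEST, Y_TEST), 3))
```

For a least-squares model, the leave-one-out Q² can never exceed R², so a large gap between the two suggests that the fit depends heavily on individual compounds.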
