Causation or only correlation? Application of causal inference graphs for evaluating causality in nano-QSAR models.

In this paper, we suggest that causal inference methods could be efficiently used in Quantitative Structure-Activity Relationships (QSAR) modeling as additional validation criteria within quality evaluation of the model. Verification of the relationships between descriptors and toxicity or other activity in the QSAR model has a vital role in understanding the mechanisms of action. The well-known phrase "correlation does not imply causation" reflects insight statistically correlated with the endpoint descriptor may not cause the emergence of this endpoint. Hence, paradigmatic shifts must be undertaken when moving from traditional statistical correlation analysis to causal analysis of multivariate data. Methods of causal discovery have been applied for broader physical insight into mechanisms of action and interpretation of the developed nano-QSAR models. Previously developed nano-QSAR models for toxicity of 17 nano-sized metal oxides towards E. coli bacteria have been validated by means of the causality criteria. Using the descriptors confirmed by the causal technique, we have developed new models consistent with the straightforward causal-reasoning account. It was proven that causal inference methods are able to provide a more robust mechanistic interpretation of the developed nano-QSAR models.

[1]  Phillip L. Williams,et al.  Use of ion characteristics to predict relative toxicity of mono-, di- and trivalent metal ions: Caenorhabditis elegans LC50 , 1998 .

[2]  Carl V Phillips,et al.  Causal criteria and counterfactuals; nothing more (or less) than scientific common sense , 2006, Emerging themes in epidemiology.

[3]  Bernhard Schölkopf,et al.  Information-geometric approach to inferring causal directions , 2012, Artif. Intell..

[4]  Jerzy Leszczynski,et al.  Periodic table-based descriptors to encode cytotoxicity profile of metal oxide nanoparticles: a mechanistic QSTR approach. , 2014, Ecotoxicology and environmental safety.

[5]  João Ricardo Sato,et al.  Comparing Pearson, Spearman and Hoeffding's d Measure for Gene Expression Association Analysis , 2009, J. Bioinform. Comput. Biol..

[6]  Vesna Rastija,et al.  QSAR study of antioxidant activity of wine polyphenols. , 2009, European journal of medicinal chemistry.

[7]  Stephen R. Johnson,et al.  The Trouble with QSAR (or How I Learned To Stop Worrying and Embrace Fallacy) , 2008, J. Chem. Inf. Model..

[8]  Jerzy Leszczynski,et al.  Optimal descriptor as a translator of eclectic data into prediction of cytotoxicity for metal oxide nanoparticles under different conditions. , 2015, Ecotoxicology and environmental safety.

[9]  Boris M. Smirnov,et al.  Processes involving clusters and small particles in a buffer gas , 2011 .

[10]  Peilin Jia,et al.  Multi-species data integration and gene ranking enrich significant results in an alcoholism genome-wide association study , 2012, BMC Genomics.

[11]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[12]  Jerzy Leszczynski,et al.  How the “Liquid Drop” Approach Could Be Efficiently Applied for Quantitative Structure–Property Relationship Modeling of Nanofluids , 2015 .

[13]  A Donner,et al.  The estimation of intraclass correlation in the analysis of family data. , 1980, Biometrics.

[14]  Andrei N. Soklakov,et al.  Occam's Razor as a Formal Basis for a Physical Theory , 2001 .

[15]  Jerzy Leszczynski,et al.  Causal inference methods to assist in mechanistic interpretation of classification nano-SAR models , 2015 .

[16]  Suzana de Siqueira Santos,et al.  A comparative study of statistical methods used to identify dependencies between gene expression signals , 2014, Briefings Bioinform..

[17]  Mtw,et al.  Computation, causation, and discovery , 2000 .

[18]  Jerzy Leszczynski,et al.  Novel application of the CORAL software to model cytotoxicity of metal oxide nanoparticles to bacteria Escherichia coli. , 2012, Chemosphere.

[19]  Jon Williamson,et al.  Interpreting Causality in the Health Sciences , 2007 .

[20]  Jerzy Leszczynski,et al.  Using nano-QSAR to predict the cytotoxicity of metal oxide nanoparticles. , 2011, Nature nanotechnology.

[21]  Richard Scheines,et al.  Discovery Algorithms for Causally Sufficient Structures , 1993 .

[22]  Jan T. A. Koster 1. Causation, Prediction, and Search. 2nd edn. Peter Spirtes, Clark Glymour and Richard Scheines, MIT Press, Cambridge, MA, 2000. No. of pages: 543. ISBN 0‐262‐19440‐6 , 2003 .

[23]  Richard Scheines,et al.  Causation and Prediction: Axioms and Explications , 1993 .

[24]  Irene Luque Ruiz,et al.  Structural similarity and descriptor spaces for clustering and development of QSAR models. , 2013, Current computer-aided drug design.

[25]  Jerzy Leszczynski,et al.  From basic physics to mechanisms of toxicity: the "liquid drop" approach applied to develop predictive classification models for toxicity of metal oxide nanoparticles. , 2014, Nanoscale.