Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species.

Drug-induced liver injury is one of the main causes of drug attrition. The ability to predict the liver effects of drug candidates from their chemical structures is critical to help guide experimental drug discovery projects toward safer medicines. In this study, we have compiled a data set of 951 compounds reported to produce a wide range of effects in the liver in different species, comprising humans, rodents, and nonrodents. The liver effects for this data set were obtained as assertional metadata, generated from MEDLINE abstracts using a unique combination of lexical and linguistic methods and ontological rules. We have analyzed this data set using conventional cheminformatics approaches and addressed several questions pertaining to cross-species concordance of liver effects, chemical determinants of liver effects in humans, and the prediction of whether a given compound is likely to cause a liver effect in humans. We found that the concordance of liver effects was relatively low (ca. 39-44%) between different species, raising the possibility that species specificity could depend on specific features of chemical structure. Compounds were clustered by their chemical similarity, and similar compounds were examined for the expected similarity of their species-dependent liver effect profiles. In most cases, similar profiles were observed for members of the same cluster, but some compounds appeared as outliers. The outliers were the subject of focused assertion regeneration from MEDLINE as well as other data sources. In some cases, additional biological assertions were identified, which were in line with expectations based on compounds' chemical similarities. The assertions were further converted to binary annotations of underlying chemicals (i.e., liver effect vs no liver effect), and binary quantitative structure-activity relationship (QSAR) models were generated to predict whether a compound would be expected to produce liver effects in humans. Despite the apparent heterogeneity of data, models have shown good predictive power assessed by external 5-fold cross-validation procedures. The external predictive power of binary QSAR models was further confirmed by their application to compounds that were retrieved or studied after the model was developed. To the best of our knowledge, this is the first study for chemical toxicity prediction that applied QSAR modeling and other cheminformatics techniques to observational data generated by the means of automated text mining with limited manual curation, opening up new opportunities for generating and modeling chemical toxicology data.

[1]  J. Waring,et al.  Use of toxicogenomics to understand mechanisms of drug-induced hepatotoxicity during drug discovery and development. , 2009, Toxicology letters.

[2]  D. Young,et al.  Are the Chemical Structures in Your QSAR Correct , 2008 .

[3]  Peter V. Henstock,et al.  Cellular imaging predictions of clinical drug-induced liver injury. , 2008, Toxicological sciences : an official journal of the Society of Toxicology.

[4]  I. Tetko,et al.  ISIDA - Platform for Virtual Screening Based on Fragment and Pharmacophoric Descriptors , 2008 .

[5]  Alexandre Varnek,et al.  Building a chemical space based on fragment descriptors. , 2008, Combinatorial chemistry & high throughput screening.

[6]  Igor V. Tetko,et al.  Critical Assessment of QSAR Models of Environmental Toxicity against Tetrahymena pyriformis: Focusing on Applicability Domain and Overfitting by Variable Selection , 2008, J. Chem. Inf. Model..

[7]  P. Olinga,et al.  Microarray analysis in rat liver slices correctly predicts in vivo hepatotoxicity. , 2008, Toxicology and applied pharmacology.

[8]  Igor V. Tetko,et al.  Combinatorial QSAR Modeling of Chemical Toxicants Tested against Tetrahymena pyriformis , 2008, J. Chem. Inf. Model..

[9]  M. Natália D. S. Cordeiro,et al.  Computational chemistry approach for the early detection of drug‐induced idiosyncratic liver toxicity , 2008, J. Comput. Chem..

[10]  Alexander Golbraikh,et al.  Predictive QSAR modeling workflow, model applicability domains, and virtual screening. , 2007, Current pharmaceutical design.

[11]  Alexandre Varnek,et al.  QSPR Modeling of the AmIII/EuIII Separation Factor: How Far Can we Predict ? , 2007 .

[12]  F Peter Guengerich,et al.  Applying mechanisms of chemical toxicity to predict drug safety. , 2007, Chemical research in toxicology.

[13]  W. Suter Predictive value of in vitro safety studies. , 2006, Current opinion in chemical biology.

[14]  P. Bernardi,et al.  High concordance of drug-induced human hepatotoxicity with in vitro cytotoxicity measured in a novel cell-based model using high content screening , 2006, Archives of Toxicology.

[15]  Paul B Watkins,et al.  Drug‐induced liver injury: Summary of a single topic clinical research conference , 2006, Hepatology.

[16]  Alexandre Varnek,et al.  Substructural fragments: an universal language to encode reactions, molecular and supramolecular structures , 2005, J. Comput. Aided Mol. Des..

[17]  S. Tannenbaum,et al.  In vitro methods to study chemically-induced hepatotoxicity: a literature review. , 2005, Current drug metabolism.

[18]  G. Zlokarnik,et al.  In silico prediction of drug safety: despite progress there is abundant room for improvement. , 2004, Drug discovery today. Technologies.

[19]  Philippa R. N. Wolohan,et al.  Modelling in vitro hepatotoxicity using molecular interaction fields and SIMCA. , 2004, Journal of molecular graphics & modelling.

[20]  N. Kaplowitz,et al.  Drug-Induced Liver Injury , 2004, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[21]  Steven L. Dixon,et al.  In silico models for the prediction of dose-dependent human hepatotoxicity , 2003, J. Comput. Aided Mol. Des..

[22]  John M. Barnard,et al.  Clustering Methods and Their Uses in Computational Chemistry , 2003 .

[23]  K. Hornbuckle,et al.  Evaluation of the Characteristics of Safety Withdrawal of Prescription Drugs from Worldwide Pharmaceutical Markets-1960 to 1999 , 2001 .

[24]  P Smith,et al.  Concordance of the toxicity of pharmaceuticals in humans and in animals. , 2000, Regulatory toxicology and pharmacology : RTP.

[25]  G. Betton,et al.  The predictivity of the toxicity of pharmaceuticals in humans from animal data--an interim assessment. , 1998, Toxicology letters.

[26]  F. Ballet,et al.  Hepatotoxicity in drug development: detection, significance and solutions. , 1997, Journal of hepatology.

[27]  M. Schmid,et al.  [Acute hepatitis following administration of fansidar]. , 1990, Schweizerische medizinische Wochenschrift.