Relationship between molecular connectivity and carcinogenic activity: a confirmation with a new software program based on graph theory.

For a database of 826 chemicals tested for carcinogenicity, we fragmented the structural formula of the chemicals into all possible contiguous-atom fragments with size between two and eight (nonhydrogen) atoms. The fragmentation was obtained using a new software program based on graph theory. We used 80% of the chemicals as a training set and 20% as a test set. The two sets were obtained by random sorting. From the training sets, an average (8 computer runs with independently sorted chemicals) of 315 different fragments were significantly (p < 0.125) associated with carcinogenicity or lack thereof. Even using this relatively low level of statistical significance, 23% of the molecules of the test sets lacked significant fragments. For 77% of the molecules of the test sets, we used the presence of significant fragments to predict carcinogenicity. The average level of accuracy of the predictions in the test sets was 67.5%. Chemicals containing only positive fragments were predicted with an accuracy of 78.7%. The level of accuracy was around 60% for chemicals characterized by contradictory fragments or only negative fragments. In a parallel manner, we performed eight paired runs in which carcinogenicity was attributed randomly to the molecules of the training sets. The fragments generated by these pseudo-training sets were devoid of any predictivity in the corresponding test sets. Using an independent software program, we confirmed (for the complex biological endpoint of carcinogenicity) the validity of a structure-activity relationship approach of the type proposed by Klopman and Rosenkranz with their CASE program. ImagesFigure 1.Figure 2.Figure 3.Figure 4.Figure 5.Figure 6.

[1]  Dennis Bahler,et al.  The Induction of Rules for Predicting Chemical Carcinogenesis in Rodents , 1993, ISMB.

[2]  H S Rosenkranz,et al.  Use of structure-activity relationships in predicting carcinogenesis. , 1986, IARC scientific publications.

[3]  L. Hodes,et al.  A statistical-heuristic methods for automated selection of drugs for screening. , 1977, Journal of medicinal chemistry.

[4]  H S Rosenkranz,et al.  Testing by artificial intelligence: computational alternatives to the determination of mutagenicity. , 1992, Mutation research.

[5]  H. Rosenkranz,et al.  The structural basis of the mutagenicity of chemicals in Salmonella typhimurium: the National Toxicology Program Data Base. , 1990, Mutation research.

[6]  R. Tennant,et al.  Stratification of rodent carcinogenicity bioassay results to reflect relative human hazard. , 1993, Mutation research.

[7]  A M Richard,et al.  A CASE-SAR analysis of polycyclic aromatic hydrocarbon carcinogenicity. , 1990, Mutation research.

[8]  J. Ashby Structural analysis as a means of predicting carcinogenic potential. , 1978, British Journal of Cancer.

[9]  B. Ames,et al.  Second chronological supplement to the Carcinogenic Potency Database: standardized results of animal bioassays published through December 1984 and by the National Toxicology Program through May 1986. , 1987, Environmental health perspectives.

[10]  B. Ames,et al.  Chronological supplement to the Carcinogenic Potency Database: standardized results of animal bioassays published through December 1982. , 1986, Environmental health perspectives.

[11]  J. Ashby,et al.  The influence of chemical structure on the extent and sites of carcinogenesis for 522 rodent carcinogens and 55 different human carcinogen exposures. , 1993, Mutation research.

[12]  J. Huff,et al.  Long-term chemical carcinogenesis experiments for identifying potential human cancer hazards: collective database of the National Cancer Institute and National Toxicology Program (1976-1991). , 1991, Environmental health perspectives.

[13]  R. Tennant,et al.  Chemical structure, Salmonella mutagenicity and extent of carcinogenicity as indicators of genotoxic carcinogenesis among 222 chemicals tested in rodents by the U.S. NCI/NTP. , 1988, Mutation research.

[14]  C E Berkoff,et al.  Substructural analysis. A novel approach to the problem of drug design. , 1974, Journal of medicinal chemistry.

[15]  Frank Harary,et al.  Graph Theory , 2016 .

[16]  H S Rosenkranz,et al.  New structural concepts for predicting carcinogenicity in rodents: an artificial intelligence approach. , 1990, Teratogenesis, carcinogenesis, and mutagenesis.

[17]  B H Margolin,et al.  Prediction of chemical carcinogenicity in rodents from in vitro genetic toxicity assays. , 1987, Science.

[18]  R. Tennant,et al.  Classification according to chemical structure, mutagenicity to Salmonella and level of carcinogenicity of a further 39 chemicals tested for carcinogenicity by the U.S. National Toxicology Program. , 1991, Mutation research.

[19]  R. J. Feldmann,et al.  PATTERN RECOGNITION AND STRUCTURE-ACTIVITY RELATIONSHIP STUDIES, COMPUTER-ASSISTED PREDICTION OF ANTITUMOR ACTIVITY IN STRUCTURALLY DIVERSE DRUGS IN AN EXPERIMENTAL MOUSE BRAIN TUMOR SYSTEM , 1975 .

[20]  M. Pike,et al.  A carcinogenic potency database of the standardized results of animal bioassays , 1984, Environmental health perspectives.

[21]  J. Ashby Fundamental structural alerts to potential carcinogenicity or noncarcinogenicity. , 1985, Environmental mutagenesis.

[22]  H S Rosenkranz,et al.  Quantification of the predictivity of some short-term assays for carcinogenicity in rodents. , 1991, Mutation research.

[23]  R. Tennant,et al.  Definitive relationships among chemical structure, carcinogenicity and mutagenicity for 301 chemicals tested by the U.S. NTP. , 1991, Mutation research.

[24]  B. Ames,et al.  Third chronological supplement to the carcinogenic potency database: standardized results of animal bioassays published through December 1986 and by the National Toxicology Program through June 1987. , 1987, Environmental health perspectives.

[25]  J Ashby,et al.  Consideration of CASE predictions of genotoxic carcinogenesis for omeprazole, methapyrilene and azathioprine. , 1992, Mutation research.

[26]  H. Rosenkranz,et al.  The structural basis of the mutagenicity of chemicals in Salmonella typhimurium: the Gene-Tox data base. , 1990, Mutation research.

[27]  E Zeiger,et al.  Classification according to chemical structure, mutagenicity to Salmonella and level of carcinogenicity of a further 42 chemicals tested for carcinogenicity by the U.S. National Toxicology Program. , 1989, Mutation research.

[28]  H S Rosenkranz,et al.  Structure activity-based predictive toxicology: an efficient and economical method for generating non-congeneric data bases. , 1991, Mutagenesis.

[29]  M. Shapiro,et al.  Pattern recognitiion and structure-activity relationship studies. Computer-assisted prediction of antitumor activity in structurally diverse drugs in an experimental mouse brain tumor system. , 1975, Journal of medicinal chemistry.

[30]  Gilles Klopman,et al.  Computer simulation of physical-chemical properties of organic molecules. 1. Molecular system identification , 1981, J. Chem. Inf. Comput. Sci..

[31]  H S Rosenkranz,et al.  Prediction of environmental carcinogens: a strategy for the mid-1980s. , 1984, Environmental mutagenesis.

[32]  G Klopman,et al.  Structure-activity relations: maximizing the usefulness of mutagenicity and carcinogenicity databases. , 1991, Environmental health perspectives.

[33]  Gilles Klopman,et al.  Evaluation of quantitative structure-activity predictions. Comparison of the predictive power of an artificial intelligence system with human experts , 1990 .

[34]  Louis Hodes,et al.  A STATISTICAL-HEURISTIC METHOD FOR AUTOMATED SELECTION OF DRUGS FOR SCREENING , 1977 .

[35]  Alexandru T. Balaban,et al.  Applications of graph theory in chemistry , 1985, J. Chem. Inf. Comput. Sci..

[36]  L. Bernstein,et al.  Interspecies extrapolation in carcinogenesis: prediction between rats and mice. , 1989, Environmental health perspectives.

[37]  H S Rosenkranz,et al.  Structural basis of carcinogenicity in rodents of genotoxicants and non-genotoxicants. , 1990, Mutation research.