Predictive statistics and artificial intelligence in the U.S. National Cancer Institute's drug discovery program for cancer and AIDS

The National Cancer Institute's drug discovery program screens more than 20,000 chemical compounds and natural products a year for activity against a panel of 60 tumor cell lines in vitro. The result is an information‐rich database of patterns that form the basis for what we term an “information‐intensive” approach to the process of drug discovery. The first step was a demonstration, both by statistical methods (including the program COMPARE) and by neural networks, that patterns of activity in the screen can be used to predict a compound's mechanism of action. Given this finding, the overall plan has been to develop three large matrices of information: the first (designated A) gives the pattern of activity for each compound tested against each cell line in the screen; the second (S) encodes any of a number of types of 2‐D or 3‐D structural motifs for each compound; the third (T) indicates each cell's expression of molecular targets (e.g., from 2‐dimensional protein gel electrophoresis). Construction and updating of these matrices is an ongoing process. The matrices can be concatenated in various ways to test a variety of specific hypotheses about compounds screened, as well as to “prioritize” candidate compounds for testing. To aid in these efforts, we have developed the DISCOVERY program package, which integrates the matrix data for visual pattern recognition. The “information‐intensive” approach summarized here in some senses serves to bridge the perceived gap between screening and structure based drug design.

[1]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[2]  D. Scudiero,et al.  Evaluation of a soluble tetrazolium/formazan assay for cell growth and drug sensitivity in culture using human and other tumor cell lines. , 1988, Cancer research.

[3]  M. Boyd,et al.  Reproducibility and response patterns of the ic50 values and relative cell line sensitivities from the nci human tumor cell line drug screening project , 1988 .

[4]  D A Scudiero,et al.  Feasibility of drug screening with panels of human tumor cell lines using a microculture tetrazolium assay. , 1988, Cancer research.

[5]  D A Scudiero,et al.  Display and analysis of patterns of differential activity of drugs against human tumor cell lines: development of mean graph and COMPARE algorithm. , 1989, Journal of the National Cancer Institute.

[6]  D. Scudiero,et al.  Feasibility of a high-flux anticancer drug screen using a diverse panel of cultured human tumor cell lines. , 1991, Journal of the National Cancer Institute.

[7]  N. Anderson,et al.  An updated two‐dimensional gel database of rat liver proteins useful in gene regulation and drug effect studies , 1991 .

[8]  K D Paull,et al.  Halichondrin B and homohalichondrin B, marine natural products binding in the vinca domain of tubulin. Discovery of tubulin-based mechanism of action by analysis of differential cytotoxicity data. , 1991, The Journal of biological chemistry.

[9]  K. Paull,et al.  Exploratory data analytic techniques to evaluate anticancer agents screened in a cell culture panel. , 1992, Journal of biopharmaceutical statistics.

[10]  L V Rubinstein,et al.  Multidrug-resistant phenotype of disease-oriented panels of human tumor cell lines used for anticancer drug screening. , 1992, Cancer research.

[11]  J N Weinstein,et al.  Neural computing in cancer drug development: predicting mechanism of action. , 1992, Science.

[12]  M. Grever,et al.  Cell line-based screening for new anticancer drugs , 1994 .