Beyond data integration.

Pharmaceutical R&D organizations have no shortage of experimental data or annotation information. However, the sheer volume and complexity of this information results in a paralyzing inability to make effective use of it for predicting drug efficacy and safety. Data integration efforts are legion, but even in the rare instances where they succeed, they are found to be insufficient to advance programs because interpretation of query results becomes a research project in itself. In this review, we propose a coherent, interoperable platform comprising knowledge engineering and hypothesis generation components for rapidly making determinations of confidence in mechanism and safety (among other goals) using experimental data and expert knowledge.

[1]  Jim Giles,et al.  Key biology databases go wiki , 2007, Nature.

[2]  Allen Newell,et al.  Computer science as empirical inquiry: symbols and search , 1976, CACM.

[3]  George M. Milne,et al.  Chapter 35. Pharmaceutical productivity — the imperative for new paradigms , 2003 .

[4]  G. Pisano Can science be a business? Lessons from biotech. , 2006, Harvard business review.

[5]  Eric K. Neumann,et al.  Knowledge networks in the age of the Semantic Web , 2007, Briefings Bioinform..

[6]  Stephen Emmott,et al.  Towards 2020 Science , 2006 .

[7]  Eric A G Blomme,et al.  Toxicogenomics in drug discovery: from preclinical studies to clinical trials. , 2004, Chemico-biological interactions.

[8]  E. Neumann,et al.  Knowledge assembly for the life sciences. , 2002, Drug discovery today.

[9]  Stephen A. Racunas,et al.  HyBrow: a prototype system for computer-aided hypothesis evaluation , 2004, ISMB/ECCB.

[10]  Chitta Baral,et al.  Knowledge-based framework for hypothesis formation in biochemical networks , 2005, ECCB/JBI.

[11]  Stephen H. Friend,et al.  Toxicogenomics and drug discovery: will new technologies help us produce better drugs? , 2002, Nature Reviews Drug Discovery.

[12]  Phillip G. Febbo,et al.  Identification of the androgen-induced transcriptional program of human prostate cancer revealed by causal analysis , 2005 .

[13]  I. Kola,et al.  Can the pharmaceutical industry reduce attrition rates? , 2004, Nature Reviews Drug Discovery.

[14]  Lefkos Middleton,et al.  Disease-specific target selection: a critical first step down the right road. , 2005, Drug discovery today.

[15]  P D Karp,et al.  Pathway Databases: A Case Study in Computational Symbolic Theories , 2001, Science.

[16]  J. Ben Rosen,et al.  Pracniques: construction of nonlinear programming test problems , 1965, Commun. ACM.

[17]  Ramasamy Uthurusamy,et al.  EVOLVING DATA MINING INTO SOLUTIONS FOR INSIGHTS , 2002 .

[18]  David B. Searls,et al.  Data integration: challenges for drug discovery , 2005, Nature Reviews Drug Discovery.

[19]  William Loging,et al.  High-throughput electronic biology: mining information for drug discovery , 2007, Nature Reviews Drug Discovery.

[20]  E. Kunkel Systems biology in drug discovery , 2004, Nature Biotechnology.

[21]  Ramasamy Uthurusamy,et al.  Evolving data into mining solutions for insights , 2002, CACM.

[22]  P. Bork,et al.  Literature mining for the biologist: from information retrieval to biological discovery , 2006, Nature Reviews Genetics.