OCTANE: Oncology Clinical Trial Annotation Engine.

PURPOSE Many targeted therapies are currently available only via clinical trials. Therefore, routine precision oncology using biomarker-based assignment to drug depends on matching patients to clinical trials. A comprehensive and up-to-date trial database is necessary for optimal patient-trial matching. METHODS We describe processes for establishing and maintaining a clinical trial database, focusing on genomically informed trials. Furthermore, we present OCTANE (Oncology Clinical Trial Annotation Engine), an informatics framework supporting these processes in a scalable fashion. To illustrate how the framework can be applied at an institution, we describe how we implemented an instance of OCTANE at a large cancer center. OCTANE consists of three modules. The data aggregation module automates retrieval, aggregation, and update of trial information. The annotation module establishes the database schema, implements data integration necessary for automation, and provides an annotation interface. The update module monitors trial change logs, identifies critical change events, and alerts the annotators when manual intervention may be needed. RESULTS Using OCTANE, we annotated 5,439 oncology clinical trials (4,438 genomically informed trials) that collectively were associated with 1,453 drugs, 779 genes, and 252 cancer types. To date, we have used the database to screen 4,220 patients for trial eligibility. We compared the update module with expert review, and the module achieved 98.5% accuracy, 0% false-negative rate, and 2.3% false-positive rate. CONCLUSION OCTANE is a general informatics framework that can be helpful for establishing and maintaining a comprehensive database necessary for automating patient-trial matching, which facilitates the successful delivery of personalized cancer care on a routine basis. Several OCTANE components are publically available and may be useful to other precision oncology programs.

[1]  Alex H. Wagner,et al.  DGIdb 3.0: a redesign and expansion of the drug–gene interaction database , 2017, bioRxiv.

[2]  Xiaoying Wu,et al.  EliXR: an approach to eligibility criteria extraction and representation , 2011, J. Am. Medical Informatics Assoc..

[3]  Razelle Kurzrock,et al.  Association of Biomarker-Based Treatment Strategies With Response Rates and Progression-Free Survival in Refractory Malignant Neoplasms: A Meta-analysis. , 2016, JAMA oncology.

[4]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[5]  M. Hampshire,et al.  An Internet-Based Cancer Clinical Trials Matching Resource , 2005, Journal of medical Internet research.

[6]  Yaoyun Zhang,et al.  Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov , 2016, J. Am. Medical Informatics Assoc..

[7]  Funda Meric-Bernstam,et al.  Operationalization of Next-Generation Sequencing and Decision Support for Precision Oncology. , 2019, JCO clinical cancer informatics.

[8]  Amber M. Johnson,et al.  A decision support framework for genomically informed investigational cancer therapy. , 2015, Journal of the National Cancer Institute.

[9]  Tatiana A. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2004, Nucleic Acids Res..

[10]  Tingting Fu,et al.  Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics , 2017, Nucleic Acids Res..

[11]  Hua Xu,et al.  Automated identification of molecular effects of drugs (AIMED) , 2016, J. Am. Medical Informatics Assoc..

[12]  L. Siu,et al.  Molecular profiling for precision cancer therapies , 2020, Genome Medicine.