Benefits of a clinical data warehouse with data mining tools to collect data for a radiotherapy trial.

INTRODUCTION Collecting trial data in a medical environment is at present mostly performed manually and therefore time-consuming, prone to errors and often incomplete with the complex data considered. Faster and more accurate methods are needed to improve the data quality and to shorten data collection times where information is often scattered over multiple data sources. The purpose of this study is to investigate the possible benefit of modern data warehouse technology in the radiation oncology field. MATERIAL AND METHODS In this study, a Computer Aided Theragnostics (CAT) data warehouse combined with automated tools for feature extraction was benchmarked against the regular manual data-collection processes. Two sets of clinical parameters were compiled for non-small cell lung cancer (NSCLC) and rectal cancer, using 27 patients per disease. Data collection times and inconsistencies were compared between the manual and the automated extraction method. RESULTS The average time per case to collect the NSCLC data manually was 10.4 ± 2.1 min and 4.3 ± 1.1 min when using the automated method (p<0.001). For rectal cancer, these times were 13.5 ± 4.1 and 6.8 ± 2.4 min, respectively (p<0.001). In 3.2% of the data collected for NSCLC and 5.3% for rectal cancer, there was a discrepancy between the manual and automated method. CONCLUSIONS Aggregating multiple data sources in a data warehouse combined with tools for extraction of relevant parameters is beneficial for data collection times and offers the ability to improve data quality. The initial investments in digitizing the data are expected to be compensated due to the flexibility of the data analysis. Furthermore, successive investigations can easily select trial candidates and extract new parameters from the existing databases.

[1]  Shao Hui Huang,et al.  Point-of-care outcome assessment in the cancer clinic: audit of data quality. , 2010, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[2]  Walter J Curran,et al.  Redesigning radiotherapy quality assurance: opportunities to develop an efficient, evidence-based system to support clinical trials--report of the National Cancer Institute Work Group on Radiotherapy Quality Assurance. , 2012, International journal of radiation oncology, biology, physics.

[3]  H. Prokosch,et al.  Perspectives for Medical Informatics , 2009, Methods of Information in Medicine.

[4]  Hans-Ulrich Prokosch,et al.  Experiences with an Interoperable Data Acquisition Platform for Multi-centric Research Networks Based on HL7 CDA , 2007, Methods of Information in Medicine.

[5]  I. Sarkar Biomedical informatics and translational medicine , 2010, Journal of Translational Medicine.

[6]  Patrick Granton,et al.  Radiomics: extracting more information from medical images using advanced feature analysis. , 2012, European journal of cancer.

[7]  E. Yorke,et al.  Improving normal tissue complication probability models: the need to adopt a "data-pooling" culture. , 2010, International journal of radiation oncology, biology, physics.

[8]  P. Lambin,et al.  Design of and technical challenges involved in a framework for multicentric radiotherapy treatment planning studies. , 2010, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[9]  Vincenzo Valentini,et al.  International data-sharing for radiotherapy research: an open-source based infrastructure for multicentric clinical data mining. , 2014, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[10]  Shipeng Yu,et al.  The importance of patient characteristics for the prediction of radiation-induced lung toxicity. , 2009, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[11]  Damien C Weber,et al.  EORTC Radiation Oncology Group quality assurance platform: establishment of a digital central review facility. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[12]  Daniela Thorwarth,et al.  Implementation of hypoxia imaging into treatment planning and delivery. , 2010, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[13]  Sebastian Garde,et al.  Towards shared patient records: An architecture for using routine data for nationwide research , 2006, Int. J. Medical Informatics.

[14]  Damien C Weber,et al.  Quality assurance for prospective EORTC radiation oncology trials: the challenges of advanced technology in a multicenter international setting. , 2011, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[15]  B J Mijnheer,et al.  Prediction of DVH parameter changes due to setup errors for breast cancer treatment based on 2D portal dosimetry. , 2008, Medical physics.

[16]  Philippe Lambin,et al.  An "in silico" clinical trial comparing free breathing, slow and respiration correlated computed tomography in lung cancer patients. , 2005, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[17]  Benjamin Movsas,et al.  Who enrolls onto clinical oncology trials? A radiation Patterns Of Care Study analysis. , 2007, International journal of radiation oncology, biology, physics.

[18]  N. Slevin,et al.  Comparison of patient-reported late treatment toxicity (LENT-SOMA) with quality of life (EORTC QLQ-C30 and QLQ-H&N35) assessment after head and neck radiotherapy. , 2010, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[19]  Jonathan S. Einbinder,et al.  Evaluation of a data warehouse in an academic health sciences center , 2000, Int. J. Medical Informatics.

[20]  Richard McClatchey,et al.  A Data Model for Integrating Heterogeneous Medical Data in the Health-e-Child Project , 2008, HealthGrid.

[21]  Christel Daniel-Le Bozec,et al.  Integrating clinical research with the Healthcare Enterprise: From the RE-USE project to the EHR4CR platform , 2011, J. Biomed. Informatics.

[22]  Andre Dekker,et al.  The integration of PET-CT scans from different hospitals into radiotherapy treatment planning. , 2008, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[23]  R. Bharat Rao,et al.  Mining time-dependent patient outcomes from hospital patient records , 2002, AMIA.

[24]  Liora Alschuler,et al.  Model Formulation: Implementing Single Source: The STARBRITE Proof-of-Concept Study , 2007, J. Am. Medical Informatics Assoc..

[25]  Rachel L Richesson,et al.  Viewpoint: Data Standards in Clinical Research: Gaps, Overlaps, Challenges and Future Directions , 2007, J. Am. Medical Informatics Assoc..

[26]  Barbara Wixom,et al.  The benefits of data warehousing: why some organizations realize exceptional payoffs , 2002, Inf. Manag..

[27]  Robert A. Weinstein,et al.  Application of Information Technology: Development of a Clinical Data Warehouse for Hospital Infection Control , 2003, J. Am. Medical Informatics Assoc..

[28]  P. O'Brien,et al.  Obstacles to participation in randomised cancer clinical trials: A systematic review of the literature , 2012, Journal of medical imaging and radiation oncology.

[29]  Harlan M Krumholz,et al.  Participation in cancer clinical trials: race-, sex-, and age-based disparities. , 2004, JAMA.

[30]  Herman Pieterse,et al.  Richtsnoer voor Good Clinical Practice (CPMP/ICH/135/95). , 2004 .

[31]  Günter Schreier,et al.  Development of an electronic database for quality assurance of radiotherapy in the International Society of Paediatric Oncology (Europe) high risk neuroblastoma study. , 2010, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[32]  P. Lambin,et al.  393 poster AD-HOC DATA SHARING INFRASTRUCTURE FOR RADIOTHERAPY RESEARCH COLLABORATION: A TOOL FOR MULTICENTRIC CLINICAL RESEARCH , 2011 .

[33]  Daniel L Rubin,et al.  A data warehouse for integrating radiologic and pathologic data. , 2008, Journal of the American College of Radiology : JACR.

[34]  Edward Tabor Review of Good Clinical Practice: A Question & Answer Reference Guide , 2012 .

[35]  Shipeng Yu,et al.  Development and external validation of prognostic model for 2-year survival of non-small-cell lung cancer patients treated with chemoradiotherapy. , 2009, International journal of radiation oncology, biology, physics.

[36]  J. Bradley,et al.  Combined PET/CT image characteristics for radiotherapy tumor response in lung cancer. , 2012, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[37]  P. Lambin,et al.  Tumor volume combined with number of positive lymph node stations is a more important prognostic factor than TNM stage for survival of non-small-cell lung cancer patients treated with (chemo)radiotherapy. , 2008, International journal of radiation oncology, biology, physics.

[38]  D De Ruysscher,et al.  Comparison of Bayesian network and support vector machine models for two-year survival prediction in lung cancer patients treated with radiotherapy. , 2010, Medical physics.