Standardized data collection to build prediction models in oncology: a prototype for rectal cancer.

Advances in diagnostic and treatment technology have transformed the concept of internal medicine and established the new idea of personalized medicine. Inter- and intra-patient tumor heterogeneity, together with the complexity of clinical outcomes and treatment toxicity, justifies the effort to develop predictive models for decision support systems. However, the number of variables to evaluate, which come from multiple disciplines (oncology, computer science, bioinformatics, statistics, genomics, and imaging, among others), can be very large, making traditional statistical analysis difficult to apply. Automated data-mining processes and machine learning approaches can help organize this massive amount of data and unravel important interactions. The purpose of this paper is to describe a strategy for collecting and analyzing data properly for decision support, and to introduce the concept of an 'umbrella protocol' within the framework of 'rapid learning healthcare'.
