A text-based data mining and toxicity prediction modeling system for a clinical decision support in radiation oncology: A preliminary study

This preliminary study integrates text-based data mining with toxicity prediction modeling to support big-data-based clinical decision making in radiation oncology. Structured and unstructured data were prepared from treatment plans, and the unstructured data were extracted by image pattern recognition of dose-volume data from prostate cancer research articles crawled from the internet. We built an artificial neural network model to predict toxicity in organs at risk, using a text-based data mining approach to supply the model for bladder and rectum complication predictions. The pattern recognition method mined the unstructured dose-volume toxicity data with a detection accuracy of 97.9%. The neural network was trained, and its confusion matrix obtained, using 50 modeled plans (n = 50) for validation. The toxicity level was analyzed, and the risk factors for 25% bladder, 50% bladder, 20% rectum, and 50% rectum were calculated by the artificial neural network algorithm. Of the 50 modeled plans, 32 could cause complications and 18 were designed as non-complication plans. We thus integrated data mining and toxicity modeling for toxicity prediction using prostate cancer cases. The results suggest that preprocessing analysis using text-based data mining and prediction modeling can be extended to personalized treatment decision support based on big data.
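The abstract reports a confusion matrix for the complication classifier without showing how one is tallied. The following is a minimal sketch of that evaluation step, assuming binary labels where 1 means "complication" and 0 means "non-complication". The function names and the example labels are hypothetical and are not taken from the study.

```python
from collections import Counter


def confusion_matrix(actual, predicted):
    """Return (tp, fp, fn, tn) for binary labels, where 1 = complication."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    # Count each (actual, predicted) pair; missing pairs default to 0.
    counts = Counter(zip(actual, predicted))
    tp = counts[(1, 1)]  # complication correctly predicted
    fp = counts[(0, 1)]  # complication predicted, none expected
    fn = counts[(1, 0)]  # complication missed
    tn = counts[(0, 0)]  # non-complication correctly predicted
    return tp, fp, fn, tn


def accuracy(tp, fp, fn, tn):
    """Fraction of plans classified correctly."""
    total = tp + fp + fn + tn
    return (tp + tn) / total if total else 0.0


# Made-up labels for illustration only; these are NOT data from the study.
actual = [1, 1, 0, 0, 1, 0]
predicted = [1, 0, 0, 0, 1, 1]

tp, fp, fn, tn = confusion_matrix(actual, predicted)
print("TP, FP, FN, TN:", tp, fp, fn, tn)
print("accuracy:", accuracy(tp, fp, fn, tn))
```

In a setting like the one described, `actual` would hold the reference complication labels of the validation plans and `predicted` the network's thresholded outputs.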
