A Markov decision process approach to temporal modulation of dose fractions in radiation therapy planning

The current state of the art in cancer treatment by radiation optimizes beam intensity spatially such that tumors receive high dose radiation whereas damage to nearby healthy tissues is minimized. It is common practice to deliver the radiation over several weeks, where the daily dose is a small constant fraction of the total planned. Such a 'fractionation schedule' is based on traditional models of radiobiological response where normal tissue cells possess the ability to repair sublethal damage done by radiation. This capability is significantly less prominent in tumors. Recent advances in quantitative functional imaging and biological markers are providing new opportunities to measure patient response to radiation over the treatment course. This opens the door for designing fractionation schedules that take into account the patient's cumulative response to radiation up to a particular treatment day in determining the fraction on that day. We propose a novel approach that, for the first time, mathematically explores the benefits of such fractionation schemes. This is achieved by building a stylistic Markov decision process (MDP) model, which incorporates some key features of the problem through intuitive choices of state and action spaces, as well as transition probability and reward functions. The structure of optimal policies for this MDP model is explored through several simple numerical examples.

[1]  Lei Xing,et al.  Formulating adaptive radiation therapy (ART) treatment planning into a closed-loop control framework , 2007, Physics in medicine and biology.

[2]  H. Rehbinder,et al.  Adaptive radiation therapy for compensation of errors in patient setup and treatment delivery. , 2004, Medical physics.

[3]  Isabelle Berry,et al.  Proton magnetic resonance spectroscopic imaging in newly diagnosed glioblastoma: predictive value for the site of postradiotherapy relapse in a prospective longitudinal study. , 2008, International journal of radiation oncology, biology, physics.

[4]  Dwight E Heron,et al.  PET-CT in Radiation Oncology: The Impact on Diagnosis, Treatment Planning, and Assessment of Treatment Response , 2008, American journal of clinical oncology.

[5]  M. Astrahan,et al.  Some implications of linear-quadratic-linear radiation dose-response with regard to hypofractionation. , 2008, Medical physics.

[6]  James L Tatum,et al.  Hypoxia: Importance in tumor biology, noninvasive measurement by imaging, and value of its measurement in the management of cancer therapy , 2006, International journal of radiation biology.

[7]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[8]  M. Alber,et al.  A comparison of forward and inverse treatment planning for intensity-modulated radiotherapy of head and neck cancer. , 2003, Radiotherapy and oncology : journal of the European Society for Therapeutic Radiology and Oncology.

[9]  Bram Stieltjes,et al.  MRI for identification of progression in brain tumors: from morphology to function , 2008, Expert review of neurotherapeutics.

[10]  Adam P Dicker,et al.  Biologically conformal treatment: biomarkers and functional imaging in radiation oncology. , 2008, Future oncology.

[11]  Warren B. Powell,et al.  Approximate Dynamic Programming - Solving the Curses of Dimensionality , 2007 .

[12]  Eva K. Lee,et al.  Operations research applied to radiotherapy, an NCI-NSF-sponsored workshop February 7-9, 2002. , 2003, International journal of radiation oncology, biology, physics.

[13]  M Partridge,et al.  A theoretical framework for prescribing radiotherapy dose distributions using patient-specific biological information. , 2008, Medical physics.

[14]  Andrew J. Schaefer,et al.  The Optimal Time to Initiate HIV Therapy Under Ordered Health States , 2008, Oper. Res..

[15]  X. Li,et al.  Direct aperture optimization-based intensity-modulated radiotherapy for whole breast irradiation. , 2007, International journal of radiation oncology, biology, physics.

[16]  G. W. Swan Optimization of Human Cancer Radiotherapy , 1981 .

[17]  Cedric X. Yu,et al.  New Developments in Intensity Modulated Radiation Therapy , 2006, Technology in cancer research & treatment.

[18]  E. Yorke,et al.  Reduced-order constrained optimization in IMRT planning , 2008, Physics in medicine and biology.

[19]  E. Hall,et al.  Radiobiology for the radiologist , 1973 .

[20]  Torsten Mattfeldt,et al.  Imaging proliferation in lung tumors with PET: 18F-FLT versus 18F-FDG. , 2003, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[21]  A. Tsuchiya,et al.  The QALY Model and Individual Preferences for Health States and Health Profiles over Time: A Systematic Review of the Literature , 2005, Medical decision making : an international journal of the Society for Medical Decision Making.

[22]  R. Jeraj,et al.  Re-optimization in adaptive radiotherapy. , 2002, Physics in medicine and biology.

[23]  P J Neumann,et al.  An Off-the-Shelf Help List , 2001, Medical decision making : an international journal of the Society for Medical Decision Making.

[24]  J. Fowler The linear-quadratic formula and progress in fractionated radiotherapy. , 1989, The British journal of radiology.

[25]  Susan T. Stewart,et al.  Utilities For Prostate Cancer Health States in Men Aged 60 and Older , 2005, Medical care.

[26]  A. Allen,et al.  Intensity-modulated radiotherapy for head and neck cancer. , 2007, Clinical advances in hematology & oncology : H&O.

[27]  Mustafa Y. Sir,et al.  Optimization of radiotherapy considering uncertainties caused by daily setup procedures and organ motion. , 2007 .

[28]  L. Peters,et al.  The role of PET-CT in the management of patients with advanced cancer of the head and neck , 2008, Current oncology reports.

[29]  Qing Hou,et al.  A method to improve spatial resolution and smoothness of intensity profiles in IMRT treatment planning. , 2004, Medical physics.

[30]  Michael C. Ferris,et al.  Digital Object Identifier (DOI) 10.1007/s10107-004-0530-y , 2004 .

[31]  F. Sassi Calculating QALYs, comparing QALY and DALY calculations. , 2006, Health policy and planning.

[32]  H. Minn,et al.  Imaging of tumor hypoxia to predict treatment sensitivity. , 2008, Current pharmaceutical design.

[33]  Andrew J. Schaefer,et al.  The Optimal Timing of Living-Donor Liver Transplantation , 2004, Manag. Sci..

[34]  Marko Seppänen,et al.  18F-EF5: A New PET Tracer for Imaging Hypoxia in Head and Neck Cancer , 2008, Journal of Nuclear Medicine.

[35]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[36]  A. Bezjak,et al.  Palliative thoracic radiotherapy for lung cancer: a systematic review. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.