Learning the Physics of Particle Transport via Transformers

Particle physics simulations are the cornerstone of nuclear engineering applications. Among them radiotherapy (RT) is crucial for society, with 50% of cancer patients receiving radiation treatments. For the most precise targeting of tumors, next generation RT treatments aim for real-time correction during radiation delivery, necessitating particle transport algorithms that yield precise dose distributions in sub-second times even in highly heterogeneous patient geometries. This is infeasible with currently available, purely physics based simulations. In this study, we present a data-driven dose calculation algorithm predicting the dose deposited by monoenergetic proton beams for arbitrary energies and patient geometries. Our approach frames particle transport as sequence modeling, where convolutional layers extract important spatial features into tokens and the transformer self-attention mechanism routes information between such tokens in the sequence and a beam energy token. We train our network and evaluate prediction accuracy using computationally expensive but accurate Monte Carlo (MC) simulations, considered the gold standard in particle physics. Our proposed model is 33 times faster than current clinical analytic pencil beam algorithms, improving upon their accuracy in the most heterogeneous and challenging geometries. With a relative error of 0.34 ± 0.2% and very high gamma pass rate of 99.59 ± 0.7% (1%, 3 mm), it also greatly outperforms the only published similar data-driven proton dose algorithm, even at a finer grid resolution. Offering MC precision 400 times faster, our model could overcome a major obstacle that has so far prohibited real-time adaptive proton treatments and significantly increase cancer treatment efficacy. Its potential to model physics interactions of other particles could also boost heavy ion treatment planning procedures limited by the speed of traditional methods.

[1]  Kengo Ito,et al.  A convolutional neural network approach for IMRT dose distribution prediction in prostate cancer patients , 2019, Journal of radiation research.

[2]  Martin Jaggi,et al.  On the Relationship between Self-Attention and Convolutional Layers , 2019, ICLR.

[3]  J. Lee,et al.  Denoising proton therapy Monte Carlo dose distributions in multiple tumor sites: A comparative neural networks architecture study. , 2021, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.

[4]  Steve B. Jiang,et al.  A feasibility study for predicting optimal radiation therapy dose distributions of prostate cancer patients from patient anatomy using deep learning , 2017, Scientific Reports.

[5]  Abulikemu Abuduweili,et al.  Escaping the Big Data Paradigm with Compact Transformers , 2021, ArXiv.

[6]  Ashish Vaswani,et al.  Stand-Alone Self-Attention in Vision Models , 2019, NeurIPS.

[7]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[8]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[9]  Ryan Neph,et al.  DeepMC: a deep learning method for efficient Monte Carlo beamlet dose calculation by predictive denoising in magnetic resonance-guided radiotherapy , 2020, Physics in medicine and biology.

[10]  D. Low,et al.  A technique for the quantitative evaluation of dose distributions. , 1998, Medical physics.

[11]  B. Heijmen,et al.  Automation in intensity modulated radiotherapy treatment planning-a review of recent innovations. , 2018, The British journal of radiology.

[12]  Kevin Souris,et al.  Fast multipurpose Monte Carlo simulation for proton therapy using multi- and many-core CPU architectures. , 2016, Medical physics.

[13]  Hongming Shan,et al.  Deep learning for accelerating Monte Carlo radiation transport simulation in intensity-modulated radiation therapy , 2019 .

[14]  Steve B. Jiang,et al.  Three-Dimensional Dose Prediction for Lung IMRT Patients with Deep Neural Networks: Robust Learning from Heterogeneous Beam Configurations , 2018, ArXiv.

[15]  Yaorong Ge,et al.  Fluence Map Prediction Using Deep Learning Models – Direct Plan Generation for Pancreas Stereotactic Body Radiation Therapy , 2020, Frontiers in Artificial Intelligence.

[16]  James Demmel,et al.  Large Batch Optimization for Deep Learning: Training BERT in 76 minutes , 2019, ICLR.

[17]  Kaiming He,et al.  Group Normalization , 2018, ECCV.

[18]  Steve B. Jiang,et al.  Deep dose plugin: towards real-time Monte Carlo dose calculation through a deep learning-based denoising algorithm , 2020, Mach. Learn. Sci. Technol..

[19]  Jiawei Fan,et al.  Automatic treatment planning based on three‐dimensional dose distribution predicted from deep learning technique , 2018, Medical physics.

[20]  Bengt Jönsson,et al.  Proton therapy of cancer: Potential clinical advantages and cost-effectiveness , 2005, Acta oncologica.

[21]  Matthieu Cord,et al.  Training data-efficient image transformers & distillation through attention , 2020, ICML.

[22]  Jiawei Fan,et al.  Data-driven dose calculation algorithm based on deep U-Net , 2020, Physics in medicine and biology.

[23]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[24]  Gregory C Sharp,et al.  Evaluation of CBCT scatter correction using deep convolutional neural networks for head and neck adaptive proton therapy , 2020, Physics in medicine and biology.

[25]  Hongming Shan,et al.  MCDNet – A Denoising Convolutional Neural Network to Accelerate Monte Carlo Radiation Transport Simulations: A Proof of Principle With Patient Dose From X-Ray CT Imaging , 2019, IEEE Access.

[26]  Steve B. Jiang,et al.  Improving proton dose calculation accuracy by using deep learning , 2020, Mach. Learn. Sci. Technol..

[27]  Georg Heigold,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2021, ICLR.

[28]  Xiaowei Liu,et al.  A preliminary study of a photon dose calculation algorithm using a convolutional neural network , 2020, Physics in medicine and biology.

[29]  Steve B. Jiang,et al.  A Feasibility Study on Deep Learning-Based Radiotherapy Dose Calculation , 2019, 1908.03159.

[30]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[31]  Jan J W Lagendijk,et al.  DeepDose: Towards a fast dose calculation engine for radiation therapy using deep learning , 2020, Physics in medicine and biology.

[32]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[33]  Steve B. Jiang,et al.  Boosting radiotherapy dose calculation accuracy with deep learning , 2020, Journal of applied clinical medical physics.

[34]  Vasant Kearney,et al.  DoseNet: a volumetric dose prediction algorithm using 3D fully-convolutional neural networks , 2018, Physics in medicine and biology.

[35]  Hojin Kim,et al.  Fluence-map generation for prostate intensity-modulated radiotherapy planning using a deep-neural-network , 2019, Scientific Reports.

[36]  Kuo Men,et al.  A feasibility study on an automated method to generate patient‐specific dose distributions for radiotherapy using deep learning , 2018, Medical physics.

[37]  Bo Liu,et al.  Improving CBCT Quality to CT Level using Deep-Learning with Generative Adversarial Network. , 2020, Medical physics.

[38]  G. Pereira,et al.  The Role of Imaging in Radiation Therapy Planning: Past, Present, and Future , 2014, BioMed research international.

[39]  A. Jemal,et al.  Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries , 2021, CA: a cancer journal for clinicians.

[40]  Kevin Gimpel,et al.  Gaussian Error Linear Units (GELUs) , 2016 .

[41]  Katia Parodi,et al.  Development of the open‐source dose calculation and optimization toolkit matRad , 2017, Medical physics.

[42]  Alex Lallement,et al.  Survey on deep learning for radiotherapy , 2018, Comput. Biol. Medicine.

[43]  B. Raaymakers,et al.  DeepDose: a robust deep learning-based dose engine for abdominal tumours in a 1.5 T MRI radiotherapy system , 2021, Physics in medicine and biology.

[44]  T. Nyholm,et al.  A review of substitute CT generation for MRI-only radiation therapy , 2017, Radiation oncology.

[45]  E. Hall,et al.  Radiation oncology: a century of achievements , 2004, Nature Reviews Cancer.

[46]  Levent Sagun,et al.  ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases , 2021, ICML.

[47]  L. Xing,et al.  Deep DoseNet: A deep neural network for accurate dosimetric transformation between different spatial resolutions and/or different dose calculation algorithms for precision radiation therapy. , 2019, Physics in medicine and biology.

[48]  Mark Chen,et al.  Language Models are Few-Shot Learners , 2020, NeurIPS.

[49]  Yang Lei,et al.  Cone-beam CT-derived relative stopping power map generation via deep learning for proton radiotherapy. , 2020, Medical physics.

[50]  U. Köthe,et al.  Long short-term memory networks for proton dose calculation in highly heterogeneous tissues. , 2020, Medical physics.

[51]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.