Survival Prediction Using Transformer-Based Categorical Feature Representation in the Treatment of Diffuse Large B-Cell Lymphoma

Diffuse large B-cell lymphoma (DLBCL) is a common and aggressive subtype of lymphoma, and accurate survival prediction is crucial for treatment decisions. This study aims to develop a robust survival prediction strategy to integrate various risk factors effectively, including clinical risk factors and Deauville scores in positron-emission tomography/computed tomography at different treatment stages using a deep-learning-based approach. We conduct a multi-institutional study on 604 DLBCL patients’ clinical data and validate the model on 220 patients from an independent institution. We propose a survival prediction model using transformer architecture and a categorical-feature-embedding technique that can handle high-dimensional and categorical data. Comparison with deep-learning survival models such as DeepSurv, CoxTime, and CoxCC based on the concordance index (C-index) and the mean absolute error (MAE) demonstrates that the categorical features obtained using transformers improved the MAE and the C-index. The proposed model outperforms the best-performing existing method by approximately 185 days in terms of the MAE for survival time estimation on the testing set. Using the Deauville score obtained during treatment resulted in a 0.02 improvement in the C-index and a 53.71-day improvement in the MAE, highlighting its prognostic importance. Our deep-learning model could improve survival prediction accuracy and treatment personalization for DLBCL patients.

[1]  G. Salles,et al.  Diffuse large B-cell lymphoma: new targets and novel therapies , 2021, Blood Cancer Journal.

[2]  Sheng-Chieh Chan,et al.  Prognostic Value of Baseline Radiomic Features of 18F-FDG PET in Patients with Diffuse Large B-Cell Lymphoma , 2020, Diagnostics.

[3]  Xin Huang,et al.  TabTransformer: Tabular Data Modeling Using Contextual Embeddings , 2020, ArXiv.

[4]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[5]  Noah Simon,et al.  OBLIQUE RANDOM SURVIVAL FORESTS. , 2019, The annals of applied statistics.

[6]  Ida Scheel,et al.  Time-to-Event Prediction with Neural Networks and Cox Regression , 2019, J. Mach. Learn. Res..

[7]  U. Jäger,et al.  CAR-T Cell Therapy in Diffuse Large B Cell Lymphoma: Hype and Hope , 2019, HemaSphere.

[8]  Marek J. Druzdzel,et al.  A Bayesian network interpretation of the Cox's proportional hazard model , 2018, Int. J. Approx. Reason..

[9]  T. El‐Galaly,et al.  FDG‐PET/CT in the management of lymphomas: current status and future directions , 2018, Journal of internal medicine.

[10]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[11]  Marek J. Druzdzel,et al.  Making Large Cox's Proportional Hazard Models Tractable in Bayesian Networks , 2016, Probabilistic Graphical Models.

[12]  J. Armitage,et al.  Non-Hodgkin lymphoma , 2012, The Lancet.

[13]  J. Min,et al.  Prognostic significance of interim ¹⁸F-FDG PET/CT after three or four cycles of R-CHOP chemotherapy in the treatment of diffuse large B-cell lymphoma. , 2011, European journal of cancer.

[14]  Michel Meignan,et al.  Report on the First International Workshop on interim-PET scan in lymphoma , 2009, Leukemia & lymphoma.

[15]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[16]  A. Urquhart,et al.  Hodgkin's and Non‐Hodgkin's Lymphoma of the Head and Neck , 2001, The Laryngoscope.

[17]  D. Sargent,et al.  Comparison of artificial neural networks with other statistical approaches , 2001, Cancer.

[18]  P. Lapuerta,et al.  Comparison of the performance of neural network methods and Cox regression for censored survival data , 2000 .

[19]  D Faraggi,et al.  A neural network model for survival data. , 1995, Statistics in medicine.

[20]  Emili Montserrat,et al.  A predictive model for aggressive non-Hodgkin's lymphoma. , 1993, The New England journal of medicine.

[21]  N Breslow,et al.  Estimators of the Mantel-Haenszel variance consistent in both sparse data and large-strata limiting models. , 1986, Biometrics.

[22]  N. Breslow,et al.  Analysis of Survival Data under the Proportional Hazards Model , 1975 .

[23]  Donald K. K. Lee,et al.  Theory and software for boosted nonparametric hazard estimation , 2021, SPACA.

[24]  Thomas Yu,et al.  Prediction of overall survival for patients with metastatic castration-resistant prostate cancer: development of a prognostic model through a crowdsourced challenge with open clinical trial data. , 2017, The Lancet. Oncology.

[25]  Svetlana Borovkova,et al.  Analysis of survival data , 2002 .

[26]  L. J. Wei,et al.  The Robust Inference for the Cox Proportional Hazards Model , 1989 .