Development and validation of machine learning models for predicting prognosis and guiding individualized postoperative chemotherapy: A real-world study of distal cholangiocarcinoma

Background Distal cholangiocarcinoma (dCCA), originating from the common bile duct, is greatly associated with a dismal prognosis. A series of different studies based on cancer classification have been developed, aimed to optimize therapy and predict and improve prognosis. In this study, we explored and compared several novel machine learning models that might lead to an improvement in prediction accuracy and treatment options for patients with dCCA. Methods In this study, 169 patients with dCCA were recruited and randomly divided into the training cohort (n = 118) and the validation cohort (n = 51), and their medical records were reviewed, including survival outcomes, laboratory values, treatment strategies, pathological results, and demographic information. Variables identified as independently associated with the primary outcome by least absolute shrinkage and selection operator (LASSO) regression, the random survival forest (RSF) algorithm, and univariate and multivariate Cox regression analyses were introduced to establish the following different machine learning models and canonical regression model: support vector machine (SVM), SurvivalTree, Coxboost, RSF, DeepSurv, and Cox proportional hazards (CoxPH). We measured and compared the performance of models using the receiver operating characteristic (ROC) curve, integrated Brier score (IBS), and concordance index (C-index) following cross-validation. The machine learning model with the best performance was screened out and compared with the TNM Classification using ROC, IBS, and C-index. Finally, patients were stratified based on the model with the best performance to assess whether they benefited from postoperative chemotherapy through the log-rank test. Results Among medical features, five variables, including tumor differentiation, T-stage, lymph node metastasis (LNM), albumin-to-fibrinogen ratio (AFR), and carbohydrate antigen 19-9 (CA19-9), were used to develop machine learning models. In the training cohort and the validation cohort, C-index achieved 0.763 vs. 0.686 (SVM), 0.749 vs. 0.692 (SurvivalTree), 0.747 vs. 0.690 (Coxboost), 0.745 vs. 0.690 (RSF), 0.746 vs. 0.711 (DeepSurv), and 0.724 vs. 0.701 (CoxPH), respectively. The DeepSurv model (0.823 vs. 0.754) had the highest mean area under the ROC curve (AUC) than other models, including SVM (0.819 vs. 0.736), SurvivalTree (0.814 vs. 0.737), Coxboost (0.816 vs. 0.734), RSF (0.813 vs. 0.730), and CoxPH (0.788 vs. 0.753). The IBS of the DeepSurv model (0.132 vs. 0.147) was lower than that of SurvivalTree (0.135 vs. 0.236), Coxboost (0.141 vs. 0.207), RSF (0.140 vs. 0.225), and CoxPH (0.145 vs. 0.196). Results of the calibration chart and decision curve analysis (DCA) also demonstrated that DeepSurv had a satisfactory predictive performance. In addition, the performance of the DeepSurv model was better than that of the TNM Classification in C-index, mean AUC, and IBS (0.746 vs. 0.598, 0.823 vs. 0.613, and 0.132 vs. 0.186, respectively) in the training cohort. Patients were stratified and divided into high- and low-risk groups based on the DeepSurv model. In the training cohort, patients in the high-risk group would not benefit from postoperative chemotherapy (p = 0.519). In the low-risk group, patients receiving postoperative chemotherapy might have a better prognosis (p = 0.035). Conclusions In this study, the DeepSurv model was good at predicting prognosis and risk stratification to guide treatment options. AFR level might be a potential prognostic factor for dCCA. For the low-risk group in the DeepSurv model, patients might benefit from postoperative chemotherapy.

[1]  Yuanlong Gu,et al.  Preoperative peripheral blood inflammatory markers especially the fibrinogen-to-lymphocyte ratio and novel FLR-N score predict the prognosis of patients with early-stage resectable extrahepatic cholangiocarcinoma , 2022, Frontiers in Oncology.

[2]  Laurent Guyon,et al.  Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models , 2022, bioRxiv.

[3]  Jingyu Wu,et al.  Development and validation of a deep learning model to predict survival of patients with esophageal cancer , 2022, Frontiers in Oncology.

[4]  Jinzhou Zhu,et al.  Development of a Deep Learning Model for Malignant Small Bowel Tumors Survival: A SEER-Based Study , 2022, Diagnostics.

[5]  Y. Yoon,et al.  Predicting Long-Term Mortality in Patients with Acute Heart Failure Using Machine Learning. , 2022, Journal of cardiac failure.

[6]  Cheng-Hong Yang,et al.  Machine learning approaches for the mortality risk assessment of patients undergoing hemodialysis , 2022, Therapeutic advances in chronic disease.

[7]  B. Teh,et al.  Cholangiocarcinoma , 2021, Nature Reviews Disease Primers.

[8]  Haitao Zhao,et al.  Mutational spectrum and precision oncology for biliary tract carcinoma , 2021, Theranostics.

[9]  D. Sia,et al.  Cell of origin in biliary tract cancers and clinical implications , 2021, JHEP reports : innovation in hepatology.

[10]  A. Zhu,et al.  Biliary tract cancer , 2021, The Lancet.

[11]  P. Rodrigues,et al.  Pathogenesis of Cholangiocarcinoma. , 2020, Annual review of pathology.

[12]  Chuan Hu,et al.  Risk factors and nomogram for newly diagnosis of bone metastasis in bladder cancer , 2020, Medicine.

[13]  G. Gores,et al.  Cholangiocarcinoma 2020: the next horizon in mechanisms and management , 2020, Nature Reviews Gastroenterology & Hepatology.

[14]  Wei Liu,et al.  Prognostic value of combined preoperative gamma-glutamyl transpeptidase to platelet ratio and fibrinogen in patients with HBV-related hepatocellular carcinoma after hepatectomy. , 2020, American journal of translational research.

[15]  B. Barlogie,et al.  Longer term follow-up of the randomized phase III trial SWOG S0777: bortezomib, lenalidomide and dexamethasone vs. lenalidomide and dexamethasone in patients (Pts) with previously untreated multiple myeloma without an intent for immediate autologous stem cell transplant (ASCT) , 2020, Blood Cancer Journal.

[16]  T. Pawlik,et al.  Molecular classification and therapeutic targets in extrahepatic cholangiocarcinoma. , 2020, Journal of hepatology.

[17]  Xinting Sang,et al.  CeRNA regulatory network-based analysis to study the roles of noncoding RNAs in the pathogenesis of intrahepatic cholangiocellular carcinoma , 2020, Aging.

[18]  Yingjie Tian,et al.  Joint Ranking SVM and Binary Relevance with Robust Low-Rank Learning for Multi-Label Classification , 2019, Neural Networks.

[19]  Jin Gu,et al.  Transcriptomic analysis and identification of prognostic biomarkers in cholangiocarcinoma , 2019, Oncology reports.

[20]  A. Forner,et al.  Clinical presentation, diagnosis and staging of cholangiocarcinoma , 2019, Liver international : official journal of the International Association for the Study of the Liver.

[21]  Jenni A. M. Sidey-Gibbons,et al.  Machine learning in medicine: a practical introduction , 2019, BMC Medical Research Methodology.

[22]  B. Barlogie,et al.  Longer term follow-up of the randomized phase III trial SWOG S0777: bortezomib, lenalidomide and dexamethasone vs. lenalidomide and dexamethasone in patients (Pts) with previously untreated multiple myeloma without an intent for immediate autologous stem cell transplant (ASCT) , 2018, Blood Cancer Journal.

[23]  Jian-yong Li,et al.  Albumin-to-Fibrinogen Ratio as an Independent Prognostic Parameter in Untreated Chronic Lymphocytic Leukemia: A Retrospective Study of 191 Cases , 2018, Cancer research and treatment : official journal of Korean Cancer Association.

[24]  Juping Du,et al.  Prognostic significance of pre-resection albumin/fibrinogen ratio in patients with non-small cell lung cancer: A propensity score matching analysis. , 2018, Clinica chimica acta; international journal of clinical chemistry.

[25]  O. Abe,et al.  Predicting prognosis of resected hepatocellular carcinoma by radiomics analysis with random survival forest. , 2018, Diagnostic and interventional imaging.

[26]  G. Song,et al.  Prognostic comparison of the 7th and 8th editions of the American Joint Committee on Cancer staging system for intrahepatic cholangiocarcinoma , 2018, Journal of hepato-biliary-pancreatic sciences.

[27]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[28]  G. Gores,et al.  Cholangiocarcinoma — evolving concepts and therapeutic strategies , 2018, Nature Reviews Clinical Oncology.

[29]  J. Piulats,et al.  Prognostic Factors and Decision Tree for Long-Term Survival in Metastatic Uveal Melanoma , 2017, Cancer research and treatment : official journal of Korean Cancer Association.

[30]  S. Theocharis,et al.  Clinical Value of Nutritional Status in Cancer: What is its Impact and how it Affects Disease Progression and Prognosis? , 2017, Nutrition and cancer.

[31]  Sohee Oh,et al.  Prognostic Influence of Preoperative Fibrinogen to Albumin Ratio for Breast Cancer , 2017, Journal of breast cancer.

[32]  Yun Wang,et al.  Fibrinogen promotes malignant biological tumor behavior involving epithelial–mesenchymal transition via the p-AKT/p-mTOR pathway in esophageal squamous cell carcinoma , 2017, Journal of Cancer Research and Clinical Oncology.

[33]  G. Spolverato,et al.  Comparative performances of the 7th and the 8th editions of the American Joint Committee on Cancer staging systems for intrahepatic cholangiocarcinoma , 2017, Journal of surgical oncology.

[34]  C. Compton,et al.  The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population‐based to a more “personalized” approach to cancer staging , 2017, CA: a cancer journal for clinicians.

[35]  Vickie Baracos,et al.  ESPEN guidelines on nutrition in cancer patients. , 2017, Clinical nutrition.

[36]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[37]  T. Gruenberger,et al.  Biliary cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. , 2016, Annals of oncology : official journal of the European Society for Medical Oncology.

[38]  K. Boberg,et al.  Expert consensus document: Cholangiocarcinoma: current knowledge and future perspectives consensus statement from the European Network for the Study of Cholangiocarcinoma (ENS-CCA) , 2016, Nature Reviews Gastroenterology &Hepatology.

[39]  Birgit Kasch,et al.  Next Generation , 2005, Im OP.

[40]  Y. Toiyama,et al.  Clinical impact of preoperative albumin to globulin ratio in gastric cancer patients with curative intent. , 2016, American journal of surgery.

[41]  Jignesh R. Parikh,et al.  Reverse Engineering and Evaluation of Prediction Models for Progression to Type 2 Diabetes , 2016, Journal of diabetes science and technology.

[42]  J. Gu,et al.  Prognostic significance of pretreatment serum levels of albumin, LDH and total bilirubin in patients with non-metastatic breast cancer. , 2015, Carcinogenesis.

[43]  T. Therneau,et al.  A New Clinically Based Staging System for Perihilar Cholangiocarcinoma , 2014, The American Journal of Gastroenterology.

[44]  H. Weng,et al.  Clinical and prognostic significance of preoperative plasma hyperfibrinogenemia in gallbladder cancer patients following surgical resection: a retrospective and in vitro study , 2014, BMC Cancer.

[45]  J. Bruix,et al.  “Very Early” Intrahepatic Cholangiocarcinoma in Cirrhotic Patients: Should Liver Transplantation Be Reconsidered in These Patients? , 2014, American journal of transplantation : official journal of the American Society of Transplantation and the American Society of Transplant Surgeons.

[46]  Todd F. DeLuca,et al.  Use of machine learning to shorten observation-based screening and diagnosis of autism , 2012, Translational Psychiatry.

[47]  D. Hanahan,et al.  Hallmarks of Cancer: The Next Generation , 2011, Cell.

[48]  Steven C Cunningham,et al.  Cholangiocarcinoma: Thirty-one-Year Experience With 564 Patients at a Single Institution , 2007, Annals of surgery.

[49]  S. Taylor-Robinson,et al.  Risk Factors for Intrahepatic and Extrahepatic Cholangiocarcinoma: a systematic review and meta-analysis. , 2019, Journal of hepatology.

[50]  Yang Wang,et al.  Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. , 2018, Cancer genomics & proteomics.