Machine learning identifies a profile of inadequate responder to methotrexate in rheumatoid arthritis

Abstract Objectives Around 30% of patients with RA have an inadequate response to MTX. We aimed to use routine clinical and biological data to build machine learning models predicting EULAR inadequate response to MTX and to identify simple predictive biomarkers. Methods Models were trained on RA patients fulfilling the 2010 ACR/EULAR criteria from the ESPOIR and Leiden EAC cohorts to predict the EULAR response at 9 months (± 6 months). Several models were compared on the training set using the AUROC. The best model was evaluated on an external validation cohort (tREACH). The model's predictions were explained using Shapley values to extract a biomarker of inadequate response. Results We included 493 therapeutic sequences from ESPOIR, 239 from EAC and 138 from tREACH. The model selected DAS28, Lymphocytes, Creatininemia, Leucocytes, AST, ALT, swollen joint count and corticosteroid co-treatment as predictors. The model reached an AUROC of 0.72 [95% CI (0.63, 0.80)] on the external validation set, where 70% of patients were responders to MTX. Patients predicted as inadequate responders had only 38% [95% CI (20%, 58%)] chance to respond and using the algorithm to decide to initiate MTX would decrease inadequate-response rate from 30% to 23% [95% CI: (17%, 29%)]. A biomarker was identified in patients with moderate or high activity (DAS28 > 3.2): patients with a lymphocyte count superior to 2000 cells/mm3 are significantly less likely to respond. Conclusion Our study highlights the usefulness of machine learning in unveiling subgroups of inadequate responders to MTX to guide new therapeutic strategies. Further work is needed to validate this approach.

[1]  A. Athreya,et al.  Clinical predictors of response to methotrexate in patients with rheumatoid arthritis: a machine learning approach using clinical trial data , 2022, Arthritis Research & Therapy.

[2]  J. C. Nieto‐González,et al.  Minimal disease activity (MDA) in patients with recent-onset psoriatic arthritis: predictive model based on machine learning , 2022, Arthritis Research & Therapy.

[3]  P. Cournède,et al.  Machine learning predicts response to TNF inhibitors in rheumatoid arthritis: results on the ESPOIR and ABIRISK cohorts , 2022, RMD Open.

[4]  A. Mobasheri,et al.  Osteoarthritis endotype discovery via clustering of biochemical marker data , 2022, Annals of the Rheumatic Diseases.

[5]  F. Markowetz,et al.  Multi-omic machine learning predictor of breast cancer therapy response , 2021, Nature.

[6]  L. Breiman Random Forests , 2001, Machine Learning.

[7]  Lara A. Kahale,et al.  2021 American College of Rheumatology Guideline for the Treatment of Rheumatoid Arthritis , 2021, Arthritis & rheumatology.

[8]  D. Ziemek,et al.  What Is the Persistence to Methotrexate in Rheumatoid Arthritis, and Does Machine Learning Outperform Hypothesis‐Based Approaches to Its Prediction? , 2021, ACR open rheumatology.

[9]  G. Collins,et al.  Minimum sample size for external validation of a clinical prediction model with a binary outcome , 2021, Statistics in medicine.

[10]  A. Darzi,et al.  Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis , 2021, npj Digital Medicine.

[11]  Gijs Geleijnse,et al.  Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival , 2021, Scientific Reports.

[12]  M. Hoogendoorn,et al.  Complex Machine-Learning Algorithms and Multivariable Logistic Regression on Par in the Prediction of Insufficient Clinical Response to Methotrexate in Rheumatoid Arthritis , 2021, Journal of personalized medicine.

[13]  Tsutomu Takeuchi,et al.  EULAR recommendations for the management of rheumatoid arthritis with synthetic and biological disease-modifying antirheumatic drugs: 2019 update , 2020, Annals of the Rheumatic Diseases.

[14]  Stephen C. J. Parker,et al.  Machine Learning to Predict Anti–Tumor Necrosis Factor Drug Responses of Rheumatoid Arthritis Patients by Integrating Clinical and Genetic Markers , 2019, Arthritis & rheumatology.

[15]  Mustafa Suleyman,et al.  Key challenges for delivering clinical impact with artificial intelligence , 2019, BMC Medicine.

[16]  Jianying Hu,et al.  Artificial intelligence and machine learning in clinical development: a translational perspective , 2019, npj Digital Medicine.

[17]  N. Wulffraat,et al.  Development and validation of a prognostic multivariable model to predict insufficient clinical response to methotrexate in rheumatoid arthritis , 2018, PloS one.

[18]  Anna Veronika Dorogush,et al.  CatBoost: gradient boosting with categorical features support , 2018, ArXiv.

[19]  A. Barton,et al.  Prediction of primary non-response to methotrexate therapy using demographic, clinical and psychosocial variables: results from the UK Rheumatoid Arthritis Medication Study (RAMS) , 2018, Arthritis Research & Therapy.

[20]  P. Dönnes,et al.  Methotrexate and BAFF interaction prevents immunization against TNF inhibitors , 2018, Annals of the rheumatic diseases.

[21]  A. Barton,et al.  Prediction of response to methotrexate in rheumatoid arthritis , 2018, Expert review of clinical immunology.

[22]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[23]  J. Isaacs,et al.  Mechanism of action of methotrexate in rheumatoid arthritis, and the search for biomarkers , 2016, Nature Reviews Rheumatology.

[24]  H. B. Hammer,et al.  Discordance between tender and swollen joint count as well as patient's and evaluator's global assessment may reduce likelihood of remission in patients with rheumatoid arthritis and psoriatic arthritis: data from the prospective multicentre NOR-DMARD study , 2016, Annals of the rheumatic diseases.

[25]  Matti Pirinen,et al.  Crowdsourced assessment of common genetic contribution to predicting anti-TNF treatment response in rheumatoid arthritis , 2016, Nature Communications.

[26]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[27]  H. Canhão,et al.  Old drugs, old problems: where do we stand in prediction of rheumatoid arthritis responsiveness to methotrexate and other synthetic DMARDs? , 2013, BMC Medicine.

[28]  Geoffrey I. Webb,et al.  Encyclopedia of Machine Learning , 2011, Encyclopedia of Machine Learning.

[29]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[30]  S. Saevarsdottir,et al.  Predictors of response to methotrexate in early DMARD naïve rheumatoid arthritis: results from the initial open-label phase of the SWEFOT trial , 2010, Annals of the rheumatic diseases.

[31]  A. Silman,et al.  UvA-DARE (Digital Academic Repository) 2010 Rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative Aletaha, , 2010 .

[32]  Yen-Wei Chen,et al.  Feature Selection Using Recursive Feature Elimination for Handwritten Digit Recognition , 2009, 2009 Fifth International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[33]  M. Dougados,et al.  The ESPOIR cohort: a ten-year follow-up of early arthritis in France: methodology and baseline characteristics of the 813 included patients. , 2007, Joint, bone, spine : revue du rhumatisme.

[34]  P. V. van Riel,et al.  The Disease Activity Score and the EULAR response criteria. , 2005, Clinical and experimental rheumatology.

[35]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[36]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[37]  S. F. Buck A Method of Estimation of Missing Values in Multivariate Data Suitable for Use with an Electronic Computer , 1960 .

[38]  Raveendhara R. Bannuru,et al.  American College of Rheumatology Guideline for the Treatment of Rheumatoid Arthritis , 2015 .

[39]  Rachel Knevel,et al.  Predicting arthritis outcomes--what can be learned from the Leiden Early Arthritis Clinic? , 2011, Rheumatology.

[40]  Jiawei Han,et al.  K-Means Clustering , 2021, Learn Data Mining Through Excel.

[41]  Trevor Hastie,et al.  Model Assessment and Selection , 2009 .