Analysis of clinical prognostic variables for Chronic Lymphocytic Leukemia decision-making problems

INTRODUCTION Chronic Lymphocytic Leukemia (CLL) is a disease with highly heterogeneous clinical course. A key goal is the prediction of patients with high risk of disease progression, which could benefit from an earlier or more intense treatment. In this work we introduce a simple methodology based on machine learning methods to help physicians in their decision making in different problems related to CLL. MATERIAL AND METHODS Clinical data belongs to a retrospective study of a cohort of 265 Caucasians who were diagnosed with CLL between 1997 and 2007 in Hospital Cabueñes (Asturias, Spain). Different machine learning methods were applied to find the shortest list of most discriminatory prognostic variables to predict the need of Chemotherapy Treatment and the development of an Autoimmune Disease. RESULTS Autoimmune disease occurrence was predicted with very high accuracy (>90%). Autoimmune disease development is currently an unpredictable severe complication of CLL. Chemotherapy Treatment has been predicted with a lower accuracy (80%). Risk analysis showed that the number of false positives and false negatives are well balanced. CONCLUSIONS Our study highlights the importance of prognostic variables associated with the characteristics of platelets, reticulocytes and natural killers, which are the main targets of the autoimmune haemolytic anemia and immune thrombocytopenia for autoimmune disease development, and also, the relevance of some clinical variables related with the immune characteristics of CLL patients that are not taking into account by current prognostic markers for predicting the need of chemotherapy. Because of its simplicity, this methodology could be implemented in spreadsheets.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[3]  Thomas Elboth,et al.  Flow noise reduction from superhydrophobic surfaces , 2012 .

[4]  Juan Luis Fernández-Martínez,et al.  From Bayes to Tarantola: New insights to understand uncertainty in inverse problems☆ , 2013 .

[5]  T J Hamblin,et al.  Unmutated Ig V(H) genes are associated with a more aggressive form of chronic lymphocytic leukemia. , 1999, Blood.

[6]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[7]  Feng Yang,et al.  Robust Feature Selection for Microarray Data Based on Multicriterion Fusion , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  M. Hallek,et al.  Serum beta(2)-microglobulin and serum thymidine kinase are independent predictors of progression-free survival in chronic lymphocytic leukemia and immunocytoma. , 1996, Leukemia & lymphoma.

[9]  Emili Montserrat,et al.  ZAP-70 expression as a surrogate for immunoglobulin-variable-region mutations in chronic lymphocytic leukemia. , 2003, The New England journal of medicine.

[10]  Andrea Acebes-Huerta,et al.  Prognostic significance of CD8 and CD4 T cells in chronic lymphocytic leukemia , 2010, Leukemia & lymphoma.

[11]  Michael Hallek,et al.  Guidelines for the diagnosis and treatment of chronic lymphocytic leukemia: a report from the International Workshop on Chronic Lymphocytic Leukemia updating the National Cancer Institute-Working Group 1996 guidelines. , 2008, Blood.

[12]  Enrique J. deAndrés-Galiana,et al.  Design of Biomedical Robots for the Analysis of Cancer, Neurodegenerative and Rare Diseases , 2015, ICMMI.

[13]  P. Davies The American heritage dictionary of the English language , 1981 .

[14]  Michael J. Tompkins,et al.  On the topography of the cost functional in linear and nonlinear inverse problems , 2012 .

[15]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[16]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[17]  Diana Nicoll Pocket Guide to Diagnostic Tests , 1992 .

[18]  D C Case,et al.  Clinical staging of chronic lymphocytic leukemia. , 1977, The Journal of the Maine Medical Association.

[19]  Blaz Zupan,et al.  Predictive data mining in clinical medicine: Current issues and guidelines , 2008, Int. J. Medical Informatics.

[20]  Hartmut Döhner,et al.  Molecular diagnostics in chronic lymphocytic leukemia – Pathogenetic and clinical implications , 2008, Leukemia & lymphoma.

[21]  Julian D Olden,et al.  Machine Learning Methods Without Tears: A Primer for Ecologists , 2008, The Quarterly Review of Biology.

[22]  J. Castillo,et al.  Plasma Cell Disorders. , 2016, Primary care.

[23]  Sudhir Gupta,et al.  Clinical and Immunological Features in IgM Deficiency , 2009, International Archives of Allergy and Immunology.

[24]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[25]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[26]  Mark J. Thomas,et al.  A new prognostic classification of chronic lymphocytic leukemia derived from a multivariate survival analysis , 1981, Cancer.

[27]  J. L. Fernández-Martínez,et al.  On the prediction of Hodgkin lymphoma treatment response , 2015, Clinical and Translational Oncology.

[28]  Enrique J. deAndrés-Galiana,et al.  Supervised Classification by Filter Methods and Recursive Feature Elimination Predicts Risk of Radiotherapy-Related Fatigue in Patients with Prostate Cancer , 2014, Cancer informatics.

[29]  Claude E. Shannon,et al.  The mathematical theory of communication , 1950 .

[30]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[31]  N. Chiorazzi,et al.  Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia. , 1999, Blood.

[32]  Steven L. Allen,et al.  Ig V gene mutation status and CD38 expression as novel prognostic indicators in chronic lymphocytic leukemia. , 1999, Blood.

[33]  John A. Swets,et al.  Signal Detection Theory and ROC Analysis in Psychology and Diagnostics: Collected Papers , 1996 .