On automated Flow Cytometric analysis for MRD estimation of Acute Lymphoblastic Leukaemia: A comparison among different approaches

Minimal Residual Disease (MRD) is a powerful risk-stratification marker in the treatment of primary and relapsed childhood Acute Lymphoblastic Leukemia. Flow Cytometry is a fast and sensitive method to detect MRD, however, the interpretation of these multi-parametric data demands intensive operator training and experience. In this paper we propose a comparison among three different approaches for the automatic detection of leukemic cells in FCM data samples. The first approach is a fully-discriminative baseline based on the Support Vector Machine. The second approach is divided in two phases, the first is an unsupervised feature learning using a Stacked Auto-Encoder Neural Network, the second phase consists of exploiting the previously trained Neural Network to perform inference on the data classification. The third approach is fully generative and based on the Gaussian Mixture Model estimation and Bayes decision. Results show that a generative model provides better accuracy on MRD estimation with respect to the other two approaches. All samples are collected by national diagnostic reference centers for children with ALL for medical purposes enrolled in the current treatment protocol AIEOP-BFM-2009.

[1]  Leslie L Robison,et al.  Acute lymphoblastic leukaemia , 2018, Radiopaedia.org.

[2]  Rainer Spang,et al.  Automated in-silico detection of cell populations in flow cytometry readouts and its application to leukemia disease monitoring , 2006, BMC Bioinformatics.

[3]  J J Shuster,et al.  Minimal residual disease detection in childhood precursor–B-cell acute lymphoblastic leukemia: relation to other risk factors. A Children's Oncology Group study , 2003, Leukemia.

[4]  Arndt Borkhardt,et al.  Use of allogeneic hematopoietic stem-cell transplantation based on minimal residual disease response improves outcomes for children with relapsed acute lymphoblastic leukemia in the intermediate-risk group. , 2013, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[5]  Michael N Dworzak,et al.  Prognostic significance and modalities of flow cytometric minimal residual disease detection in childhood acute lymphoblastic leukemia. , 2002, Blood.

[6]  Subhransu Maji,et al.  Deep filter banks for texture recognition and segmentation , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[8]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[9]  Martin Stanulla,et al.  Treatment of childhood acute lymphoblastic leukemia. , 2009, Seminars in hematology.

[10]  Iftekhar Naim,et al.  SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 1: Algorithm Design , 2014, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[11]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[12]  Maria Grazia Valsecchi,et al.  Risk of relapse of childhood acute lymphoblastic leukemia is predicted by flow cytometric measurement of residual disease on day 15 bone marrow. , 2009, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[13]  Nicu Sebe,et al.  The S-HOCK dataset: Analyzing crowds at the stadium , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Dario Campana,et al.  Minimal residual disease in acute lymphoblastic leukemia. , 2010, Hematology. American Society of Hematology. Education Program.

[15]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[16]  Arvind Gupta,et al.  Data reduction for spectral clustering to analyze high throughput flow cytometry data , 2010, BMC Bioinformatics.