Computer-aided detection of rare tumor populations in flow cytometry: an example with classic Hodgkin lymphoma.

OBJECTIVES Diagnosing classical Hodgkin lymphoma (cHL) by flow cytometry (FC) relies on an observer gating rare populations of Hodgkin/Reed Sternberg (HRS) cells. Here, we apply machine-learning methods to aid in the detection of rare tumor cell populations using data derived from clinical FC analysis of cHL as a model disease. METHODS FC data from 144 clinical cases using a nine-color FC reagent panel were analyzed using Python 2.7 and the "scikit-learn" module. RESULTS Seventy-eight 50 × 50 two-dimensional histograms were generated from routine FC data and a reciprocal power function applied to favor rare events. Data were classified by support vector machine (SVM), gradient boosting, and random forest classifiers. All three classifiers showed no statistical difference in performance, with 89%-92% accuracy on cross-validation. Nearly all classifiers misclassified the same set of cases, with more false-positive than false-negative cases. Dimensionality reduction by ensemble methods selected for data points in a CD5+/ CD40+/CD64- region. CONCLUSIONS All classifiers provide probabilistic confidences for each result, and diagnostic cutoffs can be chosen to minimize false negatives and serve as a screening tool. Computational exclusion of manually gated HRS cells had little impact on the overall performance of selected support vectors in SVM or dimensionality reduction, suggesting that features of the immune response in cHL may dictate the method accuracy. We hypothesize there are distinct inflammatory cells that suggest cHL.

[1]  B. Wood,et al.  Identification and purification of classical Hodgkin cells from lymph nodes by flow cytometry and flow cytometric cell sorting. , 2006, American journal of clinical pathology.

[2]  W. Chan The Reed–Sternberg cell in classical Hodgkin's disease , 2001, Hematological oncology.

[3]  Nitin J. Karandikar,et al.  Overexpression of CD7 in classical Hodgkin lymphoma‐infiltrating T lymphocytes , 2009, Cytometry. Part B, Clinical cytometry.

[4]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[5]  William G. Finn,et al.  Beyond gating: capturing the power of flow cytometry. , 2009, American journal of clinical pathology.

[6]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[7]  B. Wood,et al.  A six-color flow cytometry assay for immunophenotyping classical Hodgkin lymphoma in lymph nodes. , 2014, American journal of clinical pathology.

[8]  Greg Finak,et al.  Critical assessment of automated flow cytometry data analysis techniques , 2013, Nature Methods.

[9]  W. Symmans,et al.  Fine‐needle aspiration cytology of Hodgkin disease , 2001, Cancer.

[10]  Anju Thomas,et al.  Flow cytometry can diagnose classical hodgkin lymphoma in lymph nodes with high sensitivity and specificity. , 2009, American journal of clinical pathology.

[11]  Marc D. Smith,et al.  Detection of T‐regulatory cells has a potential role in the diagnosis of classical Hodgkin lymphoma , 2008, Cytometry. Part B, Clinical cytometry.

[12]  B. Wood,et al.  Increased expression of T cell antigens on T cells in classical Hodgkin lymphoma , 2010, Cytometry. Part B, Clinical cytometry.

[13]  S. Hudnall,et al.  Comparative flow immunophenotypic features of the inflammatory infiltrates of Hodgkin lymphoma and lymphoid hyperplasia , 2008, Cytometry. Part B, Clinical cytometry.

[14]  Travis E. Oliphant,et al.  Python for Scientific Computing , 2007, Computing in Science & Engineering.

[15]  Tudung T Nguyen,et al.  Combined core needle biopsy and fine-needle aspiration with ancillary studies correlate highly with traditional techniques in the diagnosis of nodal-based lymphoma. , 2011, American journal of clinical pathology.