Application of Support Vector Machine and k-means clustering algorithms for robust chronic lymphocytic leukemia color cell segmentation

Chronic lymphocytic leukemia (CLL) is the most common type of blood cancer in Canadian adults. The relative 5-year survival rates for CLL in Canada is decreasing. CLL cell morphology maybe similar to normal lymphocytes and require a hematopathologist examination for diagnosis. There are a low number of related works on image analysis in CLL. This paper focuses on lymphocyte color cell segmentation using Support Vector Machine (SVM) and k-means clustering algorithms. The algorithm overcomes the occlusion problem when lymphocytes are tightly bound to the surrounding Red Blood Cells. Over and under-segmentation problems are significantly reduced. In this paper we used 440 lymphocyte images (normal and CLL), in which 140 images are used for segmentation accuracy measurement and 12 images for SVM training. The algorithm obtained 98.43% maximum accuracy for nucleus segmentation, and 98.69% for cell segmentation. The cytoplasm region can be extracted by 99.85% maximum accuracy with simple mask subtraction.

[1]  Nicholas Clintona,et al.  AN ACCURACY ASSESSMENT MEASURE FOR OBJECT BASED IMAGE SEGMENTATION , 2008 .

[2]  Adam Krzyzak,et al.  Automatic Clinical Image Segmentation Using Pathological Modelling, PCA and SVM , 2005, MLDM.

[3]  Libo Zeng,et al.  A method based on multispectral imaging technique for White Blood Cell segmentation , 2007, Comput. Biol. Medicine.

[4]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[5]  S. Gibson,et al.  High incidence of chronic lymphocytic leukemia (CLL) diagnosed by immunophenotyping: a population-based Canadian cohort. , 2009, Leukemia research.

[6]  Sameem Abdul Kareem,et al.  An Image Processing Application for the Localization and Segmentation of Lymphoblast Cell Using Peripheral Blood Images , 2012, Journal of Medical Systems.

[7]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[8]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[9]  Fionn Murtagh,et al.  Cluster Dissection and Analysis: Theory, Fortran Programs, Examples. , 1986 .

[10]  Guo-Can Feng,et al.  Kernel Bisecting k-means clustering for SVM training sample reduction , 2008, 2008 19th International Conference on Pattern Recognition.

[12]  Qi Li,et al.  A Chunking Method for Euclidean Distance Matrix Calculation on Large Dataset Using Multi-GPU , 2010, 2010 Ninth International Conference on Machine Learning and Applications.