Automatic detection of unstained viable cells in bright field images using a support vector machine with an improved training procedure

Detection of unstained viable cells in bright field images is an inherently difficult task due to the immense variability of cell appearance. Traditionally, it has required human observers. However, in high-throughput robotic systems, an automatic procedure is essential. In this paper, we formulate viable cell detection as a supervised, binary pattern recognition problem and show that a support vector machine (SVM) with an improved training algorithm provides highly effective cell identification. In the case of cell detection, the binary classification problem generates two classes, one of which is much larger than the other. In addition, the total number of samples is extremely large. This combination represents a difficult problem for SVMs. We solved this problem with an iterative training procedure ("Compensatory Iterative Sample Selection", CISS). This procedure, which was systematically studied under various class size ratios and overlap conditions, was found to outperform several commonly used methods, primarily owing to its ability to choose the most representative samples for the decision boundary. Its speed and accuracy are sufficient for use in a practical system.

[1]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[2]  Borivoj Vojnovic,et al.  An image analysis‐based approach for automated counting of cancer cell nuclei in tissue sections , 2003, Cytometry. Part A : the journal of the International Society for Analytical Cytology.

[3]  P J Sjöström,et al.  Artificial neural network-aided image analysis system for cell counting. , 1999, Cytometry.

[4]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[5]  D Haussler,et al.  Knowledge-based analysis of microarray gene expression data by using support vector machines. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  Joseph Drish,et al.  Obtaining Calibrated Probability Estimates from Support Vector Machines , 2001 .

[7]  Tomaso Poggio,et al.  A Trainable Object Detection System: Car Detection in Static Images , 1999 .

[8]  Federico Girosi,et al.  Support Vector Machines: Training and Applications , 1997 .

[9]  Helge J. Ritter,et al.  A neural classifier enabling high-throughput topological analysis of lymphocytes in tissue sections , 2001, IEEE Transactions on Information Technology in Biomedicine.

[10]  B. Erlanger,et al.  Routine large-scale production of monoclonal antibodies in a protein-free culture medium. , 1983, Journal of immunological methods.

[11]  Nello Cristianini,et al.  The Application of Support Vector Machines to Medical decision Support: A Case Study , 1999 .

[12]  B. B. Mishell,et al.  Selected Methods in Cellular Immunology , 1980 .

[13]  M. Turk,et al.  Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[14]  Nello Cristianini,et al.  The Kernel-Adatron Algorithm: A Fast and Simple Learning Procedure for Support Vector Machines , 1998, ICML.

[15]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[16]  Fumihito Arai,et al.  Cell Recognition by Image Processing : Recognition of Dead or Living Plant Cells by Neural Network , 1994 .

[17]  Helge J. Ritter,et al.  Human vs. machine: evaluation of fluorescence micrographs , 2003, Comput. Biol. Medicine.

[18]  D P Chakraborty,et al.  Maximum likelihood analysis of free-response receiver operating characteristic (FROC) data. , 1989, Medical physics.

[19]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[20]  Peng Xu,et al.  Support vector machines for multi-class signal classification with unbalanced samples , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[21]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[22]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[23]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[24]  Xi Long,et al.  A new preprocessing approach for cell recognition , 2005, IEEE Transactions on Information Technology in Biomedicine.

[25]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[26]  Katharina Morik,et al.  Combining Statistical Learning with a Knowledge-Based Approach - A Case Study in Intensive Care Monitoring , 1999, ICML.