Comparative Analysis of Feature Selection Methods for Blood Cell Recognition in Leukemia

This study analyses different methods of diagnostic feature selection in the problem of classification of the blood cells in leukemia. The analyzed methods belong to the wrapper and filter methods and cover wide range of approaches to feature selection problem. In particular they cover 7 methods, each of them working on different principle. As a results of this preprocessing stage we define the best (according to the applied method) set of features which is next used as the input for the Gaussian kernel SVM classifier. The last step of blood cell recognition is the integration of the results of application of all methods. The numerical results of experiments will be presented and analyzed.

[1]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.

[2]  G. McLachlan,et al.  Pattern Classification: A Unified View of Statistical and Neural Approaches. , 1998 .

[3]  Vipin Kumar,et al.  Introduction to Data Mining, (First Edition) , 2005 .

[4]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[5]  Andrzej Cichocki,et al.  Adaptive blind signal and image processing , 2002 .

[6]  Z. Cataltepe,et al.  A New PCA/ICA Based Feature Selection Method , 2007, 2007 IEEE 15th Signal Processing and Communications Applications.

[7]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[8]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[9]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[10]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[11]  José Luis Rojo-Álvarez,et al.  Kernel Methods in Bioengineering, Signal And Image Processing , 2007 .

[12]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[13]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[14]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[15]  Tomasz Markiewicz,et al.  Support Vector Machine for Recognition of White Blood Cells of Leukaemia , 2007 .

[16]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[17]  Andrzej Cichocki,et al.  Adaptive Blind Signal and Image Processing - Learning Algorithms and Applications , 2002 .

[18]  Alexander J. Smola,et al.  Learning with kernels , 1998 .