Mining cancer data with discrete particle swarm optimization and rule pruning

Cancer is one of the most the gravest problems facing mankind. In 2008, it is estimated that over 7.6 million lives have been claimed by cancer. Early and precise detection plays a key role in treating the disease and improve survivability of patient. Among data classification algorithms, discrete particle swarm optimization (DPSO), a technique based on standard PSO has proved to be competitive in predicting breast cancer, and in this paper, we implement a classifier using DPSO with new rule pruning procedure for detecting lung cancer and breast cancer, which are the most common cancer for men and women. Experiment shows the new pruning method further improves the classification accuracy, and the new approach is effective in making cancer prediction.

[1]  A. Jemal,et al.  Global Cancer Statistics , 2011 .

[2]  Ziqiang Wang,et al.  A PSO-Based Classification Rule Mining Algorithm , 2009, ICIC.

[3]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[4]  Guo-Chang Gu,et al.  Research on particle swarm optimization: a review , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[5]  SousaTiago,et al.  Particle swarm based Data Mining Algorithms for classification tasks , 2004 .

[6]  A. Jemal,et al.  Global cancer statistics , 2011, CA: a cancer journal for clinicians.

[7]  Wei-Chang Yeh,et al.  A new hybrid approach for mining breast cancer pattern using discrete particle swarm optimization and statistical method , 2009, Expert Syst. Appl..

[8]  A. Miller,et al.  The importance of early symptom recognition in the context of early detection and cancer survival. , 2009, European journal of cancer.

[9]  Gloria E. Phillips-Wren,et al.  Mining lung cancer patient data to assess healthcare resource utilization , 2008, Expert Syst. Appl..

[10]  Andrew Kusiak,et al.  Cancer gene search with data-mining and genetic algorithms , 2007, Comput. Biol. Medicine.

[11]  Tiago Ferra de Sousa,et al.  Particle Swarm based Data Mining Algorithms for classification tasks , 2004, Parallel Comput..

[12]  Xiangyang Wang,et al.  Feature selection based on rough sets and particle swarm optimization , 2007, Pattern Recognit. Lett..

[13]  Steven Guan,et al.  Feature selection for modular GA-based classification , 2004, Appl. Soft Comput..

[14]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[15]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[16]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.