Analysis of Selected Evolutionary Algorithms in Feature Selection and Parameter Optimization for Data Based Tumor Marker Modeling

In this paper we report on the use of evolutionary algorithms for optimizing the identification of classification models for selected tumor markers. Our goal is to identify mathematical models that can be used for classifying tumor marker values as normal or as elevated; evolutionary algorithms are used for optimizing the parameters for learning classification models. The sets of variables used as well as the parameter settings for concrete modeling methods are optimized using evolution strategies and genetic algorithms. The performance of these algorithms is analyzed as well as the population diversity progress. In the empirical part of this paper we document modeling results achieved for tumor markers CA 125 and CYFRA using a medical data base provided by the Central Laboratory of the General Hospital Linz; empirical tests are executed using HeuristicLab.

[1]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[2]  P. Lee,et al.  Evaluation of cytokeratin 19 fragment (CYFRA 21-1) as a tumor marker in malignant pleural effusion. , 1999, Japanese journal of clinical oncology.

[3]  David G. Stork,et al.  Pattern Classification (2nd ed.) , 1999 .

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Stephan M. Winkler,et al.  Genetic Algorithms and Genetic Programming - Modern Concepts and Practical Applications , 2009 .

[6]  David G. Stork,et al.  Pattern Classification , 1973 .

[7]  Enrique Alba,et al.  Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms , 2007, 2007 IEEE Congress on Evolutionary Computation.

[8]  Witold Jacak,et al.  Classification of tumor marker values using heuristic data mining methods , 2010, GECCO '10.

[9]  N Osman,et al.  Correlation of serum CA125 with stage, grade and survival of patients with epithelial ovarian cancer at a single centre. , 2008, Irish medical journal.

[10]  B. Yin,et al.  Ovarian cancer antigen CA125 is encoded by the MUC16 mucin gene , 2002, International journal of cancer.

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  J A Koepke,et al.  Molecular marker test standardization , 1992, Cancer.

[13]  Dipl. Ing. Karl Heinz Kellermayer NUMERISCHE OPTIMIERUNG VON COMPUTER-MODELLEN MITTELS DER EVOLUTIONSSTRATEGIE Hans-Paul Schwefel Birkhäuser, Basel and Stuttgart, 1977 370 pages Hardback SF/48 ISBN 3-7643-0876-1 , 1977 .

[14]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[15]  Witold Jacak,et al.  Feature selection in the analysis of tumor marker data using evolutionary algorithms , 2010 .

[16]  O. Nelles Nonlinear System Identification , 2001 .

[17]  Joachim Schneider,et al.  Cut-off-independent tumour marker evaluation using ROC approximation. , 2007, Anticancer research.