Three Methods for Covering Missing Input Data in XCS

Missing data pose a potential threat to learning and classification in that they may compromise the ability of a system to develop robust, generalized models of the environment in which they operate. This investigation reports on the effects of three approaches to covering these data using an XCS-style learning classifier system. Using fabricated datasets representing a wide range of missing value densities, it was found that missing data do not appear to adversely affect LCS learning and classification performance. Furthermore, three types of missing value covering were found to exhibit similar efficiency on these data, with respect to convergence rate and classification accuracy.

[1]  Sandip Sen,et al.  Newboole: A Fast GBML System , 1990, ML.

[2]  Bor-Sen Chen,et al.  Spectral estimation under nature missing data , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Martin V. Butz,et al.  An algorithmic description of XCS , 2000, Soft Comput..

[4]  Dennis R. Durbin,et al.  The learning classifier system: an evolutionary computation approach to knowledge discovery in epidemiologic surveillance , 2000, Artif. Intell. Medicine.

[5]  David A. Bell,et al.  EDM: A General Framework for Data Mining Based on Evidence Theory , 1996, Data Knowl. Eng..

[6]  Fengzhan Tian,et al.  Incremental learning of Bayesian networks with hidden variables , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[7]  Marzena Kryszkiewicz,et al.  Association Rules in Incomplete Databases , 1999, PAKDD.

[8]  M Juhola,et al.  Comparison of Genetic Algorithms and Other Classification Methods in the Diagnosis of Female Urinary Incontinence , 1999, Methods of Information in Medicine.

[9]  Marzena Kryszkiewicz,et al.  Incomplete Database Issues for Representative Association Rules , 1999, ISMIS.

[10]  John H. Holmes,et al.  The Effect of Missing Data on Learning Classifier System Learning Rate and Classification Performance , 2002, IWLCS.

[11]  Martin V. Butz,et al.  An Algorithmic Description of XCS , 2000, IWLCS.

[12]  Ke Wang,et al.  Using Decision Tree Induction for Discovering Holes in Data , 1998, PRICAI.

[13]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[14]  Vincent Ng,et al.  Quantitative association rules over incomplete data , 1998, SMC'98 Conference Proceedings. 1998 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.98CH36218).