A Method for Handling Numerical Attributes in GA-Based Inductive Concept Learners

This paper proposes a method for dealing with numerical attributes in inductive concept learning systems based on genetic algorithms. The method uses constraints for restricting the range of values of the attributes and novel stochastic operators for modifying the constraints. These operators exploit information on the distribution of the values of an attribute. The method is embedded into a GA based system for inductive logic programming. Results of experiments on various data sets indicate that the method provides an effective local discretization tool for GA based inductive concept learners.

[1]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[2]  Marek Kretowski,et al.  An Evolutionary Algorithm Using Multivariate Discretization for Decision Rule Induction , 1999, PKDD.

[3]  Filippo Neri,et al.  Search-Intensive Concept Induction , 1995, Evolutionary Computation.

[4]  Federico Divina,et al.  Evolutionary Concept Learning , 2002, GECCO.

[5]  Jaume Bacardit,et al.  Evolution Of Adaptive Discretization Intervals For A Rule-based Genetic Learning System , 2002, GECCO.

[6]  Wim Van Laer From Propositional to First Order Logic in Machine Learning and Data Mining - Induction of first ord , 2002 .

[7]  Ron Kohavi,et al.  Error-Based and Entropy-Based Discretization of Continuous Features , 1996, KDD.

[8]  Ron Kohavi,et al.  Supervised and Unsupervised Discretization of Continuous Features , 1995, ICML.

[9]  Jaime G. Carbonell,et al.  An Overview of Machine Learning , 1983 .

[10]  Jaume Bacardit,et al.  Evolution of Multi-adaptive Discretization Intervals for a Rule-Based Genetic Learning System , 2002, IBERAMIA.

[11]  Keki B. Irani,et al.  Multi-interval discretization of continuos attributes as pre-processing for classi cation learning , 1993, IJCAI 1993.

[12]  Tom M. Mitchell,et al.  Machine Learning and Data Mining , 2012 .

[13]  Tom M. Mitchell,et al.  Generalization as Search , 2002 .

[14]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[15]  Ian H. Witten,et al.  Weka machine learning algorithms in java , 2000 .

[16]  Federico Divina,et al.  Non-universal Suffrage Selection Operators Favor Population Diversity in Genetic Algorithms , 2003, GECCO.

[17]  A. Debnath,et al.  Structure-activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. Correlation with molecular orbital energies and hydrophobicity. , 1991, Journal of medicinal chemistry.

[18]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .