A parellel genetic algorithm approach for automated discovery of censored production rules

Exceptions, which focus on a very small portion of dataset, have been discarded as noise in machine learning. It is interesting to discover exceptions, as they challenge the existing knowledge and often lead to the growth of knowledge in new directions. Discovering exceptions from voluminous data sets still remains a great challenge. In Knowledge Discovery in Databases (KDD), it is also significant to extract the knowledge in a form that is flexible and efficient enough for approximate reasoning. A Censored Production Rule (CPR) is a special kind of knowledge structure that is represented as an augmented production rule of the form: If P Then D Unless C, where C (Censor) is an exception to rule. If P Then D part of CPR holds frequently while Unless C part holds rarely. Discovery of CPRs not only provides a computational mechanism to exhibit variable precision logic but also results in the discovery of a set of rules of considerably reduced size. In this paper, a Parallel Genetic Algorithm approach is suggested for automated discovery of Censored Production Rules. The parallel Genetic Algorithm involves both data and control parallelism. A fitness function that incorporates the constraints of CPRs, is proposed. The experimental results establish the effectiveness of the proposed algorithm.

[1]  Filippo Neri,et al.  Search-Intensive Concept Induction , 1995, Evolutionary Computation.

[2]  Kamal Kant Bharadwaj,et al.  A Cumulative Learning Approach to Data Mining Employing Censored Production Rules (CPRs) , 2005, IEC.

[3]  H.S. Lopes,et al.  A parallel genetic algorithm for rule discovery in large databases , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[4]  Xavier Llorà,et al.  Knowledge-independent data mining with fine-grained parallel evolutionary algorithms , 2001 .

[5]  Alex A. Freitas,et al.  Discovering interesting prediction rules with a genetic algorithm , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[6]  Kamal Kant Bharadwaj,et al.  Adaptive Hierarchical Censored Production Rule-Based System: A Generic Algorithm Approach , 1996, SBIA.

[7]  Wilson Rivera,et al.  Scalable Parallel Genetic Algorithms , 2001, Artificial Intelligence Review.

[8]  Kamal K. Bharadwaj,et al.  Evolutionary Approach for Automated Discovery of Censored Production Rules , 2007 .

[9]  Balaji Padmanabhan,et al.  A Belief-Driven Method for Discovering Unexpected Patterns , 1998, KDD.

[10]  Erick Cantú-Paz,et al.  A Survey of Parallel Genetic Algorithms , 2000 .

[11]  Abraham Silberschatz,et al.  What Makes Patterns Interesting in Knowledge Discovery Systems , 1996, IEEE Trans. Knowl. Data Eng..

[12]  Wynne Hsu,et al.  Finding Interesting Patterns Using User Expectations , 1999, IEEE Trans. Knowl. Data Eng..

[13]  Wojciech Kwedlo,et al.  A Parallel Evolutionary Algorithm for Discovery of Decision Rules , 2003, PPAM.

[14]  Filippo Neri,et al.  Exploring the Power of Genetic Search in Learning Symbolic Classifiers , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Ryszard S. Michalski,et al.  Variable Precision Logic , 1986, Artif. Intell..

[16]  Enrique Alba,et al.  Parallelism and evolutionary algorithms , 2002, IEEE Trans. Evol. Comput..

[17]  Kamal Kant Bharadwaj,et al.  Parallelization of hierarchical censored production rules , 1995, Inf. Softw. Technol..

[18]  Einoshin Suzuki,et al.  Undirected Discovery of Interesting Exception Rules , 2002, Int. J. Pattern Recognit. Artif. Intell..

[19]  Jan M. Zytkow,et al.  Unified algorithm for undirected discovery of exception rules , 2005, Int. J. Intell. Syst..

[20]  Deborah R. Carvalho,et al.  A Genetic Algorithm With Sequential Niching For Discovering Small-disjunct Rules , 2002, GECCO.

[21]  Gregory Piatetsky-Shapiro,et al.  The KDD process for extracting useful knowledge from volumes of data , 1996, CACM.

[22]  Kamal Kant Bharadwaj,et al.  Hierarchical Censored Production Rules (HCPRs) system , 1992, Data Knowl. Eng..

[23]  Nicholas J. Radcliffe,et al.  A Genetic Algorithm-Based Approach to Data Mining , 1996, KDD.

[24]  Kamal Kant Bharadwaj,et al.  Some learning techniques in hierarchical censored production rules (HCPRs) system , 1998 .

[25]  Cosimo Anglano,et al.  A Network Genetic Algorithm for Concept Learning , 1997, ICGA.