Mining quantitative association rules based on evolutionary computation and its application to atmospheric pollution

This research presents the mining of quantitative association rules based on evolutionary computation techniques. First, a real-coded genetic algorithm that extends the well-known binary-coded CHC algorithm has been projected to determine the intervals that define the rules without needing to discretize the attributes. The proposed algorithm is evaluated in synthetic datasets under different levels of noise in order to test its performance and the reported results are then compared to that of a multi-objective differential evolution algorithm, recently published. Furthermore, rules from real-world time series such as temperature, humidity, wind speed and direction of the wind, ozone, nitrogen monoxide and sulfur dioxide have been discovered with the objective of finding all existing relations between atmospheric pollution and climatological conditions.

[1]  Euntai Kim,et al.  A genetic feature weighting scheme for pattern recognition , 2007, Integr. Comput. Aided Eng..

[2]  Maybin K. Muyeba,et al.  An algorithm to mine general association rules from tabular data , 2007, Inf. Sci..

[3]  Stefan Kramer,et al.  Analyzing microarray data using quantitative association rules , 2005, ECCB/JBI.

[4]  Reda Alhajj,et al.  Utilizing Genetic Algorithms to Optimize Membership Functions for Fuzzy Weighted Association Rules Mining , 2006, Applied Intelligence.

[5]  Marco Vannucci,et al.  Meaningful discretization of continuous features for association rules mining by means of a SOM , 2004, ESANN.

[6]  Doheon Lee,et al.  Identification of temporal association rules from time-series microarray data sets , 2009, DTMBIO '08.

[7]  Marc Parizeau,et al.  Using Evolutionary Optimization Techniques for Scheduling Water Pipe Renewal Considering a Short Planning Horizon , 2008, Comput. Aided Civ. Infrastructure Eng..

[8]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1971 .

[9]  Hojjat Adeli,et al.  DISCRETE COST OPTIMIZATION OF COMPOSITE FLOORS USING A FLOATING-POINT GENETIC ALGORITHM , 2001 .

[10]  Francisco Herrera,et al.  Learning the membership function contexts for mining fuzzy association rules by using genetic algorithms , 2009, Fuzzy Sets Syst..

[11]  Tao-ming Cheng,et al.  Integrating Messy Genetic Algorithms and Simulation to Optimize Resource Utilization , 2009, Comput. Aided Civ. Infrastructure Eng..

[12]  Nitin Gupta,et al.  Mining Quantitative Association Rules in Protein Sequences , 2006, Selected Papers from AusDM.

[13]  Chengqi Zhang,et al.  Genetic algorithm-based strategy for identifying association rules without specifying actual minimum support , 2009, Expert Syst. Appl..

[14]  Stefan Kramer,et al.  Quantitative association rules based on half-spaces: an optimization approach , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[15]  Kamal C. Sarma,et al.  FUZZY GENETIC ALGORITHM FOR OPTIMIZATION OF STEEL STRUCTURES , 2000 .

[16]  Baoping Yan,et al.  Mining Quantitative Association Rules on Overlapped Intervals , 2005, ADMA.

[17]  Terje Haukaas,et al.  Reliability‐Based Optimal Design of Electrical Transmission Towers Using Multi‐Objective Genetic Algorithms , 2007, Comput. Aided Civ. Infrastructure Eng..

[18]  Gilles Venturini,et al.  SIA: A Supervised Inductive Algorithm with Genetic Search for Learning Attributes based Concepts , 1993, ECML.

[19]  Shijin Li,et al.  Discovery Association Rules in Time Series of Hydrology , 2007, 2007 IEEE International Conference on Integration Technology.

[20]  Ester Bernadó-Mansilla,et al.  First approach toward on-line evolution of association rules with learning classifier systems , 2008, GECCO '08.

[21]  Larry J. Eshelman The CHC Adaptive Search Algo-rithm , 1991 .

[22]  Reda Alhajj,et al.  Genetic algorithm based framework for mining fuzzy association rules , 2005, Fuzzy Sets Syst..

[23]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[24]  Eleni I. Vlahogianni,et al.  Spatio‐Temporal Short‐Term Urban Traffic Volume Forecasting Using Genetically Optimized Modular Networks , 2007, Comput. Aided Civ. Infrastructure Eng..

[25]  Erhan Akin,et al.  Rough particle swarm optimization and its applications in data mining , 2008, Soft Comput..

[26]  Oscar Cordón,et al.  Feature-based image registration by means of the CHC evolutionary algorithm , 2006, Image Vis. Comput..

[27]  Yehuda Lindell,et al.  A Statistical Theory for Quantitative Association Rules , 1999, KDD '99.

[28]  S. Sahu,et al.  Improved space–time forecasting of next day ozone concentrations in the eastern US , 2009 .

[29]  Bilal Alatas,et al.  MODENAR: Multi-objective differential evolution algorithm for mining numeric association rules , 2008, Appl. Soft Comput..

[30]  Pavel Brazdil,et al.  Proceedings of the European Conference on Machine Learning , 1993 .

[31]  María José del Jesús,et al.  Evolutionary Fuzzy Rule Induction Process for Subgroup Discovery: A Case Study in Marketing , 2007, IEEE Transactions on Fuzzy Systems.

[32]  D. Godbold,et al.  Effects of acid rain on forest processes , 1994 .

[33]  Jesús S. Aguilar-Ruiz,et al.  Natural Encoding for Evolutionary Supervised Learning , 2007, IEEE Transactions on Evolutionary Computation.

[34]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[35]  Sancho Salcedo-Sanz,et al.  An evolutionary multiclass algorithm for automatic classification of high range resolution radar targets , 2009, Integr. Comput. Aided Eng..