Mining gene network by combined association rules and genetic algorithm

As gene expression datasets become larger and larger, mining the interaction between genes from large scale gene expression data will be rendered difficult extremely. In this paper, we proposed to combine association rules with genetic algorithm (GA) to mine gene networks from global gene expression data. Firstly, the association rules algorithm is improved from the limitation of the frequent itemsets, the compression of the transaction database and the stored forms of the records in the transaction database for decreasing the computational complexity of this algorithm: Then, the mined rules are further optimized by the genetic algorithm. The efficient combination between two algorithms is realized through the reasonable design of Coding/Decoding, fitness function and genetic manipulation in GA. An optimizing selection operator is introduced in order to enhance the searching efficiency of GA. Finally, the computational tests have showed that the method combined by association rule and genetic algorithm can mine some important interactions between genes from global gene expression datasets.

[1]  Gregory F. Cooper,et al.  Discovery of Causal Relationships in a Gene-Regulation Pathway from a Mixture of Experimental and Observational DNA Microarray Data , 2001, Pacific Symposium on Biocomputing.

[2]  Chad Creighton,et al.  Mining gene expression databases for association rules , 2003, Bioinform..

[3]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[4]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[5]  Patrik D'haeseleer,et al.  Genetic network inference: from co-expression clustering to reverse engineering , 2000, Bioinform..

[6]  Mingyi Wang,et al.  A hybrid Bayesian network learning method for constructing gene networks , 2007, Comput. Biol. Chem..

[7]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[8]  Carsten Peterson,et al.  Random Boolean network models and the yeast transcriptional network , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[9]  R. Somogyi,et al.  The gene expression matrix: towards the extraction of genetic network architectures , 1997 .

[10]  Xinkun Wang,et al.  An effective structure learning method for constructing gene networks , 2006, Bioinform..

[11]  Madhukar S. Dasika,et al.  A Mixed Integer Linear Programming (MILP) Framework for Inferring Time Delay in Gene Regulatory Networks , 2004, Pacific Symposium on Biocomputing.

[12]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[13]  Hsinchun Chen,et al.  Large-scale regulatory network analysis from microarray data: modified Bayesian network learning and association rule mining , 2007, Decis. Support Syst..

[14]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[15]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[16]  Michael Ruogu Zhang,et al.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. , 1998, Molecular biology of the cell.