mDBN: motif based learning of gene regulatory networks using dynamic bayesian networks

Solutions for deriving the most consistent Bayesian gene regulatory network model from given data sets using evolutionary algorithms typically only result in locally optimal solutions. Further, due to genetic drift, merely increasing the size of the population does not overcome this limitation. In this paper, we propose a two-stage genetic algorithm that systematically searches the whole search space using frequent subgraph mining techniques. The approach finds representative patterns present in different local optimal solutions in the first stage and then combines these frequent subgraphs (motifs) in the second stage to converge to the global optima. We apply the algorithm to both synthetic and real life networks of yeast and E.coli and show the effectiveness of our approach.

[1]  David Maxwell Chickering,et al.  Learning Bayesian Networks is , 1994 .

[2]  Dirk Husmeier,et al.  Sensitivity and specificity of inferring genetic regulatory interactions from microarray experiments with dynamic Bayesian networks , 2003, Bioinform..

[3]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[4]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[5]  Arthur C. Sanderson,et al.  Multimodal Function Optimization Using Minimal Representation Size Clustering and Its Application to Planning Multipaths , 1997, Evolutionary Computation.

[6]  David Maxwell Chickering,et al.  Finding Optimal Bayesian Networks , 2002, UAI.

[7]  Shuhei Kimura,et al.  Inference of S-system models of genetic networks using a cooperative coevolutionary algorithm , 2005, Bioinform..

[8]  Xuan Vinh Nguyen,et al.  GlobalMIT: learning globally optimal dynamic bayesian network with the mutual information test criterion , 2011, Bioinform..

[9]  Jiawei Han,et al.  Mining coherent dense subgraphs across massive biological networks for functional discovery , 2005, ISMB.

[10]  Madhu Chetty,et al.  Reconstructing genetic networks with concurrent representation of instantaneous and time-delayed interactions , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[11]  Madhu Chetty,et al.  Combining Instantaneous and Time-Delayed Interactions between Genes - A Two Phase Algorithm Based on Information Theory , 2011, Australasian Conference on Artificial Intelligence.

[12]  Hitoshi Iba,et al.  Reverse engineering gene regulatory network from microarray data using linear time-variant model , 2010, BMC Bioinformatics.

[13]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[14]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[15]  Fabian Mörchen,et al.  Optimizing time series discretization for knowledge discovery , 2005, KDD '05.

[16]  Ramesh Ram,et al.  A Markov-Blanket-Based Model for Gene Regulatory Network Inference , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Marcel J. T. Reinders,et al.  Studying the Conditions for Learning Dynamic Bayesian Networks to Discover Genetic Regulatory Networks , 2003, Simul..

[18]  Xuan Vinh Nguyen,et al.  Simultaneous learning of instantaneous and time-delayed genetic interactions using novel information theoretic scoring technique , 2011, BMC Systems Biology.

[19]  Guoliang Xue,et al.  Applying two-level simulated annealing on Bayesian structure learning to infer genetic networks , 2004 .

[20]  Paul P. Wang,et al.  Advances to Bayesian network inference for generating causal networks from observational biological data , 2004, Bioinform..

[21]  Madhu Chetty,et al.  Clustered Memetic Algorithm With Local Heuristics for Ab Initio Protein Structure Prediction , 2013, IEEE Transactions on Evolutionary Computation.

[22]  Luis M. de Campos,et al.  A Scoring Function for Learning Bayesian Networks based on Mutual Information and Conditional Independence Tests , 2006, J. Mach. Learn. Res..

[23]  Lyle H. Ungar,et al.  Using prior knowledge to improve genetic network reconstruction from microarray data , 2004, Silico Biol..

[24]  Bartek Wilczynski,et al.  BNFinder: exact and efficient method for learning Bayesian networks , 2008, Bioinform..

[25]  A. C. Martínez-Estudillo,et al.  Hybridization of evolutionary algorithms and local search by means of a clustering method , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[26]  H. Iba,et al.  Inferring Gene Regulatory Networks using Differential Evolution with Local Search Heuristics , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[27]  David E. Goldberg,et al.  Optimizing Global-Local Search Hybrids , 1999, GECCO.

[28]  P. Brazhnik,et al.  Gene networks: how to put the function in genomics. , 2002, Trends in biotechnology.