GreenSim: A Network Simulator for Comprehensively Validating and Evaluating New Machine Learning Techniques for Network Structural Inference

Networks are very important in many fields of machine learning research. Within networks research, inferring the structure of unknown networks is often a key problem; e.g. of genetic regulatory networks. However, there are very few well-known biological networks, and good simulation is essential for validating and evaluating novel structural inference techniques. Further, the importance of large, genome-wide structural inference is increasingly recognised, but there does not appear to be a good simulator available for large networks. This paper presents GreenSim, a simulator that helps address this gap. GreenSim automatically generates large, genome-size networks with more biologically realistic structural characteristics and 2nd-order non-linear regulatory functions. The simulator itself and the novel method used for generating a network structure with appropriate in- and out-degree distributions may also generalise easily to other types of network. GreenSim is available online at: http://syntilect.com/cgf/pubs:software

[1]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[2]  V. Anne Smith,et al.  Using Bayesian Network Inference Algorithms to Recover Molecular Genetic Regulatory Networks , 2002 .

[3]  Alvis Brazma,et al.  Modelling gene networks at different organisational levels , 2005, FEBS letters.

[4]  Ilya Shmulevich,et al.  On Learning Gene Regulatory Networks Under the Boolean Network Model , 2003, Machine Learning.

[5]  Matti Nykter,et al.  Simulation of microarray data with realistic characteristics , 2006, BMC Bioinformatics.

[6]  Christopher R. Myers,et al.  Universally Sloppy Parameter Sensitivities in Systems Biology Models , 2007, PLoS Comput. Biol..

[7]  Dario Floreano,et al.  Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods , 2009, J. Comput. Biol..

[8]  Michael A. Beer,et al.  Whole-genome discovery of transcription factor binding sites by network-level conservation. , 2003, Genome research.

[9]  D. Lazer,et al.  Inferring Social Network Structure using Mobile Phone Data , 2006 .

[10]  J. Collins,et al.  Size matters: network inference tackles the genome scale , 2007, Molecular systems biology.

[11]  L Glass,et al.  Counting and classifying attractors in high dimensional dynamical systems. , 1996, Journal of theoretical biology.

[12]  Paul P. Wang,et al.  Advances to Bayesian network inference for generating causal networks from observational biological data , 2004, Bioinform..

[13]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[14]  Richard Bonneau,et al.  The inferelator 2.0: A scalable framework for reconstruction of dynamic regulatory network models , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[15]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[16]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[17]  Vasile Palade,et al.  Evaluating Clustering Algorithms for Genetic Regulatory Network Structural Inference , 2009, SGAI Conf..

[18]  Yoganand Balagurunathan,et al.  Noise factor analysis for cDNA microarrays. , 2004, Journal of biomedical optics.

[19]  Hsinchun Chen,et al.  Large-scale regulatory network analysis from microarray data: modified Bayesian network learning and association rule mining , 2007, Decis. Support Syst..

[20]  David Lazer,et al.  Inferring friendship network structure by using mobile phone data , 2009, Proceedings of the National Academy of Sciences.

[21]  S. Goodreau,et al.  The Role of Commercial Sex Venues in the HIV Epidemic Among Men Who Have Sex With Men , 2010, Epidemiology.

[22]  Vasile Palade,et al.  Machine Learning and Genetic Regulatory Networks: A Review and a Roadmap , 2009, Foundations of Computational Intelligence.

[23]  Alvis Brazma,et al.  Current approaches to gene regulatory network modelling , 2007, BMC Bioinformatics.

[24]  Tao Zhou,et al.  Evolving model of weighted networks inspired by scientific collaboration networks , 2005 .