University of Birmingham Cooperative co-evolutionary module identification with application to cancer disease module discovery

—Module identification or community detection in complex networks has become increasingly important in many scientific fields because it provides insight into the relationship and interaction between network function and topology. In recent years, module identification algorithms based on stochastic optimization algorithms such as Evolutionary Algorithms have been demonstrated to be superior to other algorithms on small to medium scale networks. However, the scalability and resolution limit problems of these module identification algorithms have not been fully addressed, which impeded their application to real-world networks. This paper proposes a novel module identi-fication algorithm called Cooperative Co-evolutionary Module Identification to address these two problems. The proposed algorithm employs a cooperative co-evolutionary framework to handle large scale networks. We also incorporate a recursive partitioning scheme into the algorithm to effectively address the resolution limit problem. The performance of our algorithm is evaluated on twelve benchmark complex networks. As a medical application, we apply our algorithm to identify disease modules that differentiate low and high grade glioma tumours to gain insights into the molecular mechanisms that underpin the progression of glioma. Experimental results show that the proposed algorithm has a very competitive performance compared with other state-of-the-art module identification algorithms.

[1]  Iñaki Inza,et al.  Dealing with the evaluation of supervised classification algorithms , 2015, Artificial Intelligence Review.

[2]  Qingfu Zhang,et al.  An Evolutionary Many-Objective Optimization Algorithm Based on Dominance and Decomposition , 2015, IEEE Transactions on Evolutionary Computation.

[3]  Xiaohua Hu,et al.  Dynamic identifying protein functional modules based on adaptive density modularity in protein-protein interaction networks , 2015, BMC Bioinformatics.

[4]  Chia-Hsuan Yeh,et al.  Social Networks and Asset Price Dynamics , 2015, IEEE Transactions on Evolutionary Computation.

[5]  Yong Wang,et al.  Locating Multiple Optimal Solutions of Nonlinear Equation Systems Based on Multiobjective Optimization , 2015, IEEE Transactions on Evolutionary Computation.

[6]  Mark Hoogendoorn,et al.  Parameter Control in Evolutionary Algorithms: Trends and Challenges , 2015, IEEE Transactions on Evolutionary Computation.

[7]  Marissa Friedman,et al.  Glioblastoma: Molecular Pathways, Stem Cells and Therapeutic Targets , 2015, Cancers.

[8]  M. Tadesse,et al.  Pathway and Network Approaches for Identification of Cancer Signature Markers from Omics Data , 2015, Journal of Cancer.

[9]  Erwan Le Martelot,et al.  Fast multi-scale detection of overlapping communities using local criteria , 2014, Computing.

[10]  Tapabrata Ray,et al.  Differential Evolution With Dynamic Parameters Selection for Optimization Problems , 2014, IEEE Transactions on Evolutionary Computation.

[11]  Yuren Zhou,et al.  Performance Analysis of Evolutionary Algorithms for the Minimum Label Spanning Tree Problem , 2014, IEEE Transactions on Evolutionary Computation.

[12]  Andrew M. Tyrrell,et al.  Evolving Classifiers to Recognize the Movement Characteristics of Parkinson's Disease Patients , 2014, IEEE Transactions on Evolutionary Computation.

[13]  Kalyanmoy Deb,et al.  An Evolutionary Many-Objective Optimization Algorithm Using Reference-Point-Based Nondominated Sorting Approach, Part I: Solving Problems With Box Constraints , 2014, IEEE Transactions on Evolutionary Computation.

[14]  Xiaodong Li,et al.  Cooperative Co-Evolution With Differential Grouping for Large Scale Optimization , 2014, IEEE Transactions on Evolutionary Computation.

[15]  Xiaodong Li,et al.  Cooperative Coevolution With Route Distance Grouping for Large-Scale Capacitated Arc Routing Problems , 2014, IEEE Transactions on Evolutionary Computation.

[16]  Alexander Bailey,et al.  Genetic Programming for the Automatic Inference of Graph Models for Complex Networks , 2014, IEEE Transactions on Evolutionary Computation.

[17]  Maoguo Gong,et al.  Complex Network Clustering by Multiobjective Discrete Particle Swarm Optimization Based on Decomposition , 2014, IEEE Transactions on Evolutionary Computation.

[18]  Yanhui Hu,et al.  Integrating protein-protein interaction networks with phenotypes reveals signs of interactions , 2013, Nature Methods.

[19]  Judith A. Blake,et al.  Ten Quick Tips for Using the Gene Ontology , 2013, PLoS Comput. Biol..

[20]  H. Aburatani,et al.  The critical role of cyclin D2 in cell cycle progression and tumorigenicity of glioblastoma stem cells , 2013, Oncogene.

[21]  Shan He,et al.  Disease module identification from an integrated transcriptomic and interactomic network using evolutionary community extraction , 2013 .

[22]  Yong Wang,et al.  An improved (μ + λ)-constrained differential evolution for constrained optimization , 2013, Inf. Sci..

[23]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[24]  Qiang Huang,et al.  Community Detection Using Cooperative Co-evolutionary Differential Evolution , 2012, PPSN.

[25]  Anthony Brabazon,et al.  Comparing methods for module identification in grammatical evolution , 2012, GECCO '12.

[26]  Clara Pizzuti,et al.  A Multiobjective Genetic Algorithm to Find Communities in Complex Networks , 2012, IEEE Transactions on Evolutionary Computation.

[27]  Yong Wang,et al.  Combining Multiobjective Optimization With Differential Evolution to Solve Constrained Optimization Problems , 2012, IEEE Transactions on Evolutionary Computation.

[28]  Yong Wang,et al.  Community Detection in Social and Biological Networks Using Differential Evolution , 2012, LION.

[29]  Weidong Tian,et al.  An iterative network partition algorithm for accurate identification of dense network modules , 2011, Nucleic acids research.

[30]  Maoguo Gong,et al.  Memetic algorithm for community detection in networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  Andreas Noack,et al.  Multilevel local search algorithms for modularity clustering , 2011, JEAL.

[32]  Lazaros G. Papageorgiou,et al.  Module detection in complex networks using integer optimisation , 2010, Algorithms for Molecular Biology.

[33]  Efrén Mezura-Montes,et al.  Differential evolution in constrained numerical optimization: An empirical study , 2010, Inf. Sci..

[34]  S. Burma,et al.  Epidermal growth factor receptor in glioma: signal transduction, neuropathology, imaging, and radioresistance. , 2010, Neoplasia.

[35]  Yonghong Xiao,et al.  Pattern of retinoblastoma pathway inactivation dictates response to CDK4/6 inhibition in GBM , 2010, Proceedings of the National Academy of Sciences.

[36]  Saraswati Sukumar,et al.  The Hox genes and their roles in oncogenesis , 2010, Nature Reviews Cancer.

[37]  Ville Tirronen,et al.  Recent advances in differential evolution: a survey and experimental analysis , 2010, Artificial Intelligence Review.

[38]  Francisco Herrera,et al.  A study on the use of non-parametric tests for analyzing the evolutionary algorithms’ behaviour: a case study on the CEC’2005 Special Session on Real Parameter Optimization , 2009, J. Heuristics.

[39]  T. Murata,et al.  Advanced modularity-specialized label propagation algorithm for detecting communities in networks , 2009, 0910.1154.

[40]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[41]  Juergen Thiele,et al.  The 2008 World Health Organization classification system for myeloproliferative neoplasms , 2009, Cancer.

[42]  Kevin E. Bassler,et al.  Improved community structure detection using a modified fine-tuning strategy , 2009, ArXiv.

[43]  Xin Yao,et al.  Large scale evolutionary optimization using cooperative coevolution , 2008, Inf. Sci..

[44]  Robert Clarke,et al.  Gene Module Identification from Microarray Data Using Nonnegative Independent Component Analysis , 2008, Gene regulation and systems biology.

[45]  Amedeo Caflisch,et al.  Efficient modularity optimization by multistep greedy algorithm and vertex mover refinement. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[46]  Weixiong Zhang,et al.  Identifying network communities with a high resolution. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Aidong Zhang,et al.  A “Seed-Refine” Algorithm for Detecting Protein Complexes From Protein Interaction Data , 2007, IEEE Transactions on NanoBioscience.

[48]  Bin Liu,et al.  Michigan Molecular Interactions (MiMI): putting the jigsaw puzzle together , 2006, Nucleic Acids Res..

[49]  Robert A Lustig,et al.  Multicentric Glioblastoma Multiforme in a Patient with BRCA‐1 Invasive Breast Cancer , 2006, The breast journal.

[50]  Javier Béjar,et al.  Clustering algorithm for determining community structure in large networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[51]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[52]  P. Bork,et al.  Proteome survey reveals modularity of the yeast cell machinery , 2006, Nature.

[53]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Mike Tyers,et al.  BioGRID: a general repository for interaction datasets , 2005, Nucleic Acids Res..

[55]  P. Kleihues,et al.  Population-based studies on incidence, survival rates, and genetic alterations in astrocytic and oligodendroglial gliomas. , 2005, Journal of neuropathology and experimental neurology.

[56]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[57]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[58]  M. Newman,et al.  Identifying the role that animals play in their social networks , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[59]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[60]  Hanno Steen,et al.  Development of human protein reference database as an initial platform for approaching systems biology in humans. , 2003, Genome research.

[61]  M. Newman Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[62]  M. Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[63]  A. Arenas,et al.  Self-similar community structure in a network of human interactions. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[64]  A. Arenas,et al.  Macro- and micro-structure of trust networks , 2002, cond-mat/0206240.

[65]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[66]  M E Newman,et al.  Scientific collaboration networks. I. Network construction and fundamental results. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[67]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[68]  Alain Hertz,et al.  A framework for the description of evolutionary algorithms , 2000, Eur. J. Oper. Res..

[69]  Vladimir Batagelj,et al.  Some analyses of Erdős collaboration graph , 2000, Soc. Networks.

[70]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[71]  N. Smorodinsky,et al.  The breast cancer-associated MUC1 gene generates both a receptor and its cognate binding protein. , 1999, Cancer research.

[72]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[73]  John Scott Social Network Analysis , 1988 .

[74]  Brian W. Kernighan,et al.  An efficient heuristic procedure for partitioning graphs , 1970, Bell Syst. Tech. J..

[75]  Simon Lehnerer,et al.  Community Detection in Complex Networks using Genetic Algorithms , 2018, SKILL.

[76]  X. Yao,et al.  University of Birmingham DiME : a scalable disease module identification algorithm with application to glioma progression , 2014 .

[77]  Yijie Wang,et al.  Functional module identification in protein interaction networks by interaction patterns , 2014, Bioinform..

[78]  M. Lones,et al.  Evolving Classifiers to Recognise the Movement Characteristics of Parkinson’s Disease Patients , 2013 .

[79]  N. Gulbahce,et al.  Network medicine: a network-based approach to human disease , 2010, Nature Reviews Genetics.

[80]  BMC Bioinformatics BioMed Central Methodology article A new measure for functional similarity of gene products based on Gene Ontology , 2006 .

[81]  R. Storn,et al.  Differential Evolution - A simple and efficient adaptive scheme for global optimization over continuous spaces , 2004 .

[82]  Gary D Bader,et al.  BMC Bioinformatics Methodology article Statistical significance for hierarchical clustering in genetic association and microarray expression studies , 2003 .

[83]  Zfe T Sn,et al.  Differential Evolution-A simple and efficient adaptive scheme for global optimization over continuous spaces , 1996 .