Active module identification in biological networks

This thesis addresses the problem of active module identification in biological networks. Active module identification is a research topic in network biology that aims to identify regions in network showing striking changes in activity. It is often associated with a given cellular response and expected to reveal dynamic and process-specific information. The key research questions for this thesis are the practical formulations of active module identification problem,the design of effective, efficient and robust algorithms to identify active modules, and the right way to interpret identified active module. This thesis contributes by proposing three different algorithm frameworks to address the research question from three different aspects. It first explores an integrated approach of combining both gene differential expression and differential correlation, formulates it as a multi-objective problem, and solves it on both simulated data and real world data. Then the thesis investigates a novel approach that brings in prior knowledge of biological process, and balances between pure data-driven search and prior information guidance. Finally, the thesis presents a brand new framework of identifying active module and topological communities simultaneously using evolutionary multitasking, accompanied with a series of task-specific algorithm designs and improvements, and provides a new way of integrating topological information to help the interpretation of active module.

[1]  J. Hopfield,et al.  From molecular to modular cell biology , 1999, Nature.

[2]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[3]  Hua Xu,et al.  Evolutionary multitasking in permutation-based combinatorial optimization problems: Realization with TSP, QAP, LOP, and JSP , 2016, 2016 IEEE Region 10 Conference (TENCON).

[4]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[5]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[6]  K. Aihara,et al.  Uncovering signal transduction networks from high-throughput data by integer linear programming , 2008, Nucleic acids research.

[7]  Trey Ideker,et al.  Molecular networks in context , 2015, Nature Biotechnology.

[8]  M. Newman,et al.  Mixing patterns in networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[9]  Rohitash Chandra,et al.  Evolutionary Multi-task Learning for Modular Training of Feedforward Neural Networks , 2016, ICONIP.

[10]  Stan Pounds,et al.  Estimating the Occurrence of False Positives and False Negatives in Microarray Studies by Approximating and Partitioning the Empirical Distribution of P-values , 2003, Bioinform..

[11]  Hongyu Zhao,et al.  COSINE: COndition-SpecIfic sub-NEtwork identification using a global optimization method , 2011, Bioinform..

[12]  Nico P. E. Vermeulen,et al.  Involvement of the Pleiotropic Drug Resistance Response, Protein Kinase C Signaling, and Altered Zinc Homeostasis in Resistance of Saccharomyces cerevisiae to Diclofenac , 2011, Applied and Environmental Microbiology.

[13]  Joshua M. Stuart,et al.  A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules , 2003, Science.

[14]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  E. Blalock,et al.  Hippocampal and Cognitive Aging across the Lifespan: A Bioenergetic Shift Precedes and Increased Cholesterol Trafficking Parallels Memory Impairment , 2009, The Journal of Neuroscience.

[16]  Massimo Marchiori,et al.  Error and attacktolerance of complex network s , 2004 .

[17]  Kay Chen Tan,et al.  Multiobjective Multifactorial Optimization in Evolutionary Multitasking , 2017, IEEE Transactions on Cybernetics.

[18]  Peter J. Fleming,et al.  An Overview of Evolutionary Algorithms in Multiobjective Optimization , 1995, Evolutionary Computation.

[19]  Markus J. Herrgård,et al.  Network-based prediction of human tissue-specific metabolism , 2008, Nature Biotechnology.

[20]  Ivor W. Tsang,et al.  Memetic Search With Interdomain Learning: A Realization Between CVRP and CARP , 2015, IEEE Transactions on Evolutionary Computation.

[21]  David J. Reiss,et al.  Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks , 2006, BMC Bioinformatics.

[22]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[23]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[24]  Bernhard Sendhoff,et al.  Multi co-objective evolutionary optimization: Cross surrogate augmentation for computationally expensive problems , 2012, 2012 IEEE Congress on Evolutionary Computation.

[25]  Benno Schwikowski,et al.  Discovering regulatory and signalling circuits in molecular interaction networks , 2002, ISMB.

[26]  Alain Pétrowski,et al.  A clearing procedure as a niching method for genetic algorithms , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[27]  Raj Rao Nadakuditi,et al.  Graph spectra and the detectability of community structure in networks , 2012, Physical review letters.

[28]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[29]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[30]  Taesung Park,et al.  Identification of differentially expressed subnetworks based on multivariate ANOVA , 2009, BMC Bioinformatics.

[31]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Haifeng Du,et al.  A genetic algorithm with local search strategy for improved detection of community structure , 2010, Complex..

[33]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[34]  Ivor W. Tsang,et al.  Memes as building blocks: a case study on evolutionary optimization + transfer learning for routing problems , 2015, Memetic Comput..

[35]  M. Newman Analysis of weighted networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[36]  Yew-Soon Ong,et al.  Towards a new Praxis in optinformatics targeting knowledge re-use in evolutionary computation: simultaneous problem learning and optimization , 2016, Evolutionary Intelligence.

[37]  Yew-Soon Ong,et al.  Evolutionary multitasking in bi-level optimization , 2015 .

[38]  Tobias Müller,et al.  Bioinformatics Applications Note Systems Biology Bionet: an R-package for the Functional Analysis of Biological Networks , 2022 .

[39]  Qiang Huang,et al.  Cooperative Co-Evolutionary Module Identification With Application to Cancer Disease Module Discovery , 2016, IEEE Transactions on Evolutionary Computation.

[40]  A. Wagner How the global structure of protein interaction networks evolves , 2002, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[41]  Yong Wang,et al.  Community Detection in Social and Biological Networks Using Differential Evolution , 2012, LION.

[42]  S. Horvath,et al.  Statistical Applications in Genetics and Molecular Biology , 2011 .

[43]  Yew-Soon Ong,et al.  Multifactorial Evolution: Toward Evolutionary Multitasking , 2016, IEEE Transactions on Evolutionary Computation.

[44]  Lothar Thiele,et al.  A systematic comparison and evaluation of biclustering methods for gene expression data , 2006, Bioinform..

[45]  Yew-Soon Ong,et al.  Evolutionary Multitasking: A Computer Science View of Cognitive Multitasking , 2016, Cognitive Computation.

[46]  Qiang Huang,et al.  Community Detection Using Cooperative Co-evolutionary Differential Evolution , 2012, PPSN.

[47]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[48]  Arlindo L. Oliveira,et al.  Biclustering algorithms for biological data analysis: a survey , 2004, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[49]  Zexuan Zhu,et al.  Active module identification in intracellular networks using a memetic algorithm with a new binary decoding scheme , 2017, BMC Genomics.

[50]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Xin Yao,et al.  DiME: A Scalable Disease Module Identification Algorithm with Application to Glioma Progression , 2014, PloS one.

[52]  Daniele Muraro,et al.  An integrative analysis of gene expression and molecular interaction data to identify dys-regulated sub-networks in inflammatory bowel disease , 2016, BMC Bioinformatics.

[53]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Sharon L. Milgram,et al.  The Small World Problem , 1967 .

[55]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[56]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[57]  Steven Skiena,et al.  The Algorithm Design Manual , 2020, Texts in Computer Science.

[58]  Ronghua Shang,et al.  Community detection based on modularity and an improved genetic algorithm , 2013 .

[59]  Bor-Sen Chen,et al.  Integrated cellular network of transcription regulations and protein-protein interactions , 2010, BMC Systems Biology.

[60]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[61]  T. Ideker,et al.  Integrative approaches for finding modular structure in biological networks , 2013, Nature Reviews Genetics.

[62]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[63]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[64]  Yu Xia,et al.  Condition specific subnetwork identification using an optimization model , 2008 .

[65]  M. Newman Communities, modules and large-scale structure in networks , 2011, Nature Physics.

[66]  D. Allison,et al.  Microarray data analysis: from disarray to consolidation and consensus , 2006, Nature Reviews Genetics.

[67]  Piet Van Mieghem,et al.  On the Robustness of Complex Networks by Using the Algebraic Connectivity , 2008, Networking.

[68]  David Lusseau,et al.  The emergent properties of a dolphin social network , 2003, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[69]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[70]  Christina Backes,et al.  An integer linear programming approach for finding deregulated subgraphs in regulatory networks , 2011, Nucleic acids research.

[71]  Eli Upfal,et al.  Algorithms for Detecting Significantly Mutated Pathways in Cancer , 2010, RECOMB.

[72]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[73]  Yew-Soon Ong,et al.  Memetic Computation—Past, Present & Future [Research Frontier] , 2010, IEEE Computational Intelligence Magazine.

[74]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[75]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[76]  Jing Zhu,et al.  Edge-based scoring and searching method for identifying condition-responsive protein-protein interaction sub-network , 2007, Bioinform..

[77]  Martin Klammer,et al.  Identifying differentially regulated subnetworks from phosphoproteomic data , 2010, BMC Bioinformatics.

[78]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.