Identification of Minimum Set of Master Regulatory Genes in Gene Regulatory Networks

Identification of master regulatory genes is one of the primary challenges in systems biology. The minimum dominating set problem is a powerful paradigm in analyzing such complex networks. In these models, genes stand as nodes and their interactions are assumed as edges. Here, members of a minimal dominating set could be regarded as master genes. As finitely many minimum dominating sets may exist in a network, it is difficult to identify which one represents the most appropriate set of master genes. In this paper, we develop a weighted gene regulatory network problem with two objectives as a version of the dominating set problem. Collective influence of each gene is considered as its weight. The first objective aims to find a master regulatory genes set with minimum cardinality, and the second objective identifies the one with maximum weight. The model is converted to a single objective using a parameter varying between zero and one. The model is implemented on three human networks, and the results are reported and compared with the existing model of weighted network. Parametric programming in linear optimization and logistic regression are also implemented on the arisen relaxed problem to provide a deeper understanding of the results. Learned from computational results in parametric analysis, for some ranges of priorities in objectives, the identified master regulatory genes are invariant, while some of them are identified for all priorities. This would be an indication that such genes have higher degree of being master regulatory ones, specially on the noisy networks.

[1]  Hong Yan,et al.  Comparative analysis of housekeeping and tissue-specific driver nodes in human protein interaction networks , 2016, BMC Bioinformatics.

[2]  Colin Cooper,et al.  Lower Bounds and Algorithms for Dominating Sets in Web Graphs , 2005, Internet Math..

[3]  A. Ghaffari-Hadigheh,et al.  Semidefinite Relaxation for the Dominating Set Problem , 2021 .

[4]  Hamid Bolouri,et al.  A method for estimating stochastic noise in large genetic regulatory networks , 2005, Bioinform..

[5]  A. Land,et al.  An Automatic Method for Solving Discrete Programming Problems , 1960, 50 Years of Integer Programming.

[6]  Tatsuya Akutsu,et al.  Structural controllability of unidirectional bipartite networks , 2013, Scientific Reports.

[7]  Peter J. Slater,et al.  Fundamentals of domination in graphs , 1998, Pure and applied mathematics.

[8]  C. Hwang Multiple Objective Decision Making - Methods and Applications: A State-of-the-Art Survey , 1979 .

[9]  Tianhai Tian,et al.  A continuous optimization approach for inferring parameters in mathematical models of regulatory networks , 2014, BMC Bioinformatics.

[10]  Dr. Susumu Ohno Major Sex-Determining Genes , 1979, Monographs on Endocrinology.

[11]  Roberto Cordone,et al.  An integer optimization approach for reverse engineering of gene regulatory networks , 2013, Discret. Appl. Math..

[12]  H. Kitano A robustness-based approach to systems-oriented drug design , 2007, Nature Reviews Drug Discovery.

[13]  Volkhard Helms,et al.  Identification of key player genes in gene regulatory networks , 2016, BMC Systems Biology.

[14]  Abdel-Rahman Hedar,et al.  Hybrid Genetic Algorithm for Minimum Dominating Set Problem , 2010, ICCSA.

[15]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[16]  Luonan Chen,et al.  Part mutual information for quantifying direct associations in networks , 2016, Proceedings of the National Academy of Sciences.

[17]  Margaret B. Cozzens,et al.  Dominating sets in social network graphs , 1988 .

[18]  Volkhard Helms,et al.  Integrative network-based approach identifies key genetic elements in breast invasive carcinoma , 2015, BMC Genomics.

[19]  Peter J. Fleming,et al.  An Overview of Evolutionary Algorithms in Multiobjective Optimization , 1995, Evolutionary Computation.

[20]  C. L. Liu,et al.  Introduction to Combinatorial Mathematics. , 1971 .

[21]  R. S. Laundy,et al.  Multiple Criteria Optimisation: Theory, Computation and Application , 1989 .

[22]  K. B. Williams,et al.  Management Models and Industrial Applications of Linear Programming , 1962 .

[23]  T. Akutsu,et al.  Minimum dominating set-based methods for analyzing biological networks. , 2016, Methods.

[24]  睦憲 柳浦,et al.  Combinatorial Optimization : Theory and Algorithms (3rd Edition), B. Korte and J. Vygen 著, 出版社 Springer, 発行 2006年, 全ページ 597頁, 価格 53.45ユーロ, ISBN 3-540-25684-9 , 2006 .

[25]  L. Lasdon,et al.  On a bicriterion formation of the problems of integrated system identification and system optimization , 1971 .

[26]  Kaisa Miettinen,et al.  Nonlinear multiobjective optimization , 1998, International series in operations research and management science.

[27]  Hitoshi Iba,et al.  Reconstruction of Gene Regulatory Networks from Gene Expression Data Using Decoupled Recurrent Neural Network Model , 2013 .

[28]  Yaochu Jin,et al.  Reconstructing biological gene regulatory networks: where optimization meets big data , 2014, Evol. Intell..

[29]  Joseph Douglas Horton,et al.  On Covering Sets and Error-Correcting Codes , 1971, J. Comb. Theory, Ser. A.

[30]  Edward R. Dougherty,et al.  Probabilistic Boolean networks: a rule-based uncertainty model for gene regulatory networks , 2002, Bioinform..

[31]  Martin Romauch,et al.  A mathematical program to refine gene regulatory networks , 2009, Discret. Appl. Math..

[32]  Xiao-Fei Zhang,et al.  Determining minimum set of driver nodes in protein-protein interaction networks , 2015, BMC Bioinformatics.

[33]  Fei Liu,et al.  Inference of Gene Regulatory Network Based on Local Bayesian Networks , 2016, PLoS Comput. Biol..

[34]  Michal Linial,et al.  Using Bayesian Networks to Analyze Expression Data , 2000, J. Comput. Biol..

[35]  Tatsuya Akutsu,et al.  Analysis of critical and redundant nodes in controlling directed and undirected complex networks using dominating sets , 2014, J. Complex Networks.

[36]  Hernán A. Makse,et al.  Influence maximization in complex networks through optimal percolation , 2015, Nature.

[37]  Albert-László Barabási,et al.  Destruction perfected , 2015, Nature.

[38]  W. Wong,et al.  Modeling gene regulation from paired expression and chromatin accessibility data , 2017, Proceedings of the National Academy of Sciences.

[39]  Ali Reza Khanteymoori,et al.  A robust gene regulatory network inference method base on Kalman filter and linear regression , 2018, PloS one.

[40]  Zheng Li,et al.  Large-scale dynamic gene regulatory network inference combining differential equation models with local dynamic Bayesian network analysis , 2011, Bioinform..

[41]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[42]  E. Polak,et al.  On Multicriteria Optimization , 1976 .

[43]  A. Bonato,et al.  Graphs and Hypergraphs , 2022 .

[44]  Jens Vygen,et al.  The Book Review Column1 , 2020, SIGACT News.

[45]  Wei-Po Lee,et al.  A clustering-based approach for inferring recurrent neural networks as gene regulatory networks , 2008, Neurocomputing.

[46]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[47]  Ioannis P. Androulakis,et al.  Informative gene selection and design of regulatory networks using integer optimization , 2008, Comput. Chem. Eng..

[48]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[49]  Ting Chen,et al.  Modeling Gene Expression with Differential Equations , 1998, Pacific Symposium on Biocomputing.

[50]  Jie Wu,et al.  On constructing k-connected k-dominating set in wireless ad hoc and sensor networks , 2006, J. Parallel Distributed Comput..

[51]  Tatsuya Akutsu,et al.  Dominating scale-free networks with variable scaling exponent: heterogeneous networks are not difficult to control , 2012 .

[52]  Noga Alon,et al.  Transversal numbers of uniform hypergraphs , 1990, Graphs Comb..

[53]  Xinkun Wang,et al.  An effective structure learning method for constructing gene networks , 2006, Bioinform..

[54]  M. Ehrgott,et al.  Primal-Dual Simplex Method for Multiobjective Linear Programming , 2007 .

[55]  Haiyuan Yu,et al.  HINT: High-quality protein interactomes and their applications in understanding human disease , 2012, BMC Systems Biology.

[56]  Abraham Charnes,et al.  Optimal Estimation of Executive Compensation by Linear Programming , 1955 .

[57]  D. di Bernardo,et al.  How to infer gene networks from expression profiles , 2007, Molecular systems biology.

[58]  Albert-László Barabási,et al.  Controllability of complex networks , 2011, Nature.