Computational methods for Gene Regulatory Networks reconstruction and analysis: A review

In the recent years, the vast amount of genetic information generated by new-generation approaches, have led to the need of new data handling methods. The integrative analysis of diverse-nature gene information could provide a much-sought overview to study complex biological systems and processes. In this sense, Gene Regulatory Networks (GRN) arise as an increasingly-promising tool for the modelling and analysis of biological processes. This review is an attempt to summarize the state of the art in the field of GRNs. Essential points in the field are addressed, thereof: (a) the type of data used for network generation, (b) machine learning methods and tools used for network generation, (c) model optimization and (d) computational approaches used for network validation. This survey is intended to provide an overview of the subject for readers to improve their knowledge in the field of GRN for future research.

[1]  Sriyankar Acharyya,et al.  Neural model of gene regulatory network: a survey on supportive meta-heuristics , 2016, Theory in Biosciences.

[2]  Albertha J. M. Walhout,et al.  What does biologically meaningful mean? A perspective on gene regulatory network validation , 2011, Genome Biology.

[3]  S. Yang,et al.  Advances in DNA sequencing technologies for high resolution HLA typing. , 2015, Human immunology.

[4]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[5]  Peter J Woolf,et al.  Reconstructing transcriptional regulatory networks using three-way mutual information and Bayesian networks. , 2010, Methods in molecular biology.

[6]  David E. Williams,et al.  Application of a fuzzy neural network model in predicting polycyclic aromatic hydrocarbon-mediated perturbations of the Cyp1b1 transcriptional regulatory network in mouse skin. , 2013, Toxicology and applied pharmacology.

[7]  Vanathi Gopalakrishnan,et al.  Novel Application of Junction Trees to the Interpretation of Epigenetic Differences among Lung Cancer Subtypes , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[8]  Yufeng Liu,et al.  Joint estimation of multiple dependent Gaussian graphical models with applications to mouse genomics , 2016, Biometrika.

[9]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[10]  Ying Li,et al.  RefNetBuilder: a platform for construction of integrated reference gene regulatory networks from expressed sequence tags , 2011, BMC Bioinformatics.

[11]  Arpad Kelemen,et al.  Computational dynamic approaches for temporal omics data with applications to systems medicine , 2017, BioData Mining.

[12]  Abhijeet Pataskar,et al.  Computational challenges in modeling gene regulatory events , 2016, Transcription.

[13]  Tom C Freeman,et al.  Petri Net computational modelling of Langerhans cell Interferon Regulatory Factor Network predicts their role in T cell activation , 2017, Scientific Reports.

[14]  Francesco C Stingo,et al.  miRNA–target gene regulatory networks: A Bayesian integrative approach to biomarker selection with application to kidney cancer , 2015, Biometrics.

[15]  Chris Wiggins,et al.  ARACNE: An Algorithm for the Reconstruction of Gene Regulatory Networks in a Mammalian Cellular Context , 2004, BMC Bioinformatics.

[16]  Pierre Geurts,et al.  On protocols and measures for the validation of supervised methods for the inference of biological networks , 2013, Front. Genet..

[17]  David Page,et al.  Area under the Precision-Recall Curve: Point Estimates and Confidence Intervals , 2013, ECML/PKDD.

[18]  I S Kohane,et al.  Mutual information relevance networks: functional genomic clustering using pairwise entropy measurements. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[19]  Sergi Valverde,et al.  Structural determinants of criticality in biological networks , 2015, Front. Physiol..

[20]  Xiaodong Wang,et al.  Reverse engineering gene regulatory networks from measurement with missing values , 2016, EURASIP J. Bioinform. Syst. Biol..

[21]  Gianluca Bontempi,et al.  On the Impact of Entropy Estimation on Transcriptional Regulatory Network Inference Based on Mutual Information , 2008, EURASIP J. Bioinform. Syst. Biol..

[22]  Kevin Kontos,et al.  Information-Theoretic Inference of Large Transcriptional Regulatory Networks , 2007, EURASIP J. Bioinform. Syst. Biol..

[23]  A. Guiseppi-Elie,et al.  Stable Gene Regulatory Network Modeling From Steady-State Data † , 2016, Bioengineering.

[24]  Philippe Salembier,et al.  NetBenchmark: a bioconductor package for reproducible benchmarks of gene regulatory network inference , 2015, BMC Bioinformatics.

[25]  Tanya Barrett,et al.  The Gene Expression Omnibus Database , 2016, Statistical Genomics.

[26]  Maria Simak,et al.  Exploring candidate biological functions by Boolean Function Networks for Saccharomyces cerevisiae , 2017, PloS one.

[27]  Michael Seifert,et al.  regNet: an R package for network-based propagation of gene expression alterations , 2018, Bioinform..

[28]  Paul Workman,et al.  SiGNet: A signaling network data simulator to enable signaling network inference , 2017, PloS one.

[29]  Haiyan Huang,et al.  Review on statistical methods for gene network reconstruction using expression data. , 2014, Journal of theoretical biology.

[30]  Mingyi Wang,et al.  Inferring large-scale gene regulatory networks using a low-order constraint-based algorithm. , 2010, Molecular bioSystems.

[31]  E. O’Shea,et al.  Living with noisy genes: how cells function reliably with inherent variability in gene expression. , 2007, Annual review of biophysics and biomolecular structure.

[32]  C. Zhou,et al.  Construction of gene/protein interaction networks for primary myelofibrosis and KEGG pathway-enrichment analysis of molecular compounds. , 2015, Genetics and molecular research : GMR.

[33]  A. Verri,et al.  Enhancing Interpretability of Gene Signatures with Prior Biological Knowledge , 2016, Microarrays.

[34]  Fabio Stella,et al.  Gene network inference using continuous time Bayesian networks: a comparative study and application to Th17 cell differentiation , 2014, BMC Bioinformatics.

[35]  Joshua E. S. Socolar,et al.  Global control of cell-cycle transcription by coupled CDK and network oscillators , 2008, Nature.

[36]  Samik Ghosh,et al.  Harnessing Diversity towards the Reconstructing of Large Scale Gene Regulatory Networks , 2013, PLoS Comput. Biol..

[37]  S. Pineda,et al.  Integrative eQTL analysis of tumor and host omics data in individuals with bladder cancer , 2017, Genetic epidemiology.

[38]  Simon Rosenfeld,et al.  Mathematical descriptions of biochemical networks: stability, stochasticity, evolution. , 2011, Progress in biophysics and molecular biology.

[39]  Alfred O. Hero,et al.  Using Directed Information to Build Biologically Relevant Influence Networks , 2007, J. Bioinform. Comput. Biol..

[40]  Jie Tan,et al.  Cross-platform normalization of microarray and RNA-seq data for machine learning applications , 2016, PeerJ.

[41]  Jun Wang,et al.  Boolean analysis reveals systematic interactions among low-abundance species in the human gut microbiome , 2017, PLoS Comput. Biol..

[42]  N. Díaz-Díaz,et al.  Gene Network Biological Validity Based on Gene-Gene Interaction Relevance , 2014, TheScientificWorldJournal.

[43]  G. Pados,et al.  Identification of cancer biomarkers of prognostic value using specific gene regulatory networks (GRN): a novel role of RAD51AP1 for ovarian and lung cancers , 2017, Carcinogenesis.

[44]  G. Altay,et al.  Structural influence of gene networks on their inference: analysis of C3NET. , 2011 .

[45]  Shriprakash Sinha Integration of prior biological knowledge and epigenetic information enhances the prediction accuracy of the Bayesian Wnt pathway. , 2014, Integrative biology : quantitative biosciences from nano to macro.

[46]  A. Masoudi-Nejad,et al.  Computational Analysis of Reciprocal Association of Metabolism and Epigenetics in the Budding Yeast: A Genome-Scale Metabolic Model (GSMM) Approach , 2014, PloS one.

[47]  Juan J. Diaz-Montana,et al.  Development and use of the Cytoscape app GFD-Net for measuring semantic dissimilarity of gene networks , 2014, F1000Research.

[48]  Jessica C. Mar,et al.  Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data , 2018, BMC Bioinformatics.

[49]  Kara Dolinski,et al.  The BioGRID interaction database: 2017 update , 2016, Nucleic Acids Res..

[50]  Lili Liu,et al.  Comparative study of discretization methods of microarray data for inferring transcriptional regulatory networks , 2010, BMC Bioinformatics.

[51]  Pedro Mayorga,et al.  Foundations for Modeling the Dynamics of gene regulatory Networks: a Multilevel-Perspective Review , 2014, J. Bioinform. Comput. Biol..

[52]  K. Tomczak,et al.  The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge , 2015, Contemporary oncology.

[53]  Andrew E. Teschendorff,et al.  DART: Denoising Algorithm based on Relevance network Topology improves molecular pathway activity inference , 2011, BMC Bioinformatics.

[54]  Fabian J Theis,et al.  Decoding the Regulatory Network for Blood Development from Single-Cell Gene Expression Measurements , 2015, Nature Biotechnology.

[55]  M. Gerstein,et al.  Analyzing protein function on a genomic scale: the importance of gold-standard positives and negatives for network prediction. , 2004, Current opinion in microbiology.

[56]  Jose Luis Vazquez Noguera,et al.  Structure Optimization for Large Gene Networks Based on Greedy Strategy , 2018, Comput. Math. Methods Medicine.

[57]  David Page,et al.  Area under the Precision-Recall Curve: Point Estimates and Confidence Intervals , 2013, ECML/PKDD.

[58]  Sergio Contrino,et al.  ArrayExpress—a public repository for microarray gene expression data at the EBI , 2004, Nucleic Acids Res..

[59]  Mark S. Cohen,et al.  The utility of data-driven feature selection: Re: Chu et al. 2012 , 2014, NeuroImage.

[60]  Reda Alhajj,et al.  Employing decomposable partially observable Markov decision processes to control gene regulatory networks , 2017, Artif. Intell. Medicine.

[61]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[62]  Hisanori Kiryu,et al.  SCODE: an efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation , 2016, bioRxiv.

[63]  Toshihisa Takagi,et al.  The DDBJ Japanese Genotype-phenotype Archive for genetic and phenotypic human data , 2014, Nucleic Acids Res..

[64]  Carlos D. Barranco,et al.  Incorporating biological knowledge for construction of fuzzy networks of gene associations , 2016, Appl. Soft Comput..

[65]  Benjamin Haibe-Kains,et al.  Untangling statistical and biological models to understand network inference: the need for a genomics network ontology , 2014, Front. Genet..

[66]  Philippe Kourilsky,et al.  The natural defense system and the normative self model , 2016, F1000Research.

[67]  Miha Mraz,et al.  Fuzzy Logic as a Computational Tool for Quantitative Modelling of Biological Systems with Uncertain Kinetic Data , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[68]  Giancarlo Mauri,et al.  CABeRNET: a Cytoscape app for augmented Boolean models of gene regulatory NETworks , 2015, BMC Bioinformatics.

[69]  Koray D. Kaya,et al.  NRL-Regulated Transcriptome Dynamics of Developing Rod Photoreceptors. , 2016, Cell reports.

[70]  M. Reinders,et al.  Genetic network modeling. , 2002, Pharmacogenomics.

[71]  Tian Zheng,et al.  Inference of Regulatory Gene Interactions from Expression Data Using Three‐Way Mutual Information , 2009, Annals of the New York Academy of Sciences.

[72]  Martin Hofmann-Apitius,et al.  Text mining for systems biology. , 2014, Drug discovery today.

[73]  Hong Ling,et al.  Novel recurrent neural network for modelling biological networks: Oscillatory p53 interaction dynamics , 2013, Biosyst..

[74]  Rudiyanto Gunawan,et al.  Optimal design of gene knockout experiments for gene regulatory network inference , 2015, Bioinform..

[75]  G. Stein,et al.  Loss of RUNX1 is associated with aggressive lung adenocarcinomas , 2018, Journal of cellular physiology.

[76]  Constantin F. Aliferis,et al.  The max-min hill-climbing Bayesian network structure learning algorithm , 2006, Machine Learning.

[77]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[78]  Ilya Shmulevich,et al.  Structure learning for Bayesian networks as models of biological networks. , 2013, Methods in molecular biology.

[79]  Xiaoping Zhou,et al.  Whole transcriptome analysis with sequencing: methods, challenges and potential solutions , 2015, Cellular and Molecular Life Sciences.

[80]  Rui Chen,et al.  A scalable algorithm for structure identification of complex gene regulatory network from temporal expression data , 2016, bioRxiv.

[81]  Ping Lin,et al.  Evaluation and improvement of the regulatory inference for large co-expression networks with limited sample size , 2017, BMC Systems Biology.

[82]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[83]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[84]  Tatsuya Akutsu,et al.  Identifying a Probabilistic Boolean Threshold Network From Samples , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[85]  Edward R. Dougherty,et al.  Validation of gene regulatory network inference based on controllability , 2013, Front. Genet..

[86]  Eberhard O. Voit,et al.  Computational Analysis of Biochemical Systems: A Practical Guide for Biochemists and Molecular Biologists , 2000 .

[87]  Mark Craven,et al.  A review of active learning approaches to experimental design for uncovering biological networks , 2017, PLoS Comput. Biol..

[88]  Kathleen Marchal,et al.  SynTReN: a generator of synthetic gene expression data for design and analysis of structure learning algorithms , 2006, BMC Bioinformatics.

[89]  Amir Alavi,et al.  The feasibility of genome-scale biological network inference using Graphics Processing Units , 2017, Algorithms for Molecular Biology.

[90]  Duncan Fyfe Gillies,et al.  A Review of Feature Selection and Feature Extraction Methods Applied on Microarray Data , 2015, Adv. Bioinformatics.

[91]  Clair Gallagher,et al.  Towards next generation CHO cell biology: Bioinformatics methods for RNA‐Seq‐based expression profiling , 2015, Biotechnology journal.

[92]  Matthew N McCall,et al.  Estimation of Gene Regulatory Networks. , 2013, Postdoc journal : a journal of postdoctoral research and postdoctoral affairs.

[93]  Hector Zenil,et al.  Evaluating Network Inference Methods in Terms of Their Ability to Preserve the Topology and Complexity of Genetic Networks , 2015, Seminars in cell & developmental biology.

[94]  Z. Duan,et al.  Inferring gene and protein interactions using PubMed citations and consensus Bayesian networks , 2017, PLoS ONE.

[95]  Alessandro Vullo,et al.  Ensembl 2015 , 2014, Nucleic Acids Res..

[96]  Xiaodong Wang,et al.  Regularized EM algorithm for sparse parameter estimation in nonlinear dynamic systems with application to gene regulatory network inference , 2014, EURASIP J. Bioinform. Syst. Biol..

[97]  Abdollah Homaifar,et al.  Identifying time-delayed gene regulatory networks via an evolvable hierarchical recurrent neural network , 2017, BioData Mining.

[98]  Zhuo Wang,et al.  CMIP: a software package capable of reconstructing genome-wide regulatory networks using gene expression data , 2016, BMC Bioinformatics.

[99]  Hector Zenil,et al.  HiDi: an efficient reverse engineering schema for large‐scale dynamic regulatory network reconstruction using adaptive differentiation , 2017, Bioinform..

[100]  Norberto Díaz-Díaz,et al.  Gene network coherence based on prior knowledge using direct and indirect relationships , 2015, Comput. Biol. Chem..

[101]  Dario Floreano,et al.  Generating Realistic In Silico Gene Networks for Performance Assessment of Reverse Engineering Methods , 2009, J. Comput. Biol..

[102]  A. Swaroop,et al.  Next generation sequencing technology and genomewide data analysis: Perspectives for retinal research , 2016, Progress in Retinal and Eye Research.

[103]  Richard Bonneau,et al.  DREAM3: Network Inference Using Dynamic Context Likelihood of Relatedness and the Inferelator , 2010, PloS one.

[104]  Bei Sun,et al.  Bioinformatic Analysis Identifies Three Potentially Key Differentially Expressed Genes in Peripheral Blood Mononuclear Cells of Patients with Takayasu’s Arteritis , 2017, Cell journal.

[105]  Jing Liu,et al.  A time series driven decomposed evolutionary optimization approach for reconstructing large-scale gene regulatory networks based on fuzzy cognitive maps , 2017, BMC Bioinformatics.

[106]  V. Belcastro Reverse engieering gene regulatory networks : elucidation of trancriptome organization, gene function and gene regulation in mammalian systems , 2011 .

[107]  N. D. Clarke,et al.  Towards a Rigorous Assessment of Systems Biology Models: The DREAM3 Challenges , 2010, PloS one.

[108]  Gerardo Coello,et al.  ARACNe-based inference, using curated microarray data, of Arabidopsis thaliana root transcriptional regulatory networks , 2014, BMC Plant Biology.

[109]  J. T. Dunnen,et al.  Next generation sequencing technology: Advances and applications. , 2014, Biochimica et biophysica acta.

[110]  Dario Floreano,et al.  GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods , 2011, Bioinform..

[111]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[112]  Diogo M. Camacho,et al.  Wisdom of crowds for robust gene network inference , 2012, Nature Methods.

[113]  Jiajia Chen,et al.  Biological Networks for Cancer Candidate Biomarkers Discovery , 2016, Cancer informatics.

[114]  S Fuhrman,et al.  Reveal, a general reverse engineering algorithm for inference of genetic network architectures. , 1998, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[115]  Stephanie A. Tammen,et al.  Epigenetics: the link between nature and nurture. , 2013, Molecular aspects of medicine.

[116]  Riet De Smet,et al.  Advantages and limitations of current network inference methods , 2010, Nature Reviews Microbiology.

[117]  Ulf Leser,et al.  Estimating genome-wide regulatory activity from multi-omics data sets using mathematical optimization , 2017, BMC Systems Biology.

[118]  Zalmiyah Zakaria,et al.  A review on the computational approaches for gene regulatory network construction , 2014, Comput. Biol. Medicine.

[119]  Tianyu Kang,et al.  A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data , 2017, BMC Bioinformatics.

[120]  Sandra Heise,et al.  TRaCE+: Ensemble inference of gene regulatory networks from transcriptional expression profiles of gene knock-out experiments , 2016, BMC Bioinformatics.

[121]  Christophe Lemetre,et al.  Artificial Neural Network Inference (ANNI): A Study on Gene-Gene Interaction for Biomarkers in Childhood Sarcomas , 2014, PloS one.

[122]  Rina Dechter,et al.  Generalized best-first search strategies and the optimality of A* , 1985, JACM.

[123]  Éric Gaussier,et al.  A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation , 2005, ECIR.

[124]  Xiao-Lu Wang,et al.  Utilizing benchmarked dataset and gene regulatory network to investigate hub genes in postmenopausal osteoporosis , 2020, Journal of cancer research and therapeutics.

[125]  Georgina Stegmayer,et al.  Extreme learning machines for reverse engineering of gene regulatory networks from expression time series , 2018, Bioinform..

[126]  Nicholas J. Radcliffe,et al.  Genetic set recombination and its application to neural network topology optimisation , 1993, Neural Computing & Applications.

[127]  Cláudio T. Silva,et al.  Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks , 2014, IEEE Transactions on Visualization and Computer Graphics.

[128]  Nicole Radde,et al.  Inferring Gene Regulatory Networks from Expression Data , 2019 .

[129]  R. Bellman Dynamic programming. , 1957, Science.

[130]  Michael Hecker,et al.  Gene regulatory network inference: Data integration in dynamic models - A review , 2009, Biosyst..

[131]  Liang Liang,et al.  Diagnostic significance and potential function of miR-338-5p in hepatocellular carcinoma: A bioinformatics study with microarray and RNA sequencing data , 2017, Molecular medicine reports.

[132]  Edward R. Dougherty,et al.  Validation of gene regulatory networks: scientific and inferential , 2011, Briefings Bioinform..

[133]  E. Davidson,et al.  Gene regulatory networks for development. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[134]  Ying Li,et al.  From milliseconds to lifetimes: tracking the dynamic behavior of transcription factors in gene networks. , 2015, Trends in genetics : TIG.

[135]  Chunjin Li,et al.  Analysis of miRNA expression profiles in melatonin-exposed GC-1 spg cell line. , 2018, Gene.

[136]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[137]  H. Mahjub,et al.  F-MAP: A Bayesian approach to infer the gene regulatory network using external hints , 2017, PloS one.

[138]  Benjamin Haibe-Kains,et al.  Quantitative assessment and validation of network inference methods in bioinformatics , 2014, Front. Genet..