Hypergraph models of biological networks to identify genes critical to pathogenic viral response

Background: Representing biological networks as graphs is a powerful approach for revealing underlying patterns, signatures, and critical components in high-throughput biomolecular data. However, graphs do not natively capture the multi-way relationships present among genes and proteins in biological systems. Hypergraphs are generalizations of graphs that naturally model multi-way relationships and have shown promise in modeling systems such as protein complexes and metabolic reactions. In this paper we seek to understand how hypergraphs can more faithfully identify, and potentially predict, important genes based on complex relationships inferred from genomic expression data sets.

Results: We compiled a novel data set of transcriptional host response to pathogenic viral infections and formulated relationships between genes as a hypergraph in which hyperedges represent significantly perturbed genes and vertices represent individual biological samples with specific experimental conditions. We find that hypergraph betweenness centrality identifies genes important to viral response better than graph centrality does.

Conclusions: Our results demonstrate the utility of using hypergraphs to represent complex biological systems and highlight important central responses common to a variety of highly pathogenic viruses.
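The construction described in the Results can be illustrated with a small sketch. It assumes one common formulation of hypergraph betweenness: two hyperedges (genes) are linked when they share at least s vertices (samples), and betweenness is then computed on that "s-line graph". This is an assumption for illustration, not necessarily the authors' exact procedure. The gene and condition names are hypothetical toy labels, not data from the paper, and the helper names `s_line_graph` and `betweenness` are invented for this example.

```python
from collections import deque
from itertools import combinations


def s_line_graph(hyperedges, s=1):
    """Link two hyperedges when they share at least s vertices."""
    adj = {name: set() for name in hyperedges}
    for a, b in combinations(hyperedges, 2):
        if len(hyperedges[a] & hyperedges[b]) >= s:
            adj[a].add(b)
            adj[b].add(a)
    return adj


def betweenness(adj):
    """Unnormalized betweenness (Brandes' algorithm, unweighted, undirected)."""
    scores = dict.fromkeys(adj, 0.0)
    for src in adj:
        order = []                      # nodes in order of discovery
        preds = {v: [] for v in adj}    # shortest-path predecessors
        sigma = dict.fromkeys(adj, 0)   # number of shortest paths from src
        sigma[src] = 1
        dist = {src: 0}
        queue = deque([src])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = dict.fromkeys(adj, 0.0)
        for w in reversed(order):       # accumulate dependencies back to src
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != src:
                scores[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: c / 2 for v, c in scores.items()}


# Hypothetical toy data: each hyperedge is a gene, and its vertices are the
# samples/conditions in which that gene is significantly perturbed.
hyperedges = {
    "g1": {"condA1", "condA2"},
    "g2": {"condA2", "condA3"},
    "bridge": {"condA3", "condB1"},
    "g3": {"condB1", "condB2"},
    "g4": {"condB2", "condB3"},
}

scores = betweenness(s_line_graph(hyperedges, s=1))
for gene, score in sorted(scores.items(), key=lambda kv: -kv[1]):
    print(gene, score)
```

In this toy case the gene labeled `bridge` scores highest because it is the only hyperedge sharing samples with both groups of conditions. Raising `s` requires genes to co-occur in more samples before they are linked, which trades connectivity for stricter evidence of a shared response.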
