BiomeNet: a database for construction and analysis of functional interaction networks for any species with a sequenced genome

Abstract Motivation Owing to advanced DNA sequencing and genome assembly technology, the number of species with sequenced genomes is rapidly increasing. The aim of the recently launched Earth BioGenome Project is to sequence genomes of all eukaryotic species on Earth over the next 10 years, making it feasible to obtain genomic blueprints of the majority of animal and plant species by this time. Genetic models of the sequenced species will later be subject to functional annotation, and a comprehensive molecular network should facilitate functional analysis of individual genes and pathways. However, network databases are lagging behind genome sequencing projects as even the largest network database provides gene networks for less than 10% of sequenced eukaryotic genomes, and the knowledge gap between genomes and interactomes continues to widen. Results We present BiomeNet, a database of 95 scored networks comprising over 8 million co-functional links, which can build and analyze gene networks for any species with the sequenced genome. BiomeNet transfers functional interactions between orthologous proteins from source networks to the target species within minutes and automatically constructs gene networks with the quality comparable to that of existing networks. BiomeNet enables assembly of the first-in-species gene networks not available through other databases, which are highly predictive of diverse biological processes and can also provide network analysis by extracting subnetworks for individual biological processes and network-based gene prioritizations. These data indicate that BiomeNet could enhance the benefits of decoding the genomes of various species, thus improving our understanding of the Earth’ biodiversity. Availability and implementation The BiomeNet is freely available at http://kobic.re.kr/biomenet/. Supplementary information Supplementary data are available at Bioinformatics online.

[1]  Chan Yeong Kim,et al.  Network-based genetic investigation of virulence-associated phenotypes in methicillin-resistant Staphylococcus aureus , 2018, Scientific Reports.

[2]  Jung Eun Shim,et al.  EcoliNet: a database of cofunctional gene network for Escherichia coli , 2015, Database J. Biol. Databases Curation.

[3]  C. Dessimoz,et al.  Bidirectional Best Hits Miss Many Orthologs in Duplication-Rich Clades such as Plants and Animals , 2013, Genome biology and evolution.

[4]  The Gene Ontology Consortium,et al.  Expansion of the Gene Ontology knowledgebase and resources , 2016, Nucleic Acids Res..

[5]  G. Sumara,et al.  A Probabilistic Functional Network of Yeast Genes , 2004 .

[6]  Baldomero Oliva,et al.  BIPS: BIANA Interolog Prediction Server. A tool for protein–protein interaction inference , 2012, Nucleic Acids Res..

[7]  Sunmo Yang,et al.  HumanNet v2: human gene networks for disease research , 2018, Nucleic Acids Res..

[8]  Insuk Lee,et al.  MaizeNet: a co-functional network for network-assisted systems genetics in Zea mays. , 2019, The Plant journal : for cell and molecular biology.

[9]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[10]  Hyojin Kim,et al.  FlyNet: a versatile network prioritization server for the Drosophila community , 2015, Nucleic Acids Res..

[11]  Robert B. Russell,et al.  InterPreTS: protein Interaction Prediction through Tertiary Structure , 2003, Bioinform..

[12]  Chao Xie,et al.  Fast and sensitive protein alignment using DIAMOND , 2014, Nature Methods.

[13]  Christian E. V. Storm,et al.  Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. , 2001, Journal of molecular biology.

[14]  M. Gerstein,et al.  Annotation transfer between genomes: protein-protein interologs and protein-DNA regulogs. , 2004, Genome research.

[15]  J. Bennetzen,et al.  Brachypodium distachyon and Setaria viridis: Model Genetic Systems for the Grasses. , 2015, Annual review of plant biology.

[16]  Hyojin Kim,et al.  RiceNet v2: an improved network prioritization server for rice genes , 2015, Nucleic Acids Res..

[17]  Gary D. Bader,et al.  Cytoscape.js: a graph theory library for visualisation and analysis , 2015, Bioinform..

[18]  The Gene Ontology Consortium Expansion of the Gene Ontology knowledgebase and resources , 2016, Nucleic Acids Res..

[19]  Hyojin Kim,et al.  MouseNet v2: a database of gene networks for studying the laboratory mouse and eight other model vertebrates , 2015, Nucleic Acids Res..

[20]  Max J. Feldman,et al.  Grasses suppress shoot-borne roots to conserve water during drought , 2016, Proceedings of the National Academy of Sciences.

[21]  Insuk Lee,et al.  A Genome-Scale Co-Functional Network of Xanthomonas Genes Can Accurately Reconstruct Regulatory Circuits Controlled by Two-Component Signaling Systems , 2019, Molecules and cells.

[22]  J. Heitman,et al.  Network-assisted genetic dissection of pathogenicity and drug resistance in the opportunistic human pathogenic fungus Cryptococcus neoformans , 2015, Scientific Reports.

[23]  Insuk Lee,et al.  JiffyNet: a web-based instant protein network modeler for newly sequenced species , 2013, Nucleic Acids Res..

[24]  Insuk Lee,et al.  Construction of Functional Gene Networks Using Phylogenetic Profiles. , 2017, Methods in molecular biology.

[25]  Jung Eun Shim,et al.  Weighted mutual information analysis substantially improves domain-based functional network models , 2016, Bioinform..

[26]  Chan Yeong Kim,et al.  Network Integrative Genomic and Transcriptomic Analysis of Carbapenem-Resistant Klebsiella pneumoniae Strains Identifies Genes for Antibiotic Resistance and Virulence , 2019, mSystems.

[27]  E. Marcotte,et al.  Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana , 2010, Nature Biotechnology.

[28]  Jung Eun Shim,et al.  From sequencing data to gene functions: co-functional network approaches , 2017, Animal cells and systems.

[29]  Insuk Lee,et al.  SoyNet: a database of co-functional networks for soybean Glycine max , 2016, Nucleic Acids Res..

[30]  I-Min A. Chen,et al.  Genomes OnLine database (GOLD) v.7: updates and new features , 2018, Nucleic Acids Res..

[31]  Chan Yeong Kim,et al.  Functional gene networks based on the gene neighborhood in metagenomes , 2017 .

[32]  Damian Szklarczyk,et al.  STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets , 2018, Nucleic Acids Res..

[33]  A. Iyer-Pascuzzi,et al.  TomatoNet: A Genome-wide Co-functional Network for Unveiling Complex Traits of Tomato, a Model Crop for Fleshy Fruits. , 2017, Molecular plant.

[34]  Juan Carlos Castilla-Rubio,et al.  Earth BioGenome Project: Sequencing life for the future of life , 2018, Proceedings of the National Academy of Sciences.

[35]  Insuk Lee,et al.  Complementarity between distance- and probability-based methods of gene neighbourhood identification for pathway reconstruction. , 2014, Molecular bioSystems.

[36]  Eiru Kim,et al.  A network of human functional gene interactions from knockout fitness screens in cancer cells , 2019, Life Science Alliance.

[37]  Zhou Du,et al.  agriGO v2.0: a GO analysis toolkit for the agricultural community, 2017 update , 2017, Nucleic Acids Res..

[38]  E. Marcotte,et al.  It's the machine that matters: Predicting gene function and phenotype from protein networks. , 2010, Journal of proteomics.

[39]  Hyojin Kim,et al.  YeastNet v3: a public database of data-specific and integrated functional gene networks for Saccharomyces cerevisiae , 2013, Nucleic Acids Res..

[40]  Jung Eun Shim,et al.  Pathway-specific protein domains are predictive for human diseases , 2019, PLoS Comput. Biol..

[41]  Hyojin Kim,et al.  WormNet v3: a network-assisted hypothesis-generating server for Caenorhabditis elegans , 2014, Nucleic Acids Res..

[42]  Chan Yeong Kim,et al.  Function-driven discovery of disease genes in zebrafish using an integrated genomics big data resource , 2016, Nucleic acids research.

[43]  Chan Yeong Kim,et al.  Network-assisted investigation of virulence and antibiotic-resistance systems in Pseudomonas aeruginosa , 2016, Scientific Reports.

[44]  Insuk Lee,et al.  Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling , 2015, PloS one.

[45]  Hyojin Kim,et al.  AraNet v2: an improved database of co-functional gene networks for the study of Arabidopsis thaliana and 27 other nonmodel plant species , 2014, Nucleic Acids Res..