UNIPred-Web: a web tool for the integration and visualization of biomolecular networks for protein function prediction

Background: One of the main issues in the automated protein function prediction (AFP) problem is the integration of multiple networked data sources. The UNIPred algorithm was proposed to efficiently integrate protein networks in a function-specific fashion, taking into account the imbalance that characterizes protein annotations, and to subsequently predict novel hypotheses about unannotated proteins. UNIPred is publicly available as R code, which may limit its use by non-expert users. Moreover, applying it requires effort to acquire and prepare the networks to be integrated. Finally, the UNIPred source code does not handle the visualization of the resulting consensus network, whereas suitable views of the network topology are necessary to explore and interpret existing protein relationships.

Results: We address these issues by proposing UNIPred-Web, a user-friendly Web tool for applying the UNIPred algorithm to a variety of biomolecular networks already supplied by the system, and for visualizing and exploring protein networks. We support different organisms and different types of networks, e.g., co-expression, shared domains and physical interaction networks. Users are supported in the different phases of the process, ranging from the selection of the networks and of the protein function to be predicted, to the navigation of the integrated network. The system also supports the upload of user-defined protein networks. The vertex-centric, highly interactive approach of UNIPred-Web allows both a focused exploration of specific proteins and an interactive analysis of large sub-networks with only a few mouse clicks.

Conclusions: UNIPred-Web offers practical and intuitive (visual) guidance to biologists interested in gaining insights into protein biomolecular functions. UNIPred-Web provides facilities for the integration of networks, and supplies a framework for the imbalance-aware protein network integration of nine organisms, the prediction of thousands of GO protein functions, and an easy-to-use graphical interface for the visual analysis, navigation and interpretation of the integrated networks and of the functional predictions.
