Graph Neural Networks for Predicting Protein Functions

Learning the functions associated with a protein is essential to gaining insights for disease diagnostics, medical treatment, and human biology. In this paper, protein function prediction is posed as a semi-supervised learning task over multi-relational graphs, and it is tackled using a graph neural network (GNN) approach. The novel GNN architecture employs multi-relational graphs and weighs the influence of the different relations via learnable parameters. The ultimate goal is to design a powerful learning architecture able to: discover complex and highly nonlinear data associations, combine (and select) multiple types of relations, and scale gracefully with respect to the size of the graph. Numerical tests with protein networks corroborate the performance gains relative to state-of-the-art alternatives.

[1]  Leman Akoglu,et al.  Robust Semi-Supervised Learning on Multiple Networks with Noise , 2018, PAKDD.

[2]  Santiago Segarra,et al.  Sampling of Graph Signals With Successive Local Aggregations , 2015, IEEE Transactions on Signal Processing.

[3]  Jason Weston,et al.  Deep learning via semi-supervised embedding , 2008, ICML '08.

[4]  Xavier Bresson,et al.  Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[5]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Ruslan Salakhutdinov,et al.  Revisiting Semi-Supervised Learning with Graph Embeddings , 2016, ICML.

[8]  Georgios B. Giannakis,et al.  Kernel-Based Semi-Supervised Learning Over Multilayer Graphs , 2018, 2018 IEEE 19th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[9]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[10]  Mason A. Porter,et al.  Multilayer networks , 2013, J. Complex Networks.

[11]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[12]  Pierre Vandergheynst,et al.  Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[13]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[14]  Alejandro Ribeiro,et al.  CONVOLUTIONAL NEURAL NETWORKS VIA NODE-VARYING GRAPH FILTERS , 2017, 2018 IEEE Data Science Workshop (DSW).

[15]  Max Welling,et al.  Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[16]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[17]  Jure Leskovec,et al.  Predicting multicellular function through multi-layer tissue networks , 2017, Bioinform..

[18]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[19]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[20]  Pascal Frossard,et al.  The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains , 2012, IEEE Signal Processing Magazine.

[21]  Georgios B. Giannakis,et al.  Kernel-based Inference of Functions over Graphs , 2017, ArXiv.