DEEP: decomposition feature enhancement procedure for graphs

When dealing with machine learning on graphs, one of the most successfully approaches is the one of kernel methods. Depending if one is interested in predicting properties of graphs (e.g. graph classification) or to predict properties of nodes in a single graph (e.g. graph node classification), different kernel functions should be adopted. In the last few years, several kernels for graphs have been defined in literature that extract local features from the input graphs, obtaining both efficiency and state-of-the-art predictive performances. Recently, some work has been done in this direction also regarding graph node kernels, but the majority of the graph node kernels available in literature consider only global information, that can be not optimal for many tasks. In this paper, we propose a procedure that allows to transform a local graph kernel in a kernel for nodes in a single, huge graph. We apply a specific instantiation to the task of disease gene prioritization from the bioinformatics domain, improving the state of the art in many diseases.

[1]  Fang-Xiang Wu,et al.  Disease gene identification by using graph kernels and Markov random fields , 2014, Science China Life Sciences.

[2]  M. DePamphilis,et al.  HUMAN DISEASE , 1957, The Ulster Medical Journal.

[3]  Alessandro Sperduti,et al.  Ordered Decompositional DAG kernels enhancements , 2015, Neurocomputing.

[4]  François Fouss,et al.  An Experimental Investigation of Graph Kernels on a Collaborative Recommendation Task , 2006, Sixth International Conference on Data Mining (ICDM'06).

[5]  Pavel Yu. Chebotarev,et al.  The Matrix-Forest Theorem and Measuring Relations in Small Social Groups , 2006, ArXiv.

[6]  John D. Lafferty,et al.  Diffusion Kernels on Graphs and Other Discrete Input Spaces , 2002, ICML.

[7]  Giorgio Valentini,et al.  An extensive analysis of disease-gene associations using network integration and fast kernel-based gene prioritization methods , 2014, Artif. Intell. Medicine.

[8]  Fabrizio Costa,et al.  Fast Neighborhood Subgraph Pairwise Distance Kernel , 2010, ICML.

[9]  Sanasam Ranbir Singh,et al.  Modeling evolution of a social network using temporalgraph kernels , 2014, SIGIR.

[10]  Gene Ontology Consortium The Gene Ontology (GO) database and informatics resource , 2003 .

[11]  A. Barabasi,et al.  The human disease network , 2007, Proceedings of the National Academy of Sciences.

[12]  Alessandro Sperduti,et al.  A Tree-Based Kernel for Graphs , 2012, SDM.

[13]  Alessandro Sperduti,et al.  The Conjunctive Disjunctive Node Kernel , 2017, ESANN.

[14]  Hsinchun Chen,et al.  A graph-based recommender system for digital library , 2002, JCDL '02.

[15]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .