Substructure counting graph kernels for machine learning from RDF data

In this paper we introduce a framework for learning from RDF data using graph kernels that count substructures in RDF graphs, which systematically covers most of the existing kernels previously defined and provides a number of new variants. Our definitions include fast kernel variants that are computed directly on the RDF graph. To improve the performance of these kernels we detail two strategies. The first strategy involves ignoring the vertex labels that have a low frequency among the instances. Our second strategy is to remove hubs to simplify the RDF graphs. We test our kernels in a number of classification experiments with real-world RDF datasets. Overall the kernels that count subtrees show the best performance. However, they are closely followed by simple bag of labels baseline kernels. The direct kernels substantially decrease computation time, while keeping performance the same. For the walks counting kernel this decrease in computation time is so large that it thereby becomes a computationally viable kernel to use. Ignoring low frequency labels improves the performance for all datasets. The hub removal algorithm increases performance on two out of three of our smaller datasets, but has little impact when used on our larger datasets. Systematic graph kernel framework for RDF.Fast computation algorithms.Low frequency labels and hub removal on RDF to enhance machine learning.

[1]  Stephan Bloehdorn,et al.  The SWRC Ontology - Semantic Web for Research Communities , 2005, EPIA.

[2]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .

[3]  Claudio Gutiérrez,et al.  Bipartite Graphs as Intermediate Model for RDF , 2004, SEMWEB.

[4]  Thanh Tran,et al.  Relational Kernel Machines for Learning from Graph-Structured RDF Data , 2011, ESWC.

[5]  Matthew Rowe Transferring Semantic Categories with Vertex Kernels: Recommendations with SemanticSVD++ , 2014, International Semantic Web Conference.

[6]  Aart J. C. Bik,et al.  Pregel: a system for large-scale graph processing , 2010, SIGMOD Conference.

[7]  2014 IEEE International Conference on Data Mining, ICDM 2014, Shenzhen, China, December 14-17, 2014 , 2014, ICDM.

[8]  Achim Rettinger,et al.  Mining the Semantic Web , 2012, Data Mining and Knowledge Discovery.

[9]  Hans-Peter Kriegel,et al.  A scalable approach for statistical learning in semantic graphs , 2014, Semantic Web.

[10]  Stephan Bloehdorn,et al.  Graph Kernels for RDF Data , 2012, ESWC.

[11]  Jens Lehmann,et al.  DL-Learner: Learning Concepts in Description Logics , 2009, J. Mach. Learn. Res..

[12]  Chia-Hua Ho,et al.  Recent Advances of Large-Scale Linear Classification , 2012, Proceedings of the IEEE.

[13]  S. Sathiya Keerthi,et al.  Efficient algorithms for ranking with SVMs , 2010, Information Retrieval.

[14]  Hans-Peter Kriegel,et al.  Factorizing YAGO: scalable machine learning for linked data , 2012, WWW.

[15]  Inderjit S. Dhillon,et al.  Weighted Graph Cuts without Eigenvectors A Multilevel Approach , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[17]  Hans-Peter Kriegel,et al.  A Three-Way Model for Collective Learning on Multi-Relational Data , 2011, ICML.

[18]  Kurt Mehlhorn,et al.  Weisfeiler-Lehman Graph Kernels , 2011, J. Mach. Learn. Res..

[19]  Joseph M. Hellerstein,et al.  Distributed GraphLab: A Framework for Machine Learning in the Cloud , 2012, Proc. VLDB Endow..

[20]  Anthony Widjaja,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2003, IEEE Transactions on Neural Networks.

[21]  Nicola Fanizzi,et al.  A Gaussian Process Model for Knowledge Propagation in Web Ontologies , 2014, 2014 IEEE International Conference on Data Mining.

[22]  Heiko Paulheim,et al.  A Comparison of Propositionalization Strategies for Creating Features from Linked Open Data , 2014, LD4KD.

[23]  Gerben de Vries,et al.  Simplifying RDF Data for Graph-Based Machine Learning , 2014, KNOW@LOD.

[24]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[25]  Kristian Kersting,et al.  Explicit Versus Implicit Graph Feature Maps: A Computational Phase Transition for Walk Kernels , 2014, 2014 IEEE International Conference on Data Mining.

[26]  Tommaso Di Noia,et al.  A Linked Data Recommender System Using a Neighborhood-Based Graph Kernel , 2014, EC-Web.

[27]  Johannes Fürnkranz,et al.  Unsupervised generation of data mining features from linked open data , 2012, WIMS '12.

[28]  Bernhard Schölkopf,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[29]  Antoine Isaac,et al.  Supporting Linked Data Production for Cultural Heritage Institutes: The Amsterdam Museum Case Study , 2012, ESWC.

[30]  Peter A. Flach,et al.  Propositionalization approaches to relational data mining , 2001 .

[31]  Kurt Mehlhorn,et al.  Efficient graphlet kernels for large graph comparison , 2009, AISTATS.

[32]  Volker Tresp,et al.  Mining the Semantic Web Statistical Learning for Next Generation Knowledge Bases , 2012 .

[33]  S. V. N. Vishwanathan,et al.  Graph kernels , 2007 .

[34]  Hans-Peter Kriegel,et al.  Multivariate Prediction for Learning on the Semantic Web , 2010, ILP.

[35]  Marko Grobelnik,et al.  Proceedings of the 8th extended semantic web conference on The semantic web: research and applications - Volume Part I , 2011 .

[36]  Stephan Bloehdorn,et al.  Kernel Methods for Mining Instance Data in Ontologies , 2007, ISWC/ASWC.

[37]  Gerben de Vries A Fast Approximation of the Weisfeiler-Lehman Graph Kernel for RDF Data , 2013, ECML/PKDD.

[38]  Nicola Fanizzi,et al.  Induction of robust classifiers for web ontologies through kernel machines , 2012, J. Web Semant..

[39]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[40]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[41]  Steven de Rooij,et al.  A Fast and Simple Graph Kernel for RDF , 2013, DMoLD.

[42]  Danqi Chen,et al.  Reasoning With Neural Tensor Networks for Knowledge Base Completion , 2013, NIPS.

[43]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[44]  Wei Zhang,et al.  Knowledge vault: a web-scale approach to probabilistic knowledge fusion , 2014, KDD.