Stability Analysis of Learning Algorithms for Ontology Similarity Computation

Ontology, as a useful tool, is widely applied in lots of areas such as social science, computer science, and medical science. Ontology concept similarity calculation is the key part of the algorithms in these applications. A recent approach is to make use of similarity between vertices on ontology graphs. It is, instead of pairwise computations, based on a function that maps the vertex set of an ontology graph to real numbers. In order to obtain this, the ranking learning problem plays an important and essential role, especially k-partite ranking algorithm, which is suitable for solving some ontology problems. A ranking function is usually used to map the vertices of an ontology graph to numbers and assign ranks of the vertices through their scores. Through studying a training sample, such a function can be learned. It contains a subset of vertices of the ontology graph. A good ranking function means small ranking mistakes and good stability. For ranking algorithms, which are in a well-stable state, we study generalization bounds via some concepts of algorithmic stability. We also find that kernel-based ranking algorithms stated as regularization schemes in reproducing kernel Hilbert spaces satisfy stability conditions and have great generalization abilities.

[1]  Colin McDiarmid,et al.  Surveys in Combinatorics, 1989: On the method of bounded differences , 1989 .

[2]  André Elisseeff,et al.  Stability and Generalization , 2002, J. Mach. Learn. Res..

[3]  S. Smale,et al.  ESTIMATING THE APPROXIMATION ERROR IN LEARNING THEORY , 2003 .

[4]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[5]  S. Rajaram,et al.  Generalization Bounds for k-Partite Ranking , 2005 .

[6]  Shivani Agarwal,et al.  Stability and Generalization of Bipartite Ranking Algorithms , 2005, COLT.

[7]  Shivani Agarwal,et al.  Ranking on graph data , 2006, ICML.

[8]  Gene H. Golub,et al.  Generalized cross-validation as a method for choosing a good ridge parameter , 1979, Milestones in Matrix Computation.

[9]  Mikhail Belkin,et al.  Consistency of spectral clustering , 2008, 0804.0678.

[10]  Shivani Agarwal,et al.  Generalization Bounds for Ranking Algorithms via Algorithmic Stability , 2009, J. Mach. Learn. Res..

[11]  S. Smale,et al.  Geometry on Probability Spaces , 2009 .

[12]  Shivani Agarwal,et al.  Learning to rank on graphs , 2010, Machine Learning.

[13]  Y. Yao,et al.  Cross-validation based adaptation for regularization operators in learning theory , 2010 .

[14]  Ting Hu,et al.  ONLINE REGRESSION WITH VARYING GAUSSIANS AND NON-IDENTICAL DISTRIBUTIONS , 2011 .

[15]  Dachun Yang,et al.  REAL-VARIABLE CHARACTERIZATIONS OF HARDY SPACES ASSOCIATED WITH BESSEL OPERATORS , 2011, 1102.1217.

[16]  Gao Wei,et al.  Convergence of spectral clustering with a general similarity function , 2012 .

[17]  Ding-Xuan Zhou,et al.  An approximation theory approach to learning with ℓ1 regularization , 2013, J. Approx. Theory.