Learning random walks to rank nodes in graphs

Ranking nodes in graphs is of much recent interest. Edges, via the graph Laplacian, are used to encourage local smoothness of node scores in SVM-like formulations with generalization guarantees. In contrast, Page-rank variants are based on Markovian random walks. For directed graphs, there is no simple known correspondence between these views of scoring/ranking. Recent scalable algorithms for learning the Pagerank transition probabilities do not have generalization guarantees. In this paper we show some correspondence results between the Laplacian and the Pagerank approaches, and give new generalization guarantees for the latter. We enhance the Pagerank-learning approaches to use an additive margin. We also propose a general framework for rank-sensitive score-learning, and apply it to Laplacian smoothing. Experimental results are promising.

[1]  Carl D. Meyer,et al.  Deeper Inside PageRank , 2004, Internet Math..

[2]  Ben Taskar,et al.  Learning structured prediction models: a large margin approach , 2005, ICML.

[3]  Shivani Agarwal,et al.  Ranking on graph data , 2006, ICML.

[4]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[5]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[6]  Christopher J. C. Burges,et al.  High accuracy retrieval with multiple nested ranker , 2006, SIGIR.

[7]  Soumen Chakrabarti,et al.  Learning to rank networked entities , 2006, KDD '06.

[8]  Andrew Trotman,et al.  Learning to Rank , 2005, Information Retrieval.

[9]  B. Schölkopf,et al.  A Regularization Framework for Learning from Graph Data , 2004, ICML 2004.

[10]  Bernhard Schölkopf,et al.  Learning from labeled and unlabeled data on a directed graph , 2005, ICML.

[11]  André Elisseeff,et al.  Stability and Generalization , 2002, J. Mach. Learn. Res..

[12]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[13]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[14]  Thomas Hofmann,et al.  Learning to Rank with Nonsmooth Cost Functions , 2006, NIPS.

[15]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[16]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[17]  Klaus Obermayer,et al.  Support vector learning for ordinal regression , 1999 .

[18]  Alexander J. Smola,et al.  Kernels and Regularization on Graphs , 2003, COLT.

[19]  Christos Faloutsos,et al.  R-MAT: A Recursive Model for Graph Mining , 2004, SDM.

[20]  Cynthia Rudin,et al.  Ranking with a P-Norm Push , 2006, COLT.

[21]  Shivani Agarwal,et al.  Stability and Generalization of Bipartite Ranking Algorithms , 2005, COLT.

[22]  Zoubin Ghahramani,et al.  Proceedings, Twenty-Fourth International Conference on Machine Learning , 2007 .

[23]  F. Chung Laplacians and the Cheeger Inequality for Directed Graphs , 2005 .