Rank Prediction in Graphs with Locally Weighted Polynomial Regression and EM of Polynomial Mixture Models

In this paper we describe a learning framework enabling ranking predictions for graph nodes based solely on individual local historical data. The two learning algorithms capitalize on the multi feature vectors of nodes in graphs that evolve in time. In the first case we use weighted polynomial regression (LWPR) while in the second we consider the Expectation Maximization (EM) algorithm to fit a mixture of polynomial regression models. The first method uses separate weighted polynomial regression models for each web page, while the second algorithm capitalizes on group behavior, thus taking advantage of the possible interdependence between web pages. The prediction quality is quantified as the similarity between the predicted and the actual rankings and compared to alternative baseline predictor. We performed extensive experiments on a real world data set (the Wikipedia graph). The results are very encouraging.

[1]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[2]  Klaus Berberich,et al.  Representing and Quantifying Rank - Change for the Web Graph , 2006, WAW.

[3]  Jenq-Haur Wang,et al.  Finding Event-Relevant Content from the Web Using a Near-Duplicate Detection Approach , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[4]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[5]  Padhraic Smyth,et al.  Curve Clustering with Random Effects Regression Mixtures , 2003, AISTATS.

[6]  W. Cleveland Robust Locally Weighted Regression and Smoothing Scatterplots , 1979 .

[7]  Padhraic Smyth,et al.  Trajectory clustering with mixtures of regression models , 1999, KDD '99.

[8]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[9]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[10]  Young-Chon Kim,et al.  A Dynamic Web Page Prediction Model Based on Access Patterns to Offer Better User Latency , 2011, ArXiv.

[11]  Debajyoti Mukhopadhyay,et al.  An Approach to Web Page Prediction Using Markov Model and Web Page Ranking , 2009, J. Convergence Inf. Technol..

[12]  Padhraic Smyth,et al.  Prediction and ranking algorithms for event-based network data , 2005, SKDD.

[13]  Hyun-Han Kwon,et al.  Locally weighted polynomial regression: Parameter choice and application to forecasts of the Great Salt Lake , 2006 .

[14]  Michalis Vazirgiannis,et al.  Web Page Rank Prediction with PCA and EM Clustering , 2009, WAW.