T-Rank: Time-Aware Authority Ranking

Analyzing the link structure of the web for deriving a page’s authority and implied importance has deeply affected the way information providers create and link content, the ranking in web search engines, and the users’ access behavior. Due to the enormous dynamics of the web, with millions of pages created, updated, deleted, and linked to every day, timeliness of web pages and links is a crucial factor for their evaluation. Users are interested in important pages (i.e., pages with high authority score) but are equally interested in the recency of information. Time – and thus the freshness of web content and link structure – emanates as a factor that should be taken into account in link analysis when computing the importance of a page. So far only minor effort has been spent on the integration of temporal aspects into link analysis techniques. In this paper we introduce T-Rank, a link analysis approach that takes into account the temporal aspects freshness (i.e., timestamps of most recent updates) and activity (i.e., update rates) of pages and links. Preliminary experimental results show that T-Rank can improve the quality of ranking web pages.

[1]  Eli Upfal,et al.  Stochastic models for the Web graph , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[2]  R. Forthofer,et al.  Rank Correlation Methods , 1981 .

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Shlomo Moran,et al.  SALSA: the stochastic approach for link-structure analysis , 2001, TOIS.

[5]  Ronald Fagin,et al.  Comparing top k lists , 2003, SODA '03.

[6]  Reiner Kraft,et al.  TimeLinks : Exploring the link structure of the evolving Web , 2003 .

[7]  Al Franken,et al.  Lies and the Lying Liars Who Tell Them: A Fair and Balanced Look at the Right , 2004 .

[8]  Guido Caldarelli,et al.  A Multi-Layer Model for the Web Graph , 2002, WebDyn@WWW.

[9]  Marc Najork,et al.  A large‐scale study of the evolution of Web pages , 2004, Softw. Pract. Exp..

[10]  Gerhard Weikum,et al.  The BINGO! System for Information Portal Generation and Expert Web Search , 2003, CIDR.

[11]  D. Murphey,et al.  Against All Enemies: Inside America's War on Terror , 2004 .

[12]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[13]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[14]  Allan Borodin,et al.  Finding authorities and hubs from link structures on the World Wide Web , 2001, WWW '01.

[15]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[16]  M. Kendall,et al.  Rank Correlation Methods , 1949 .

[17]  Wei-Ying Ma,et al.  Implicit link analysis for small web search , 2003, SIGIR '03.

[18]  Ricardo A. Baeza-Yates,et al.  Web Structure, Dynamics and Page Quality , 2002, SPIRE.