Fast algorithms for topk personalized pagerank queries
In entity-relation (ER) graphs (V, E), nodes V represent typed entities and edges E represent typed relations. For dynamic personalized PageRank queries, nodes are ranked by their steady-state probabilities under the standard random surfer model. In this work, we propose a framework to answer top-k graph conductance queries. Our top-k ranking technique leads to a 4X speedup, and overall our system executes queries 200-1600X faster than whole-graph PageRank. Some queries contain hard predicates, i.e., predicates that must be satisfied by the answer nodes. For example, we may seek authoritative papers on public key cryptography, but only those written during 1997. We extend our system to handle hard predicates. Our system achieves these substantial query speedups while consuming only 10-20% of the space taken by a regular text index.
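To make the query semantics concrete, the following is a minimal sketch of the whole-graph baseline that the abstract compares against, not the fast algorithm proposed in the paper. It computes personalized PageRank by plain power iteration and then selects the top-k nodes, optionally filtered by a hard predicate. The function names, the adjacency-dict graph representation, the teleport parameter `alpha = 0.85`, and the handling of dangling nodes are all assumptions made for illustration.

```python
import heapq
from typing import Callable, Dict, Hashable, List, Optional, Tuple

Node = Hashable
Graph = Dict[Node, List[Node]]  # assumed: every node appears as a key


def personalized_pagerank(
    graph: Graph,
    teleport: Dict[Node, float],
    alpha: float = 0.85,   # assumed probability of following an out-edge
    iters: int = 100,
    tol: float = 1e-10,
) -> Dict[Node, float]:
    """Steady-state probabilities of a random surfer who restarts at the
    teleport nodes with probability 1 - alpha at every step."""
    total = sum(teleport.values())
    restart = {v: teleport.get(v, 0.0) / total for v in graph}
    scores = dict(restart)
    for _ in range(iters):
        nxt = {v: (1.0 - alpha) * restart[v] for v in graph}
        for u, out in graph.items():
            if out:
                share = alpha * scores[u] / len(out)
                for v in out:
                    nxt[v] += share
            else:
                # Assumption: a dangling node sends its mass back to the
                # teleport distribution so that probabilities sum to 1.
                for v, w in restart.items():
                    nxt[v] += alpha * scores[u] * w
        delta = sum(abs(nxt[v] - scores[v]) for v in graph)
        scores = nxt
        if delta < tol:
            break
    return scores


def top_k(
    scores: Dict[Node, float],
    k: int,
    predicate: Optional[Callable[[Node], bool]] = None,
) -> List[Tuple[Node, float]]:
    """Return the k highest-scoring nodes that satisfy the hard predicate."""
    candidates = (
        (v, s) for v, s in scores.items()
        if predicate is None or predicate(v)
    )
    return heapq.nlargest(k, candidates, key=lambda pair: pair[1])


# Hypothetical toy graph: the query teleports to node "a".
toy_graph: Graph = {"a": ["b"], "b": ["c"], "c": ["a"], "d": ["a"]}
toy_scores = personalized_pagerank(toy_graph, {"a": 1.0})
best = top_k(toy_scores, 2)
best_filtered = top_k(toy_scores, 2, predicate=lambda v: v != "a")
```

In this sketch a hard predicate is simply a filter applied after scoring, so every query still pays for a full pass over the graph. That per-query cost is what the reported speedups are measured against.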