A Scalable Parallel HITS Algorithm for Page Ranking
暂无分享,去创建一个
The hypertext induced topic search (HITS) algorithm is a method of ranking authority of information sources in a hyperlinked environment HITS uses only topological properties of the hyperlinked network to determine rankings. We present an efficient and scalable implementation of the HITS algorithm that uses MPI as an underlying means of communication. We then analyze the performance on a shared memory supercomputer, and use our results to verify the optimal number of processors needed to rank a large number of pages for the link structure of the total University of Southern Mississippi (usm.edu domain) Web sites
[1] Ronald L. Rivest,et al. Introduction to Algorithms , 1990 .
[2] Antonio Gulli,et al. The indexable web is more than 11.5 billion pages , 2005, WWW '05.
[3] Ronald L. Rivest,et al. Introduction to Algorithms, Second Edition , 2001 .
[4] Jon Kleinberg,et al. Authoritative sources in a hyperlinked environment , 1999, SODA '98.