An analysis of optimal link bombs

We analyze the phenomenon of collusion for the purpose of boosting the pagerank of a node in an interlinked environment. We investigate the optimal attack pattern for a group of nodes (attackers) attempting to improve the ranking of a specific node (the victim). We consider attacks where the attackers can only manipulate their own outgoing links. We show that the optimal attacks in this scenario are uncoordinated, i.e. the attackers link directly to the victim and no one else; nodes do not link to each other. We also discuss optimal attack patterns for a group that wants to hide itself by not pointing directly to the victim. In these disguised attacks, the attackers link to nodes l hops away from the victim. We show that an optimal disguised attack exists and how it can be computed. The optimal disguised attack also allows us to find optimal link farm configurations. A link farm can be considered as a special case of our approach: the target page of the link farm is the victim and the other nodes in the link farm are the attackers for the purpose of improving the rank of the victim. The target page can however control its own outgoing links for the purpose of improving its own rank, which can be modeled as an optimal disguised attack of 1-hop on itself. Our results are unique in the literature as we show optimality not only in the pagerank score, but also in the rank based on the pagerank score. We further validate our results with experiments on a variety of random graph models.

[1]  C. Lee Giles,et al.  Winner''s Don''t Take All , 2002 .

[2]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[3]  Malik Magdon-Ismail,et al.  Optimal Link Bombs are Uncoordinated , 2005, AIRWeb.

[4]  Kevin S. McCurley,et al.  Ranking the web frontier , 2004, WWW '04.

[5]  Kevin S. McCurley,et al.  Analysis of anchor text for web search , 2003, SIGIR.

[6]  Eric J. Friedman,et al.  Manipulability of PageRank under Sybil Strategies , 2006 .

[7]  Hector Garcia-Molina,et al.  Link Spam Alliances , 2005, VLDB.

[8]  Andrei Z. Broder,et al.  Workshop on Algorithms and Models for the Web Graph , 2007, WAW.

[9]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[10]  Ling Liu,et al.  A Parameterized Approach to Spam-Resilient Link Analysis of the Web , 2009, IEEE Transactions on Parallel and Distributed Systems.

[11]  David M. Pennock,et al.  Winners don't take all: Characterizing the competition for links on the web , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Xin Zhao,et al.  Using spam farm to boost PageRank , 2007, AIRWeb '07.

[13]  Ramesh Govindan,et al.  Making Eigenvector-Based Reputation Systems Robust to Collusion , 2004, WAW.

[14]  Michael I. Jordan,et al.  Stable algorithms for link analysis , 2001, SIGIR '01.

[15]  Ricardo A. Baeza-Yates,et al.  Pagerank Increase under Different Collusion Topologies , 2005, AIRWeb.

[16]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[17]  Franco Scarselli,et al.  Inside PageRank , 2005, TOIT.

[18]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[19]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[20]  Tobias Scheffer,et al.  Thwarting the Nigritude Ultramarine: Learning to Identify Link Spam , 2005, ECML.

[21]  John A. Tomlin,et al.  A new paradigm for ranking pages on the world wide web , 2003, WWW '03.

[22]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.