Discriminating Biased Web Manipulations in Terms of Link Oriented Measures

In this paper, we present a link oriented measuring method to discriminate the manipulated web pages effectively. We define the label of an edge as having a link context and a similarity measure between link context and target page. By suggesting an assessing measure based on singular value decomposition, it is explained that our proposed method can effectively detect the manipulated web pages. We, however, extend the SVD as an assessment measure to detect the rank-manipulated pages. In the experiment, the LOD method reduced about 17% amount of the rank that is minimum 209.4% higher than not manipulated web pages. Using this proposed approach, the chance of manipulated web pages getting high ranks than deserved can be discriminated effectively.

[1]  Taher H. Haveliwala Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search , 2003, IEEE Trans. Knowl. Data Eng..

[2]  Joel C. Miller,et al.  Modifications of Kleinberg's HITS algorithm using matrix exponentiation and web log records , 2001, SIGIR '01.

[3]  Alexander Thomasian,et al.  CSVD: Clustering and Singular Value Decomposition for Approximate Similarity Search in High-Dimensional Spaces , 2003, IEEE Trans. Knowl. Data Eng..

[4]  Iraklis Varlamis,et al.  THESUS: Organizing Web document collections based on link semantics , 2003, The VLDB Journal.

[5]  Robert Wilensky,et al.  Robust Hyperlinks: Cheap, Everywhere, Now , 2000, DDEP/PODDP.

[6]  Sriram Raghavan,et al.  Searching the Web , 2001, ACM Trans. Internet Techn..

[7]  Hsi-Jian Lee,et al.  Anchor text mining for translation of Web queries , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[8]  David J. DeWitt,et al.  Computing PageRank in a Distributed Internet Search Engine System , 2004, VLDB.

[9]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[10]  Eli Upfal,et al.  Using PageRank to Characterize Web Structure , 2002, COCOON.

[11]  Hyun Kang,et al.  A Study on the Fault Diagnosis of 3-D Roll Shape in Rolling Systems , 2004 .

[12]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[13]  Erik Elmroth,et al.  Applying recursion to serial and parallel QR factorization leads to better performance , 2000, IBM J. Res. Dev..

[14]  Wookey Lee,et al.  Structuring the Web to Cope with Dynamic Changes , 2005, SDWP@ICWS.

[15]  Berthier A. Ribeiro-Neto,et al.  Local versus global link information in the Web , 2003, TOIS.