Effect of different network analysis strategies on search engine re-ranking

The research described in this paper examined two different approaches to building the co-citation network that the authors have used in re-ranking the set of results returned by a search engine [22, 23]. The more computationally demanding (in terms of query load) Inter- or Web-wide co-citation approach used in-links from throughout the Web to build the network. In contrast, the Intra co-citation approach only used inlinks inferred from search engine output. Results of this study confirmed the authors' previous findings [23] that reordering based on a network-analytic relevance prediction model significantly improves the precision of top 20 results as compared to the Google search engine. The results also showed (for the queries used) that the Intra co-citation approach is significantly better than the Web-wide co-citation approach, in addition to placing fewer querying demands on the search engine.

[1]  B. C. Griffith,et al.  The Structure of Scientific Literatures I: Identifying and Graphing Specialties , 1974 .

[2]  Mark H. Chignell,et al.  Searching the hypermedia Web: improved topic distillation through network analytic relevance ranking , 2002, New Rev. Hypermedia Multim..

[3]  Jon M. Kleinberg,et al.  Mining the Web's Link Structure , 1999, Computer.

[4]  Jon M. Kleinberg,et al.  Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text , 1998, Comput. Networks.

[5]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[6]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[7]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[8]  Taher H. Haveliwala Topic-sensitive PageRank , 2002, IEEE Trans. Knowl. Data Eng..

[9]  Wei Zhang,et al.  Improvement of HITS-based algorithms on web documents , 2002, WWW '02.

[10]  Loren G. Terveen,et al.  Does “authority” mean quality? predicting expert quality ratings of Web documents , 2000, SIGIR '00.

[11]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[12]  Mark Chignell,et al.  How Good is Search Engine Ranking? a Validation Study with Human Judges , 2002 .

[13]  Amanda Spink,et al.  Real life, real users, and real needs: a study and analysis of user queries on the web , 2000, Inf. Process. Manag..

[14]  Ray R. Larson,et al.  Bibliometrics of the World Wide Web: An Exploratory Analysis of the Intellectual Structure of Cyberspace , 1996 .

[15]  Allan Borodin,et al.  Finding authorities and hubs from link structures on the World Wide Web , 2001, WWW '01.

[16]  Mark H. Chignell,et al.  Re-ranking search results using network analysis a case study with google: a case study with Google , 2002, CASCON.

[17]  Joel C. Miller,et al.  Modifications of Kleinberg's HITS algorithm using matrix exponentiation and web log records , 2001, SIGIR '01.

[18]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.