On the robustness of google scholar against spam

In this research-in-progress paper we present the current results of several experiments in which we analyzed whether spamming Google Scholar is possible. Our results show, it is possible: We 'improved' the ranking of articles by manipulating their citation counts and we made articles appear in searchers for keywords the articles did not originally contained by placing invisible text in modified versions of the article.

[1]  Torsten Suel,et al.  Improving web spam classifiers using link structure , 2007, AIRWeb '07.

[2]  András A. Benczúr,et al.  Link-Based Similarity Search to Fight Web Spam , 2006, AIRWeb.

[3]  András A. Benczúr,et al.  SpamRank -- Fully Automatic Link Spam Detection , 2005, AIRWeb.

[4]  Fabrizio Silvestri,et al.  Know your neighbors: web spam detection using the web topology , 2007, SIGIR.

[5]  Jöran Beel,et al.  Google Scholar's Ranking Algorithm: The Impact of Articles' Age (An Empirical Study) , 2009, 2009 Sixth International Conference on Information Technology: New Generations.

[6]  Jöran Bela Erik Beel,et al.  Academic Search Engine Optimization (ASEO ): Optimizing Scholarly Literature for Google Scholar & Co. , 2010 .

[7]  Hector Garcia-Molina,et al.  Link Spam Alliances , 2005, VLDB.

[8]  Ira Steven Nathenson Internet Infoglut and Invisible Ink: Spamdexing Search Engines with Meta Tags , 1998 .

[9]  Tobias Scheffer,et al.  Thwarting the Nigritude Ultramarine: Learning to Identify Link Spam , 2005, ECML.

[10]  Otto-von-Guericke Google Scholar ’ s Ranking Algorithm : The Impact of Articles ’ Age ( An Empirical Study ) , 2009 .

[11]  Jöran Beel,et al.  Google Scholar’s Ranking Algorithm : An Introductory Overview , 2009 .

[12]  Baoning Wu,et al.  Extracting link spam using biased random walks from spam seed sets , 2007, AIRWeb '07.

[13]  Kazuyuki Aihara,et al.  A large-scale study of link spam detection by graph algorithms , 2007, AIRWeb '07.

[14]  Thomas Lavergne,et al.  Tracking Web Spam with Hidden Style Similarity , 2006, AIRWeb.

[15]  Jöran Beel,et al.  Google Scholar's ranking algorithm: The impact of citation counts (An empirical study) , 2009, 2009 Third International Conference on Research Challenges in Information Science.

[16]  Marc Najork,et al.  Spam, damn spam, and statistics: using statistical analysis to locate spam web pages , 2004, WebDB '04.