Web Spam, Social Propaganda and the Evolution of Search Engine Rankings

Search Engines have greatly influenced the way we experience the web. Since the early days of the web, users have been relying on them to get informed and make decisions. When the web was relatively small, web directories were built and maintained using human experts to screen and categorize pages according to their characteristics. By the mid 1990’s, however, it was apparent that the human expert model of categorizing web pages does not scale. The first search engines appeared and they have been evolving ever since, taking over the role that web directories used to play.

[1]  Hector Garcia-Molina,et al.  Web Spam Taxonomy , 2005, AIRWeb.

[2]  Clifford A. Lynch When documents deceive: trust and provenance as new factors for information retrieval in a tangled web , 2001 .

[3]  Junghoo Cho,et al.  Impact of search engines on page popularity , 2004, WWW '04.

[4]  Marc Najork,et al.  A large‐scale study of the evolution of Web pages , 2003, WWW '03.

[5]  Monika Henzinger,et al.  Hyperlink Analysis for the Web , 2001, IEEE Internet Comput..

[6]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[7]  Alfred McClung Lee,et al.  The fine art of propaganda , 1972 .

[8]  Gerard Salton,et al.  Dynamic document processing , 1972, CACM.

[9]  Massimo Marchiori,et al.  The Quest for Correct Information on the Web: Hyper Search Engines , 1997, Comput. Networks.

[10]  M. I. Mauldin,et al.  Lycos: design choices in an Internet search service , 1997 .

[11]  Helen Nissenbaum,et al.  Defining the Web: The Politics of Search Engines , 2000, Computer.

[12]  Gary William Flake,et al.  Self-organization of the web and identification of communities , 2002 .

[13]  Jon M. Kleinberg,et al.  The small-world phenomenon: an algorithmic perspective , 2000, STOC '00.

[14]  Anton Vedder,et al.  Medical Data, New Information Technologies, and the Need for Normative Principles other than Privacy Rules , 2000 .

[15]  Andrei Z. Broder,et al.  Graph structure in the Web , 2000, Comput. Networks.

[16]  L. Gostin Law and medicine. , 1993, JAMA.

[17]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[18]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[19]  GrahamLeah,et al.  "Of course it's true; I saw it on the Internet!" , 2003 .

[20]  Panagiotis Takis Metaxas,et al.  "Of course it's true; I saw it on the Internet!": critical thinking in the Internet era , 2003, CACM.

[21]  Craig A. Knoblock,et al.  Lycos : Design choices in an Internet search service , 1997 .

[22]  Marc Najork,et al.  Spam, damn spam, and statistics: using statistical analysis to locate spam web pages , 2004, WebDB '04.

[23]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[24]  Monika Henzinger,et al.  Analysis of a very large web search engine query log , 1999, SIGF.

[25]  Andrew Tomkins,et al.  The Web and Social Networks , 2002, Computer.

[26]  Rajeev Motwani,et al.  Stratified Planning , 2009, IJCAI.

[27]  Panagiotis Takis Metaxas Using Propagation of Distrust to Find Untrustworthy Web Neighborhoods , 2009, 2009 Fourth International Conference on Internet and Web Applications and Services.

[28]  Lloyd Allison,et al.  What is a Tall Poppy Among Web Pages? , 1998, Comput. Networks.

[29]  Kostas Tsioutsiouliklis,et al.  \Googlearchy": How a Few Heavily-Linked Sites Dominate Politics on the Web , 2003 .

[30]  Brian D. Davison,et al.  Identifying link farm spam pages , 2005, WWW '05.

[31]  András A. Benczúr,et al.  SpamRank -- Fully Automatic Link Spam Detection , 2005, AIRWeb.

[32]  Marc Najork,et al.  Spam, Damn Spam, and Statistics , 2004 .

[33]  C. Lee Giles,et al.  Self-Organization and Identification of Web Communities , 2002, Computer.

[34]  Prabhakar Raghavan,et al.  Social Networks: From the Web to the Enterprise , 2002, IEEE Internet Comput..

[35]  Franco Scarselli,et al.  PageRank and Web communities , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[36]  Krishna Bharat,et al.  Who links to whom: mining linkage between Web sites , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[37]  Ravi Kumar,et al.  Trawling the Web for Emerging Cyber-Communities , 1999, Comput. Networks.

[38]  Tsau Young Lin,et al.  Proceedings of the 2001 IEEE International Conference on Data Mining, 29 November - 2 December 2001, San Jose, California, USA , 2001 .