Adversarial information retrieval in the web

UPGRADE is the anchor point for UPENET (UPGRADE European NETwork), the network of CEPIS member societies’ publications, which currently includes the following:
• Informatik-Spektrum, journal published by Springer Verlag on behalf of the CEPIS societies GI, Germany, and SI, Switzerland
• ITNOW, magazine published by Oxford University Press on behalf of the British CEPIS society BCS
• Mondo Digitale, digital journal from the Italian CEPIS society AICA
• Novática, journal from the Spanish CEPIS society ATI
• OCG Journal, journal from the Austrian CEPIS society OCG
• Pliroforiki, journal from the Cyprus CEPIS society CCS
• Pro Dialog, journal from the Polish CEPIS society PTI-PIPS
