MailRank: using ranking for spam detection

Can we use social networks to combat spam? This paper investigates the feasibility of MailRank, a new email ranking and classification scheme exploiting the social communication network created via email interactions. The underlying email network data is collected from the email contacts of all MailRank users and updated automatically based on their email activities to achieve an easy maintenance. MailRank is used to rate the sender address of arriving emails such that emails from trustworthy senders can be ranked and classified as spam or non-spam. The paper presents two variants: Basic MailRank computes a global reputation score for each email address, whereas in Personalized MailRank the score of each email address is different for each MailRank user. The evaluation shows that MailRank is highly resistant against spammer attacks, which obviously have to be considered right from the beginning in such an application scenario. MailRank also performs well even for rather sparse networks, i.e., where only a small set of peers actually take part in the ranking of email addresses.

[1]  Kenneth J. Arrow,et al.  Information Dynamics in the Networked World , 2003, Inf. Syst. Frontiers.

[2]  P. Oscar Boykin,et al.  Collaborative Spam Filtering Using E-Mail Networks , 2006, Computer.

[3]  Andrew Clausen,et al.  Online Reputation Systems: The Cost of Attack of PageRank , 2003 .

[4]  P. Oscar Boykin,et al.  Leveraging social networks to fight spam , 2005, Computer.

[5]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[6]  James A. Hendler,et al.  Reputation Network Analysis for Email Filtering , 2004, CEAS.

[7]  David Geer Will New Standards Help Curb Spam? , 2004, Computer.

[8]  Georg Lausen,et al.  Spreading activation models for trust propagation , 2004, IEEE International Conference on e-Technology, e-Commerce and e-Service, 2004. EEE '04. 2004.

[9]  Jennifer Widom,et al.  Scaling personalized web search , 2003, WWW '03.

[10]  Shyhtsun Felix Wu,et al.  On Attacking Statistical Spam Filters , 2004, CEAS.

[11]  Hector Garcia-Molina,et al.  The Eigentrust algorithm for reputation management in P2P networks , 2003, WWW '03.

[12]  Paul Resnick,et al.  Recommender systems , 1997, CACM.

[13]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[14]  S. Bornholdt,et al.  Scale-free topology of e-mail networks. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Stephanie Forrest,et al.  Email networks and the spread of computer viruses. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Wolfgang Nejdl,et al.  Finding Related Pages Using the Link Structure of the WWW , 2004, IEEE/WIC/ACM International Conference on Web Intelligence (WI'04).

[17]  Mads Haahr,et al.  Personalised, Collaborative Spam Filtering , 2004, CEAS.

[18]  Lada A. Adamic,et al.  Information Dynamics in the Networked World , 2003, Inf. Syst. Frontiers.

[19]  Brian D. Davison,et al.  Identifying link farm spam pages , 2005, WWW '05.