Content pollution quantification in large P2P networks : A measurement study on KAD

Content pollution is one of the major issues affecting P2P file sharing networks. However, since early studies on FastTrack and Overnet, no recent investigation has reported its impact on current P2P networks. In this paper, we present a method and the supporting architecture to quantify the pollution of contents in the KAD network. We first collect information on many popular files shared in this network. Then, we propose a new way to detect content pollution by analyzing all filenames linked to a content with a metric based on the Tversky index and which gives very low error rates. By analyzing a large number of popular files, we show that 2/3 of the contents are polluted, one part by index poisoning but the majority by a new, more dangerous, form of pollution that we call index falsification.

[1]  Keith W. Ross,et al.  The Index Poisoning Attack in P2P File Sharing Systems , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[2]  Taoufik En-Najjary,et al.  A global view of kad , 2007, IMC '07.

[3]  Stefan Schmid,et al.  Poisoning the Kad Network , 2010, ICDCN.

[4]  Douglas S. Reeves,et al.  Winnowing: Protecting P2P systems against pollution through cooperative index filtering , 2012, J. Netw. Comput. Appl..

[5]  Clémence Magnien,et al.  Quantifying paedophile queries in a large P2P system , 2011, 2011 Proceedings IEEE INFOCOM.

[6]  Rakesh Kumar,et al.  Pollution in P2P file sharing systems , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[7]  Yongdae Kim,et al.  Why Kad lookup fails , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[8]  Taoufik En-Najjary,et al.  Exploiting KAD: possible uses and misuses , 2007, CCRV.

[9]  Injong Rhee,et al.  WINNOWING : Protecting P 2 P Systems Against Pollution By Cooperative Index Filtering , 2009 .

[10]  Olivier Festor,et al.  Monitoring and Controlling Content Access in KAD , 2010, 2010 IEEE International Conference on Communications.

[11]  A. Tversky Features of Similarity , 1977 .

[12]  Mario Gerla,et al.  Understanding Pollution Dynamics in P2P File Sharing , 2006, IPTPS.

[13]  Nicolas Christin,et al.  Content availability, pollution and poisoning in file sharing peer-to-peer networks , 2005, EC '05.

[14]  Olivier Festor,et al.  Efficient DHT attack mitigation through peers' ID distribution , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW).

[15]  Julien Bourgeois,et al.  International workshop on hot topics in Peer-to-Peer systems - HOTP2P , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[16]  Jussara M. Almeida,et al.  Reputation Systems for Fighting Pollution in Peer-to-Peer File Sharing Systems , 2007, Seventh IEEE International Conference on Peer-to-Peer Computing (P2P 2007).