Measuring BitTorrent swarms beyond reach

BitTorrent is one of the most popular P2P file sharing applications in the world. Each BitTorrent network is called a swarm and millions of peers may join multiple swarms. However, there are many unreachable peers (NATed, Fire-Walled, or inactive at the time of the measurement) in each swarm. Due to this unreachable peers problem, the existing work can measure only a part of the entire peers in a swarm. In this paper, we propose an improved measurement method for BitTorrent swarms that many unreachable peers. In a nutshell, our crawler obtains peers behind NAT and firewalls by letting them connect to our crawlers through actively advertising our crawlers addresses to them. The evaluation result shows that our proposed method increases the number of unique contacted peers by 112 % compared to the conventional method. The proposed method also increases the total volume of downloaded pieces by 66 %. We then investigate the sampling bias among our proposed method and conventional methods, and find that different measurement methods can lead to significantly different measurement results.

[1]  Niklas Carlsson,et al.  Dynamic swarm management for improved BitTorrent performance , 2009, IPTPS.

[2]  Thomas E. Anderson,et al.  One Hop Reputations for Peer to Peer File Sharing Workloads , 2008, NSDI.

[3]  Xiaoning Ding,et al.  A performance study of BitTorrent-like peer-to-peer systems , 2007, IEEE Journal on Selected Areas in Communications.

[4]  Xiaoning Ding,et al.  Measurements, analysis, and modeling of BitTorrent-like systems , 2005, IMC '05.

[5]  Raúl Jiménez,et al.  Connectivity properties of Mainline BitTorrent DHT nodes , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[6]  Nazareno Andrade,et al.  Resource demand and supply in BitTorrent content-sharing communities , 2009, Comput. Networks.

[7]  Keith W. Ross,et al.  Understanding Peer Exchange in BitTorrent Systems , 2010, 2010 IEEE Tenth International Conference on Peer-to-Peer Computing (P2P).

[8]  Walid Dabbous,et al.  Spying the World from Your Laptop: Identifying and Profiling Content Providers and Big Downloaders in BitTorrent , 2010, LEET.

[9]  Daniel Stutzbach,et al.  Understanding churn in peer-to-peer networks , 2006, IMC '06.

[10]  Alexandru Iosup,et al.  Sampling Bias in BitTorrent Measurements , 2010, Euro-Par.

[11]  Mikel Izal,et al.  Dissecting BitTorrent: Five Months in a Torrent's Lifetime , 2004, PAM.

[12]  Johan A. Pouwelse,et al.  The Bittorrent P2P File-Sharing System: Measurements and Analysis , 2005, IPTPS.

[13]  Akihiro Nakao,et al.  A Resource-Efficient Method for Crawling Swarm Information in Multiple BitTorrent Networks , 2011, 2011 Tenth International Symposium on Autonomous Decentralized Systems.

[14]  Pablo Rodriguez,et al.  Monitoring the Bittorrent Monitors: A Bird's Eye View , 2009, PAM.

[15]  B. Levine,et al.  Exploring the Use of BitTorrent as the Basis for a Large Trace Repository , 2004 .

[16]  Di Wu,et al.  Unraveling the BitTorrent Ecosystem , 2011, IEEE Transactions on Parallel and Distributed Systems.

[17]  Alexandru Iosup,et al.  Correlating Topology and Path Characteristics of Overlay Networks and the Internet , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[18]  Akihiro Nakao,et al.  CoreLab: an emerging network testbed employing hosted virtual machine monitor , 2008, CoNEXT '08.

[19]  Abraham Silberschatz,et al.  P4p: provider portal for applications , 2008, SIGCOMM '08.