Improving the Search Mechanism for Unstructured Peer-to-Peer Networks Using the Statistical Matrix Form

In a traditional file search mechanism, such as flooding, a peer broadcasts a query to its neighbors through an unstructured peer-to-peer (P2P) network until the time-to-live decreases to zero. A major disadvantage of flooding is that, in a large-scale network, this blind-choice strategy usually incurs an enormous traffic overhead. In this paper, we propose a method, called the statistical matrix form (SMF), which improves the flooding mechanism by selecting neighbors according to their capabilities. The SMF measures the following peer characteristics: (1) the number of shared files; (2) the content quality; (3) the query service; and (4) the transmission distance between neighbors. Based on these measurements, appropriate peers can be selected, thereby reducing the traffic overhead significantly. Our experimental results demonstrate that the SMF is effective and efficient. For example, compared with the flooding search mechanism in dynamic unstructured P2P networks, the SMF reduces the traffic overhead by more than 80%. Moreover, it achieves a good success rate and shorter response times.

[1]  Matteo Sereno,et al.  Generalized Probabilistic Flooding in Unstructured Peer-to-Peer Networks , 2011, IEEE Transactions on Parallel and Distributed Systems.

[2]  Cheng-Hsin Hsu,et al.  Design and Evaluation of a Proxy Cache for Peer-to-Peer Traffic , 2011, IEEE Transactions on Computers.

[3]  Panos Kalnis,et al.  DCMP: A Distributed Cycle Minimization Protocol for Peer-to-Peer Networks , 2008, IEEE Transactions on Parallel and Distributed Systems.

[4]  John Kubiatowicz,et al.  Probabilistic location and routing , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[5]  Li Xiao,et al.  Improving unstructured peer-to-peer systems by adaptive connection establishment , 2005, IEEE Transactions on Computers.

[6]  HyunYong Lee,et al.  A Feasibility Study of P2P Traffic Localization through Network Delay Insertion , 2012, IEICE Trans. Commun..

[7]  Biplab Sikdar,et al.  A Queuing Model for Evaluating the Transfer Latency of Peer-to-Peer Systems , 2010, IEEE Transactions on Parallel and Distributed Systems.

[8]  Stefan Saroiu,et al.  A Measurement Study of Peer-to-Peer File Sharing Systems , 2001 .

[9]  Hung-Chang Hsiao,et al.  A Tree-Based Peer-to-Peer Network with Quality Guarantees , 2008, IEEE Transactions on Parallel and Distributed Systems.

[10]  Do Young Eun,et al.  On the Performance of Content Delivery under Competition in a Stochastic Unstructured Peer-to-Peer Network , 2010, IEEE Transactions on Parallel and Distributed Systems.

[11]  Yunhao Liu,et al.  Rumor Riding: Anonymizing Unstructured Peer-to-Peer Systems , 2006, IEEE Transactions on Parallel and Distributed Systems.

[12]  Hector Garcia-Molina,et al.  Efficient search in peer to peer networks , 2004 .

[13]  Geoff Coulson,et al.  Free Riding on Gnutella Revisited: The Bell Tolls? , 2005, IEEE Distributed Syst. Online.

[14]  Bruce M. Maggs,et al.  Efficient content location using interest-based locality in peer-to-peer systems , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[15]  Tsungnan Lin,et al.  On efficiency in searching networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[16]  Li Xiao,et al.  AOTO: adaptive overlay topology optimization in unstructured P2P systems , 2003, GLOBECOM '03. IEEE Global Telecommunications Conference (IEEE Cat. No.03CH37489).

[17]  Li Xiao,et al.  An Effective P2P Search Scheme to Exploit File Sharing Heterogeneity , 2007, IEEE Transactions on Parallel and Distributed Systems.

[18]  Abhishek Kumar,et al.  Efficient and scalable query routing for unstructured peer-to-peer networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[19]  Dimitrios Tsoumakos,et al.  Analysis and comparison of P2P search methods , 2006, InfoScale '06.

[20]  Hirotsugu Kakugawa,et al.  An Efficient Index Dissemination in Unstructured Peer-to-Peer Networks , 2008, IEICE Trans. Inf. Syst..

[21]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[22]  Jovan G. Brankov,et al.  Estimating the standard deviation from extreme Gaussian values , 2004, IEEE Signal Processing Letters.

[23]  Hillol Kargupta,et al.  Approximate Distributed K-Means Clustering over a Peer-to-Peer Network , 2009, IEEE Transactions on Knowledge and Data Engineering.

[24]  Xue Liu,et al.  Revisiting Dynamic Query Protocols in Unstructured Peer-to-Peer Networks , 2012, IEEE Transactions on Parallel and Distributed Systems.

[25]  Qian Zhang,et al.  How Much to Share: A Repeated Game Model for Peer-to-Peer Streaming under Service Differentiation Incentives , 2012, IEEE Transactions on Parallel and Distributed Systems.

[26]  Li Xiao,et al.  Building a Scalable Bipartite P2P Overlay Network , 2007, IEEE Trans. Parallel Distributed Syst..

[27]  Hao Liao,et al.  Building Small-World Peer-to-Peer Networks Based on Hierarchical Structures , 2009, IEEE Transactions on Parallel and Distributed Systems.

[28]  Li Xiao,et al.  DiCAS: An Efficient Distributed Caching Mechanism for P2P Systems , 2006, IEEE Transactions on Parallel and Distributed Systems.

[29]  Y. Charlie Hu,et al.  Transparent query caching in peer-to-peer overlay networks , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[30]  Ben Y. Zhao,et al.  An Infrastructure for Fault-tolerant Wide-area Location and Routing , 2001 .

[31]  Murat Yuksel,et al.  Limited Scale-Free Overlay Topologies for Unstructured Peer-to-Peer Networks , 2009, IEEE Transactions on Parallel and Distributed Systems.

[32]  Hao Liao,et al.  Resolving the Topology Mismatch Problem in Unstructured Peer-to-Peer Networks , 2009, IEEE Transactions on Parallel and Distributed Systems.

[33]  Caterina M. Scoglio,et al.  Evaluating stranger policies in P2P file-sharing systems with reciprocity mechanisms , 2012, Comput. Networks.

[34]  Mohamed S. Kamel,et al.  Hierarchically Distributed Peer-to-Peer Document Clustering and Cluster Summarization , 2009, IEEE Transactions on Knowledge and Data Engineering.

[35]  Gonzalo Navarro,et al.  Near neighbor searching with K nearest references , 2015, Inf. Syst..

[36]  Peter Druschel,et al.  Pastry: Scalable, distributed object location and routing for large-scale peer-to- , 2001 .

[37]  Raouf Boutaba,et al.  Plexus: A Scalable Peer-to-Peer Protocol Enabling Efficient Subset Search , 2009, IEEE/ACM Transactions on Networking.

[38]  R. Rodrigues,et al.  Full-Information Lookups for Peer-to-Peer Overlays , 2008, IEEE Transactions on Parallel and Distributed Systems.

[39]  Piet Van Mieghem,et al.  Searching with Multiple Random Walk Queries , 2007, 2007 IEEE 18th International Symposium on Personal, Indoor and Mobile Radio Communications.

[40]  Walter Willinger,et al.  On Unbiased Sampling for Unstructured Peer-to-Peer Networks , 2006, IEEE/ACM Transactions on Networking.

[41]  Po-Chiang Lin,et al.  Dynamic Search Algorithm in Unstructured Peer-to-Peer Networks , 2009, IEEE Transactions on Parallel and Distributed Systems.

[42]  Christos Gkantsidis,et al.  Random walks in peer-to-peer networks , 2004, IEEE INFOCOM 2004.

[43]  Ben Y. Zhao,et al.  Protector: A Probabilistic Failure Detector for Cost-Effective Peer-to-Peer Storage , 2011, IEEE Transactions on Parallel and Distributed Systems.

[44]  Wei-Mei Chen,et al.  Randomized Search Strategy for Unstructured P2P Networks , 2012, IEICE Trans. Commun..

[45]  Li Xiao,et al.  Location awareness in unstructured peer-to-peer systems , 2005, IEEE Transactions on Parallel and Distributed Systems.

[46]  Yuuki Takano,et al.  Distributed Hash Tables Adapting to the Conditions of the Real World , 2013 .

[47]  Li Xiao,et al.  Building a Scalable Bipartite P2P Overlay Network , 2004, IEEE Transactions on Parallel and Distributed Systems.

[48]  John C. S. Lui,et al.  A Mathematical Framework for Analyzing Adaptive Incentive Protocols in P2P Networks , 2012, IEEE/ACM Transactions on Networking.

[49]  George Economou,et al.  Color edge detection using the minimal spanning tree , 2005, Pattern Recognit..

[50]  Song Jiang,et al.  LightFlood: Minimizing Redundant Messages and Maximizing Scope of Peer-to-Peer Search , 2008, IEEE Transactions on Parallel and Distributed Systems.

[51]  Jie Wu,et al.  Stability-Optimal Grouping Strategy of Peer-to-Peer Systems , 2011, IEEE Transactions on Parallel and Distributed Systems.

[52]  Yuanyuan Yang,et al.  An Efficient Hybrid Peer-to-Peer System for Distributed Data Sharing , 2010, IEEE Transactions on Computers.

[53]  Hao Liao,et al.  A Near-Optimal Algorithm Attacking the Topology Mismatch Problem in Unstructured Peer-to-Peer Networks , 2010, IEEE Transactions on Parallel and Distributed Systems.

[54]  Takumi Miyoshi,et al.  A Router-Aided Hierarchical P2P Traffic Localization Based on Variable Additional Delay Insertion , 2014, IEICE Trans. Commun..

[55]  Özgür Ulusoy,et al.  Free Riding in Peer-to-Peer Networks , 2009, IEEE Internet Computing.

[56]  Edith Cohen,et al.  Search and replication in unstructured peer-to-peer networks , 2002, ICS '02.

[57]  Ian T. Foster,et al.  Mapping the Gnutella Network , 2002, IEEE Internet Comput..

[58]  Mohamed Hefeeda,et al.  On the Benefits of Cooperative Proxy Caching for Peer-to-Peer Traffic , 2010, IEEE Transactions on Parallel and Distributed Systems.

[59]  Satoshi Fujita,et al.  Tree-Based Consistency Maintenance Scheme for Peer-to-Peer File Sharing of Editable Contents , 2014, IEICE Trans. Inf. Syst..

[60]  Maria Gradinariu Potop-Butucaru,et al.  Stabilizing Distributed R-Trees for Peer-to-Peer Content Routing , 2010, IEEE Transactions on Parallel and Distributed Systems.

[61]  Scott Shenker,et al.  Making gnutella-like P2P systems scalable , 2003, SIGCOMM '03.

[62]  Leonard Kleinrock,et al.  Optimal Search Performance in Unstructured Peer-to-Peer Networks With Clustered Demands , 2006, 2006 IEEE International Conference on Communications.

[63]  Eytan Adar,et al.  Free Riding on Gnutella , 2000, First Monday.

[64]  Yao Zhao,et al.  Where the Sidewalk Ends: Extending the Internet AS Graph Using Traceroutes from P2P Users , 2014, IEEE Trans. Computers.

[65]  Gang Chen,et al.  Enhancing Search Performance in Unstructured P2P Networks Based on Users' Common Interest , 2008, IEEE Transactions on Parallel and Distributed Systems.

[66]  Li Xiao,et al.  Improving Query Response Delivery Quality in Peer-to-Peer Systems , 2006, IEEE Transactions on Parallel and Distributed Systems.

[67]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.