Exploiting Interest Locality for Peer-Assisted Search in UGC Video Systems

While there are several ways for video finding in UGC (user generated content) video systems, video search is still the number one source of video views in aggregation. In this paper, we propose to use peer-assisted search to alleviate the server burden caused by video search. To this end, we have measured and analyzed YouKu, the largest UGC video system in China. With a large dataset, we have found non-power law distribution of video popularity, low replication level for popular videos, skewed user activity and interest locality. Based on the findings, we design two-layer hierarchical semantic overlay structures to implement peer-assisted search for UGC video systems. A novel search algorithm called WISE is proposed to guide queries quickly to the semantically relevant clusters by visiting a very small fraction of nodes. Simulations using the YouKu trace demonstrate that WISE is effective and helpful to assist the search in UGC video systems. To the best of our knowledge, this is the first work to study peer-assisted search in UGC.

[1]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[2]  Hector Garcia-Molina,et al.  Semantic Overlay Networks for P2P Systems , 2004, AP2PC.

[3]  Anne-Marie Kermarrec,et al.  Peer sharing behaviour in the eDonkey network, and implications for the design of server-less file sharing systems , 2006, EuroSys.

[4]  Weimao Ke,et al.  Strong Ties vs. Weak Ties: Studying the Clustering Paradox for Decentralized Search , 2009, LSDS-IR@SIGIR.

[5]  Bruce M. Maggs,et al.  Efficient content location using interest-based locality in peer-to-peer systems , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[6]  Anand Sivasubramaniam,et al.  Semantic small world: an overlay network for peer-to-peer search , 2004, Proceedings of the 12th IEEE International Conference on Network Protocols, 2004. ICNP 2004..

[7]  Rong Gu,et al.  Measuring and enhancing the social connectivity of UGC video systems: A case study of YouKu , 2011, 2011 IEEE Nineteenth IEEE International Workshop on Quality of Service.

[8]  Elizabeth R. Jessup,et al.  Matrices, Vector Spaces, and Information Retrieval , 1999, SIAM Rev..

[9]  Jin Li,et al.  Toward P2P-Based Multimedia Sharing in User Generated Contents , 2012, IEEE Transactions on Parallel and Distributed Systems.

[10]  Theodoros Lappas,et al.  Mining tags using social endorsement networks , 2011, SIGIR.

[11]  Jun Wang,et al.  Exploiting Geographical and Temporal Locality to Boost Search Efficiency in Peer-to-Peer Systems , 2006, IEEE Transactions on Parallel and Distributed Systems.

[12]  Burton H. Bloom,et al.  Space/time trade-offs in hash coding with allowable errors , 1970, CACM.

[13]  A-L Barabási,et al.  Structure and tie strengths in mobile communication networks , 2006, Proceedings of the National Academy of Sciences.

[14]  Zhenyu Li,et al.  An Analysis of the Subscription in User-Generated Content Video Systems , 2012, 2012 21st International Conference on Computer Communications and Networks (ICCCN).

[15]  Edith Cohen,et al.  Replication strategies in unstructured peer-to-peer networks , 2002, SIGCOMM.

[16]  Lixin Gao,et al.  The impact of YouTube recommendation system on video views , 2010, IMC '10.

[17]  Yiming Hu,et al.  Enhancing Search Performance on Gnutella-Like P2P Systems , 2006, IEEE Transactions on Parallel and Distributed Systems.

[18]  Jiangchuan Liu,et al.  NetTube: Exploring Social Networks for Peer-to-Peer Short Video Sharing , 2009, IEEE INFOCOM 2009.

[19]  Edith Cohen,et al.  Search and replication in unstructured peer-to-peer networks , 2002, ICS '02.

[20]  D. Sornette,et al.  Extreme Deviations and Applications , 1997, cond-mat/9705132.

[21]  Daniel Stutzbach,et al.  Characterizing Unstructured Overlay Topologies in Modern P2P File-Sharing Systems , 2005, IEEE/ACM Transactions on Networking.

[22]  Minas Gjoka,et al.  Walking in Facebook: A Case Study of Unbiased Sampling of OSNs , 2010, 2010 Proceedings IEEE INFOCOM.

[23]  Songqing Chen,et al.  The stretched exponential distribution of internet media access patterns , 2008, PODC '08.

[24]  Yunhao Liu,et al.  TSS: Efficient Term Set Search in Large Peer-to-Peer Textual Collections , 2010, IEEE Transactions on Computers.

[25]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[26]  Jiangchuan Liu,et al.  Statistics and Social Network of YouTube Videos , 2008, 2008 16th Interntional Workshop on Quality of Service.

[27]  Songqing Chen,et al.  Analyzing patterns of user content generation in online social networks , 2009, KDD.