Workload characterization of a location-based social network

Recently, there has been a large popularization of location-based social networks, such as Foursquare and Apontador, in which users can share their current locations, upload tips and make comments about places. Part of this popularity is due to facility access to the Internet through mobile devices with GPS. Despite the various efforts towards understanding characteristics of these systems, little is known about the access pattern of users in these systems. Providers of this kind of services need to deal with different challenges that could benefit of such understanding, such as content storage, performance and scalability of servers, personalization and service differentiation for users. This article aims at characterizing and modeling the patterns of requests that reach a server of a location-based social network. To do that, we use a dataset obtained from Apontador, a Brazilian system with characteristics similar to Foursquare and Gowalla, where users share information about their locations and can navigate on existent system locations. As results, we identified models that describe unique characteristics of the user sessions on this kind of system, patterns in which requests arrive on the server as well as the access profile of users in the system.

[1]  Krishna P. Gummadi,et al.  On word-of-mouth based discovery of the web , 2011, IMC '11.

[2]  Adriano M. Pereira,et al.  Evaluating the impact of reactivity on the performance of Web applications , 2006, 2006 IEEE International Performance Computing and Communications Conference.

[3]  Jia Wang,et al.  A survey of web caching schemes for the Internet , 1999, CCRV.

[4]  Ben Y. Zhao,et al.  Exploiting locality of interest in online social networks , 2010, CoNEXT.

[5]  Martin Arlitt,et al.  Workload Characterization of the 1998 World Cup Web Site , 1999 .

[6]  Serge Fdida,et al.  Future internet research and experimentation: the FIRE initiative , 2007, CCRV.

[7]  Anja Feldmann,et al.  Understanding online social network usage from a network perspective , 2009, IMC '09.

[8]  Ítalo S. Cunha,et al.  Analyzing client interactivity in streaming media , 2004, WWW '04.

[9]  Daniel A. Menascé,et al.  Scaling for E-Business: Technologies, Models, Performance, and Capacity Planning , 2000 .

[10]  Virgílio A. F. Almeida,et al.  A hierarchical characterization of a live streaming media workload , 2006, TNET.

[11]  Xifeng Yan,et al.  Workload characterization and prediction in the cloud: A multiple time series approach , 2012, 2012 IEEE Network Operations and Management Symposium.

[12]  C. Audemard,et al.  Needle in a haystack: involvement of the copepod Paracartia grani in the life-cycle of the oyster pathogen Marteilia refringens , 2002, Parasitology.

[13]  Virgílio A. F. Almeida,et al.  Understanding factors that affect response rates in twitter , 2012, HT '12.

[14]  Paul Barford,et al.  Generating representative Web workloads for network and server performance evaluation , 1998, SIGMETRICS '98/PERFORMANCE '98.

[15]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[16]  Virgílio A. F. Almeida,et al.  Characterization and Analysis of User Profiles in Online Video Sharing Systems , 2010, J. Inf. Data Manag..

[17]  Cecilia Mascolo,et al.  An Empirical Study of Geographic User Activity Patterns in Foursquare , 2011, ICWSM.

[18]  Pablo Rodriguez,et al.  Explore what-if scenarios with SONG: Social Network Write Generator , 2011, ArXiv.

[19]  Li Fan,et al.  Summary cache: a scalable wide-area web cache sharing protocol , 2000, TNET.

[20]  Adriano M. Pereira,et al.  Assessing the impact of reactive workloads on the performance of Web applications , 2006, 2006 IEEE International Symposium on Performance Analysis of Systems and Software.

[21]  Jordi Torres,et al.  Characterization of workload and resource consumption for an online travel and booking site , 2010, IEEE International Symposium on Workload Characterization (IISWC'10).

[22]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[23]  Cecilia Mascolo,et al.  Exploiting Semantic Annotations for Clustering Geographic Areas and Users in Location-based Social Networks , 2011, The Social Mobile Web.

[24]  Virgílio A. F. Almeida,et al.  A methodology for workload characterization of E-commerce sites , 1999, EC '99.

[25]  Jerome A. Rolia,et al.  A Synthetic Workload Generation Technique for Stress Testing Session-Based Systems , 2006, IEEE Transactions on Software Engineering.

[26]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[27]  Pablo Rodriguez,et al.  I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system , 2007, IMC '07.

[28]  Gang Lu,et al.  Characterization of real workloads of web search engines , 2011, 2011 IEEE International Symposium on Workload Characterization (IISWC).

[29]  Salvatore Scellato "Beyond the social web: the geo-social revolution" by Salvatore Scellato with Ching-man Au Yeung as Coordinator , 2011, LINK.

[30]  Azer Bestavros,et al.  Changes in Web client access patterns: Characteristics and caching implications , 1999, World Wide Web.

[31]  Virgílio A. F. Almeida,et al.  Characterizing user behavior in online social networks , 2009, IMC '09.

[32]  Virgílio A. F. Almeida,et al.  Web cache replacement policies: properties, limitations and implications , 2005, Third Latin American Web Congress (LA-WEB'2005).

[33]  Virgílio A. F. Almeida,et al.  Tips, dones and todos: uncovering user profiles in foursquare , 2012, WSDM '12.

[34]  Richard B. Bunt,et al.  Hierarchical Workload Characterization for a Busy Web Server , 2002, Computer Performance Evaluation / TOOLS.

[35]  Jerome A. Rolia,et al.  Characterizing the scalability of a large web-based shopping system , 2001, ACM Trans. Internet Techn..

[36]  Virgílio A. F. Almeida,et al.  A hierarchical characterization of a live streaming media workload , 2006 .

[37]  Zongpeng Li,et al.  Youtube traffic characterization: a view from the edge , 2007, IMC '07.

[38]  Virgílio A. F. Almeida,et al.  Characterizing user navigation and interactions in online social networks , 2012, Inf. Sci..

[39]  Martin F. Arlitt,et al.  Characterizing Web user sessions , 2000, PERV.

[40]  Cecilia Mascolo,et al.  Track globally, deliver locally: improving content delivery networks by tracking geographic social cascades , 2011, WWW.

[41]  Zongpeng Li,et al.  Characterizing user sessions on YouTube , 2008, Electronic Imaging.

[42]  Virgílio A. F. Almeida,et al.  Traffic Characteristics and Communication Patterns in Blogosphere , 2006, ICWSM.