Large scale Online Social Networks (OSNs) like Facebook and Twitter are hosted out of multiple geo-diverse data centers to provide low latency and high fault tolerance. Such geo-diversity creates large amounts of WAN traffic for maintaining the consistency of replicas at different locations. Despite the dropping price of WAN bandwidth, the growth rate of OSNs combined with the incorporation of media rich long tail content (including images and videos) makes WAN traffic costs an increasing concern for OSN operators. At the heart of the problem lies a tradeoff between consistency and WAN bandwidth cost. In this paper, we propose the “Wait Your Turn”; WYT system that optimizes the tradeoff by leveraging: (i) knowledge of mapping between social relationships and geographic location, and (ii) knowledge of timing regularities in end user activity patterns. We quantify the benefits of such an OSN-aware update propagation strategy through a trace-driven analysis and show that it reduces WAN traffic by 55% compared to an immediate update of all replicas, while having minimal impact on consistency. Furthermore, for a given budget for WAN bandwidth, WYT increases consistency by several orders of magnitude compared to FIFO scheduling of updates. Key-words: Data centers, Social Networks, Replication ∗ Telefonica Investigacion y Desarrollo (Telefonica I+D), Barcelona † Department of Electrical Engineering and Computer Science (EECS), Northwestern University in ria -0 05 04 91 3, v er si on 1 30 J an 2 01 2 WYT: Optimisation de la coherence dans les reseaux sociaux geograpiquement distribues Resume : Mots-cles : in ria -0 05 04 91 3, v er si on 1 30 J an 2 01 2 WYT: Optimized Consistency for Geo-Diverse OSNs 3
[1]
R. Sinnott.
Virtues of the Haversine
,
1984
.
[2]
Keith W. Ross,et al.
Measuring and Evaluating Large-Scale CDNs
,
2008
.
[3]
George Varghese,et al.
EndRE: An End-System Redundancy Elimination Service for Enterprises
,
2010,
NSDI.
[4]
Vyas Sekar,et al.
SmartRE: an architecture for coordinated network-wide redundancy elimination
,
2009,
SIGCOMM '09.
[5]
David Wetherall,et al.
A protocol-independent technique for eliminating redundant network traffic
,
2000,
SIGCOMM.
[6]
Alec Wolman,et al.
Volley: Automated Data Placement for Geo-Distributed Cloud Services
,
2010,
NSDI.
[7]
Jasmine Novak,et al.
Geographic routing in social networks
,
2005,
Proc. Natl. Acad. Sci. USA.
[8]
Pablo Rodriguez,et al.
The little engine(s) that could: scaling online social networks
,
2012,
TNET.
[9]
Hosung Park,et al.
What is Twitter, a social network or a news media?
,
2010,
WWW '10.
[10]
Pablo Rodriguez,et al.
Divide and Conquer: Partitioning Online Social Networks
,
2009,
ArXiv.
[11]
Pablo Rodriguez,et al.
Delay-Tolerant Bulk Data Transfers on the Internet
,
2009,
IEEE/ACM Transactions on Networking.
[12]
Anja Feldmann,et al.
Understanding online social network usage from a network perspective
,
2009,
IMC '09.