Analyzing the Impact of Dropbox Content Sharing on an Academic Network

Cloud storage services (e.g., Dropbox) are a popular means of sharing content and performing collaborative work. Yet content sharing via the cloud can waste bandwidth when the same data is downloaded by different users in the same network domain. This paper first characterizes sharing patterns in Dropbox by analyzing data collected from a campus network over 4 months. We find that the volume of data sharing in such a homogeneous environment is reasonably high. Next, we use the characterization results to implement a synthetic workload generator that allows us to test alternatives to the Dropbox synchronization protocol. We then propose a synchronization architecture that includes network caches to temporarily hold user updates. Our evaluation of the proposed solution indicates that, even with a small cache, it is possible to achieve almost the maximum possible reduction in downloads from remote servers, thus benefiting storage providers, end users, and the Internet.
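To make the caching idea concrete, the following minimal sketch counts how many downloads a shared in-network cache could serve locally for a given request trace, and compares that with the upper bound in which every repeated download is avoided. It is an illustration only: the LRU eviction policy, the item-count capacity, and the names `simulate_lru_cache` and `max_avoidable_downloads` are assumptions made here, not the architecture or workload generator described in the paper.

```python
from collections import OrderedDict


def simulate_lru_cache(requests, capacity):
    """Count requests served by a shared LRU cache (hypothetical policy).

    requests: iterable of (user, content_id) download events in time order.
    capacity: maximum number of distinct content items the cache can hold.
    """
    cache = OrderedDict()  # content_id -> True, ordered oldest to newest use
    hits = 0
    for _user, content_id in requests:
        if content_id in cache:
            hits += 1
            cache.move_to_end(content_id)  # mark as most recently used
        else:
            cache[content_id] = True  # fetched from the remote server
            if len(cache) > capacity:
                cache.popitem(last=False)  # evict the least recently used item
    return hits


def max_avoidable_downloads(requests):
    """Upper bound: only the first download of each content item is needed."""
    total = 0
    distinct = set()
    for _user, content_id in requests:
        total += 1
        distinct.add(content_id)
    return total - len(distinct)


if __name__ == "__main__":
    # Toy trace, invented for illustration: users in one network fetch shared items.
    trace = [
        ("alice", "a"), ("bob", "a"), ("carol", "b"), ("bob", "b"),
        ("dave", "a"), ("alice", "c"), ("carol", "c"),
    ]
    bound = max_avoidable_downloads(trace)
    for capacity in (1, 2, 3):
        hits = simulate_lru_cache(trace, capacity)
        print(f"capacity={capacity}: {hits} of {bound} avoidable downloads served locally")
```

On this toy trace, a cache holding two items already reaches the upper bound, which mirrors the kind of comparison the evaluation describes; the trace itself carries no empirical weight.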
