Diving into cloud-based file synchronization with user collaboration

In this paper, we take a close look to understand the cloud-based file synchronization and collaboration systems. Using the popular Dropbox as a case study, our measurement reveals its cascaded computation and communication operations that are far more complicated than those in conventional file hosting. We show that this serial design is necessary for the cloud deployment, which effectively avoids the possible task interference inside the computation cloud; yet it also leads to higher service variance across users. Even worse, in a collaborative file editing session, users' updates would be discarded without any warning. The drop rate is unfortunately related to the slowest collaborator, which severely hinders the system scalability and user satisfaction. We further investigate the root causes of this phenomenon as well as other performance bottlenecks and offer hints for practical improvement.

[1]  Feng Wang,et al.  On the impact of virtualization on Dropbox-like cloud file storage/synchronization services , 2012, 2012 IEEE 20th International Workshop on Quality of Service.

[2]  Xiaowei Yang,et al.  CloudCmp: comparing public cloud providers , 2010, IMC '10.

[3]  Ada Gavrilovska,et al.  Cloud4Home -- Enhancing Data Services with @Home Clouds , 2011, 2011 31st International Conference on Distributed Computing Systems.

[4]  Anees Shaikh,et al.  CloudNaaS: a cloud networking platform for enterprise applications , 2011, SoCC.

[5]  Yunhao Liu,et al.  Towards Network-level Efficiency for Cloud Storage Services , 2014, Internet Measurement Conference.

[6]  Erran L. Li,et al.  CloudFlex: Seamless scaling of enterprise applications into the cloud , 2011, 2011 Proceedings IEEE INFOCOM.

[7]  Anees Shaikh,et al.  Kingfisher: Cost-aware elasticity in the cloud , 2011, 2011 Proceedings IEEE INFOCOM.

[8]  Myungjin Lee,et al.  Enabling Flow-level Latency Measurements across Routers in Data Centers , 2011, Hot-ICE.

[9]  Torsten Braun,et al.  Internet service monitoring with mobile agents , 2002, IEEE Netw..

[10]  Sachin Agarwal,et al.  On the scalability of data synchronization protocols for PDAs and mobile devices , 2002, IEEE Netw..

[11]  Sriram Ramabhadran,et al.  Cloud control with distributed rate limiting , 2007, SIGCOMM '07.

[12]  Garth A. Gibson,et al.  RAID: high-performance, reliable secondary storage , 1994, CSUR.

[13]  Bo Li,et al.  FS2You: Peer-Assisted Semipersistent Online Hosting at a Large Scale , 2010, IEEE Transactions on Parallel and Distributed Systems.

[14]  Bo Li,et al.  CloudMedia: When Cloud on Demand Meets Video on Demand , 2011, 2011 31st International Conference on Distributed Computing Systems.

[15]  Ryan Shea,et al.  Understanding the impact of Denial of Service attacks on Virtual Machines , 2012, 2012 IEEE 20th International Workshop on Quality of Service.

[16]  Aiko Pras,et al.  Inside dropbox: understanding personal cloud storage services , 2012, Internet Measurement Conference.