On the performance of cloud storage applications with global measurement

In recent years, Dropbox, Google, and Microsoft have been competing in the market of consumer cloud storage (CCS) services. While once the key comparative metric, storage capacity per user has outgrown the needs of most users. Today, third-party applications based on CCS's RESTful Web APIs are becoming a primary way for users to utilize their expanded storage resources. Unfortunately, there is very little visibility into the performance of these Web APIs, even though they are primary determinants of the end user experience on these storage applications. In this paper, we report results from a comprehensive measurement study of the Web APIs of five popular CCS providers. Our results reveal significant differences and limitations in API performance, which result in performance bottlenecks visible to the user through the storage application. We analyze the underlying system designs of the five providers' Web APIs, and present the performance implications of their different design choices. Our research provides practical guidance for service providers to optimize their API performance, for developers to improve the experience of third-party applications, and for users to pick appropriate services that best match their requirements.

[1]  Athanasios V. Vasilakos,et al.  Managing Performance Overhead of Virtual Machines in Cloud Computing: A Survey, State of the Art, and Future Directions , 2014, Proceedings of the IEEE.

[2]  Yunhao Liu,et al.  Towards Network-level Efficiency for Cloud Storage Services , 2014, Internet Measurement Conference.

[3]  Pietro Michiardi,et al.  A measurement study of the Wuala on-line storage service , 2012, 2012 IEEE 12th International Conference on Peer-to-Peer Computing (P2P).

[4]  Hai Jin,et al.  Flexible Instance: Meeting Deadlines of Delay Tolerant Jobs in the Cloud with Dynamic Pricing , 2016, 2016 IEEE 36th International Conference on Distributed Computing Systems (ICDCS).

[5]  Raimund Schatz,et al.  A first look at quality of experience in Personal Cloud Storage services , 2013, 2013 IEEE International Conference on Communications Workshops (ICC).

[6]  Jie Li,et al.  Early observations on the performance of Windows Azure , 2010, HPDC '10.

[7]  Andreas Bergen,et al.  Client bandwidth: The forgotten metric of online storage providers , 2011, Proceedings of 2011 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing.

[8]  Michael Vrable,et al.  BlueSky: a cloud-backed file system for the enterprise , 2012, FAST.

[9]  Dhiru Kholia,et al.  Looking Inside the (Drop) Box , 2013, WOOT.

[10]  Matei Ripeanu,et al.  Amazon S3 for science grids: a viable solution? , 2008, DADC '08.

[11]  Srinath T. V. Setty,et al.  Depot: Cloud Storage with Minimal Trust , 2010, TOCS.

[12]  Fangming Liu,et al.  UniDrive: Synergize Multiple Consumer Cloud Storage Services , 2015, Middleware.

[13]  Fred Douglis,et al.  Characteristics of backup workloads in production systems , 2012, FAST.

[14]  Miguel Correia,et al.  DepSky: Dependable and Secure Storage in a Cloud-of-Clouds , 2013, TOS.

[15]  Ben Y. Zhao,et al.  Efficient Batched Synchronization in Dropbox-Like Cloud Storage Services , 2013, Middleware.

[16]  Aiko Pras,et al.  Inside dropbox: understanding personal cloud storage services , 2012, Internet Measurement Conference.

[17]  Benny Pinkas,et al.  Proofs of ownership in remote storage systems , 2011, CCS '11.

[18]  Stefan Savage,et al.  Modeling TCP latency , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[19]  Michael Vrable,et al.  Cumulus: Filesystem backup to the cloud , 2009, TOS.

[20]  Hong Xu,et al.  Dynamic SDN controller assignment in data center networks: Stable matching with transfers , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[21]  Raúl Gracia Tinedo,et al.  Actively Measuring Personal Cloud Storage , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[22]  Gang Liu,et al.  Cloud download: using cloud utilities to achieve high-quality content distribution for unpopular videos , 2011, ACM Multimedia.

[23]  Krishna P. Gummadi,et al.  Improving the Reliability of Internet Paths with One-hop Source Routing , 2004, OSDI.

[24]  Edgar R. Weippl,et al.  Dark Clouds on the Horizon: Using Cloud Storage as Attack Vector and Online Slack Space , 2011, USENIX Security Symposium.

[25]  Benny Pinkas,et al.  Side Channels in Cloud Services: Deduplication in Cloud Storage , 2010, IEEE Security & Privacy.

[26]  Guangwen Yang,et al.  Understanding Data Characteristics and Access Patterns in a Cloud Storage System , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[27]  Jeanna Neefe Matthews,et al.  The good, the bad and the ugly of consumer cloud storage , 2010, OPSR.

[28]  Aiko Pras,et al.  Benchmarking personal cloud storage , 2013, Internet Measurement Conference.

[29]  Hai Jin,et al.  Fair Network Bandwidth Allocation in IaaS Datacenters via a Cooperative Game Approach , 2016, IEEE/ACM Transactions on Networking.

[30]  Feng Wang,et al.  On the impact of virtualization on Dropbox-like cloud file storage/synchronization services , 2012, 2012 IEEE 20th International Workshop on Quality of Service.

[31]  Ju Wang,et al.  Windows Azure Storage: a highly available cloud storage service with strong consistency , 2011, SOSP.

[32]  Hai Jin,et al.  Carbon-Aware Online Control of Geo-Distributed Cloud Services , 2016, IEEE Transactions on Parallel and Distributed Systems.

[33]  Alex Borges Vieira,et al.  Modeling the Dropbox client behavior , 2014, 2014 IEEE International Conference on Communications (ICC).