Mitigating Sync Amplification for Copy-on-write Virtual Disk

Copy-on-write virtual disks (e.g., qcow2 images) provide many useful features, such as snapshots, deduplication, and full-disk encryption. However, our study shows that they introduce additional metadata for block organization and notably more disk sync operations (e.g., more than 3X for qcow2 and 4X for VMDK images). To mitigate this sync amplification, we propose three optimizations, namely per-virtual-disk internal journaling, dual-mode journaling, and adaptive preallocation, which eliminate the extra sync operations while preserving those features in a consistent way. Our evaluation shows that the three optimizations yield speedups of up to 110% for varmail and 50% for TPCC.
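To make the notion of sync amplification concrete, the following is a minimal toy model, not the paper's measurement method and not the behavior of any real image format. It assumes, purely for illustration, that the first write to an unmapped cluster costs two extra host syncs (one to order the allocation metadata, one to order the mapping update) on top of the sync the guest requested. The names ToyCowImage, guest_write_and_flush, and amplification are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ToyCowImage:
    """Toy copy-on-write image that only counts host sync operations.

    Assumption (illustrative only): allocating a new cluster needs two
    ordered metadata updates, each followed by its own host sync.
    """
    mapped: set = field(default_factory=set)
    syncs: int = 0

    def _sync(self) -> None:
        # Stand-in for a host-side sync such as fdatasync(); only counted here.
        self.syncs += 1

    def guest_write_and_flush(self, cluster: int) -> None:
        if cluster not in self.mapped:
            # First write to this cluster: persist allocation metadata,
            # then persist the mapping entry, each with an ordering sync.
            self._sync()
            self._sync()
            self.mapped.add(cluster)
        # The sync the guest actually asked for.
        self._sync()


def amplification(clusters: list) -> float:
    """Host syncs issued per guest flush for a sequence of cluster writes."""
    if not clusters:
        raise ValueError("need at least one guest write")
    image = ToyCowImage()
    for cluster in clusters:
        image.guest_write_and_flush(cluster)
    return image.syncs / len(clusters)


if __name__ == "__main__":
    print(amplification([0, 1, 2, 3]))  # every write allocates a new cluster
    print(amplification([0, 0, 0, 0]))  # only the first write allocates
```

Under these assumed costs, a workload that keeps touching fresh clusters pays the metadata syncs on every flush, while overwrites of already-mapped clusters do not. This is the kind of overhead that journaling metadata updates or preallocating clusters aims to remove.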
