A light-weight log-based hybrid storage system

Abstract The Internet of Things (IoT) and cloud computing are two important technologies in our life. They are integrated together to support each other. However, with the rapid development of IoT, the disk-based storage systems in clouds fail to process and analyze data timely from IoT devices. In order to improve the storage capabilities of traditional disk-based storage systems, we present a light-weight log-based hybrid storage system, which comprises of the phase change memory (PCM), flash memory-based SSD and disks. These devices are merged as a single block device with a continuous linear-address space. We also design and implement a log-based hybrid file system called HFS for the block device. By combining the semantic characteristics of data with distinct performance characteristics of devices, HFS optimizes the file system layout and overcomes the shortcomings of traditional log-structured systems. We implement HFS in the Linux kernel and compare it with modern file systems such as Ext4 and F2FS. Evaluation results show the efficiency of the proposed light-weight log-based hybrid storage system.

[1]  Dan Feng,et al.  Improving flash-based disk cache with Lazy Adaptive Replacement , 2013, 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST).

[2]  Ian Lumb,et al.  A Taxonomy and Survey of Cloud Computing Systems , 2009, 2009 Fifth International Joint Conference on INC, IMS and IDC.

[3]  Peter Druschel,et al.  Anticipatory scheduling: a disk scheduling framework to overcome deceptive idleness in synchronous I/O , 2001, SOSP.

[4]  Wei Xiang,et al.  Big data-driven optimization for mobile networks toward 5G , 2016, IEEE Network.

[5]  Lingkun Wu,et al.  FSMAC: A file system metadata accelerator with non-volatile memory , 2013, 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST).

[6]  David Hung-Chang Du,et al.  Hot data identification for flash-based storage systems using multiple bloom filters , 2011, 2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST).

[7]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[8]  Gong Zhang,et al.  Adaptive Data Migration in Multi-tiered Storage Based Cloud Environment , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[9]  Geoffrey H. Kuenning,et al.  The Conquest file system: Better performance through a disk/persistent-RAM hybrid design , 2006, TOS.

[10]  Jit Biswas,et al.  Processing of wearable sensor data on the cloud - a step towards scaling of continuous monitoring of health and well-being , 2010, 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology.

[11]  Mahmut T. Kandemir,et al.  Disk-Cache and Parallelism Aware I/O Scheduling to Improve Storage System Performance , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[12]  Chonho Lee,et al.  A survey of mobile cloud computing: architecture, applications, and approaches , 2013, Wirel. Commun. Mob. Comput..

[13]  Song Jiang,et al.  A Scheduling Framework That Makes Any Disk Schedulers Non-Work-Conserving Solely Based on Request Characteristics , 2011, FAST.

[14]  Dave Evans,et al.  How the Next Evolution of the Internet Is Changing Everything , 2011 .

[15]  Antonio Pescapè,et al.  Integration of Cloud computing and Internet of Things: A survey , 2016, Future Gener. Comput. Syst..

[16]  Luis Carlos Erpen De Bona,et al.  A QoS aware non-work-conserving disk scheduler , 2012, 012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST).

[17]  Mithuna Thottethodi,et al.  SieveStore: a highly-selective, ensemble-level disk cache for cost-performance , 2010, ISCA '10.

[18]  Kang G. Shin,et al.  FS2: dynamic data replication in free disk space for improving disk performance and energy consumption , 2005, SOSP '05.

[19]  Jianxi Chen,et al.  Accelerating File System Metadata Access with Byte-Addressable Nonvolatile Memory , 2015, TOS.

[20]  Yifeng Zhu,et al.  Hot Random Off-Loading: A Hybrid Storage System with Dynamic Data Migration , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.

[21]  A. L. Narasimha Reddy,et al.  NVMFS: A hybrid file system for improving random write in nand-flash SSD , 2013, 2013 IEEE 29th Symposium on Mass Storage Systems and Technologies (MSST).

[22]  Da-Wei Chang,et al.  TridentFS: a hybrid file system for non‐volatile RAM, flash memory and magnetic disk , 2016, Softw. Pract. Exp..

[23]  Marimuthu Palaniswami,et al.  Internet of Things (IoT): A vision, architectural elements, and future directions , 2012, Future Gener. Comput. Syst..

[24]  Fei Tao,et al.  CCIoT-CMfg: Cloud Computing and Internet of Things-Based Cloud Manufacturing Service System , 2014, IEEE Transactions on Industrial Informatics.

[25]  Jeanna Matthews,et al.  Intel® Turbo Memory: Nonvolatile disk caches in the storage hierarchy of mainstream computer systems , 2008, TOS.

[26]  Raouf Boutaba,et al.  Cloud computing: state-of-the-art and research challenges , 2010, Journal of Internet Services and Applications.

[27]  Antonio Iera,et al.  The Internet of Things: A survey , 2010, Comput. Networks.

[28]  Seung-Ho Lim,et al.  PFFS: a scalable flash memory file system for the hybrid architecture of phase-change RAM and NAND flash , 2008, SAC '08.

[29]  Gong Zhang,et al.  Automated lookahead data migration in SSD-enabled multi-tiered storage systems , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).