Active Data Replica Recovery for Quality-Assurance Big Data Analysis in IC-IoT

QoS-aware big data analysis is critical in Information-Centric Internet of Things (IC-IoT) system to support various applications like smart city, smart grid, smart health, intelligent transportation systems, and so on. The employment of non-volatile memory (NVM) in cloud or edge system provides good opportunity to improve quality of data analysis tasks. However, we have to face the data recovery problem led by NVM failure due to the limited write endurance. In this paper, we investigate the data recovery problem for QoS guarantee and system robustness, followed by proposing a rarity-aware data recovery algorithm. The core idea is to establish the rarity indicator to evaluate the replica distribution and service requirement comprehensively. With this idea, we give the lost replicas with distinguishing priority and eliminate the unnecessary replicas. Then, the data replicas are recovered stage by stage to guarantee QoS and provide system robustness. From our extensive experiments and simulations, it is shown that the proposed algorithm has significant performance improvement on QoS and robustness than the traditional direct data recovery method. Besides, the algorithm gives an acceptable data recovery time.

[1]  Kaoru Ota,et al.  Orchestrating Data as a Services-Based Computing and Communication Model for Information-Centric Internet of Things , 2018, IEEE Access.

[2]  Jianhua Li,et al.  Big Data Analysis-Based Secure Cluster Management for Optimized Control Plane in Software-Defined Networks , 2018, IEEE Transactions on Network and Service Management.

[3]  Lei Wang,et al.  Physical principles and current status of emerging non-volatile solid state memories , 2015, Electronic Materials Letters.

[4]  Wei-Che Tseng,et al.  Software enabled wear-leveling for hybrid PCM main memory on embedded systems , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[5]  Jiwu Shu,et al.  Load-Balanced Recovery Schemes for Single-Disk Failure in Storage Systems with Any Erasure Code , 2013, 2013 42nd International Conference on Parallel Processing.

[6]  Jie Wu,et al.  Understanding Graph-Based Trust Evaluation in Online Social Networks , 2016, ACM Comput. Surv..

[7]  Vijayalakshmi Srinivasan,et al.  Enhancing lifetime and security of PCM-based Main Memory with Start-Gap Wear Leveling , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[8]  GhemawatSanjay,et al.  The Google file system , 2003 .

[9]  Jiwu Shu,et al.  Seek-Efficient I/O Optimization in Single Failure Recovery for XOR-Coded Storage Systems , 2017, IEEE Trans. Parallel Distributed Syst..

[10]  Xiaozhou Li,et al.  Flat XOR-based erasure codes in storage systems: Constructions, efficient recovery, and tradeoffs , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[11]  Jianhua Li,et al.  Edge-MapReduce-Based Intelligent Information-Centric IoV: Cognitive Route Planning , 2019, IEEE Access.

[12]  Xin Li,et al.  Migration-Based Online CPSCN Big Data Analysis in Data Centers , 2018, IEEE Access.

[13]  Weiwei Lin,et al.  An Ensemble Random Forest Algorithm for Insurance Big Data Analysis , 2017, IEEE Access.

[14]  Jie Xu,et al.  Extending the lifetime of NVMs with compression , 2018, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[15]  Xin Li,et al.  Topology-Aware VM Placement for Network Optimization in Cloud Data Centers , 2017, 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC).

[16]  Chung-Ming Huang,et al.  The Vehicular Social Network (VSN)-Based Sharing of Downloaded Geo Data Using the Credit-Based Clustering Scheme , 2018, IEEE Access.

[17]  Zhaoquan Cai,et al.  Towards secure and flexible EHR sharing in mobile health cloud under static assumptions , 2017, Cluster Computing.

[18]  Cheng Huang,et al.  Rethinking erasure codes for cloud file systems: minimizing I/O for recovery and degraded reads , 2012, FAST.

[19]  Mianxiong Dong,et al.  Real-Time Awareness Scheduling for Multimedia Big Data Oriented In-Memory Computing , 2018, IEEE Internet of Things Journal.

[20]  Junsang Kim,et al.  Real-time data replication strategy for data grids , 2017, Cluster Computing.

[21]  Ki-Woong Park,et al.  Adaptive wear-leveling algorithm for PRAM main memory with a DRAM buffer , 2014, ACM Trans. Embed. Comput. Syst..

[22]  Victor C. M. Leung,et al.  Predicting Temporal Social Contact Patterns for Data Forwarding in Opportunistic Mobile Networks , 2017, IEEE Transactions on Vehicular Technology.

[23]  Li Yan,et al.  A Flexible Dynamic Migration Strategy for Cloud Data Replica , 2017, 2017 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData).

[24]  Hai Jin,et al.  Hardware/software cooperative caching for hybrid DRAM/NVM memory architectures , 2017, ICS '17.

[25]  Tei-Wei Kuo,et al.  Age-based PCM wear leveling with nearly zero search cost , 2012, DAC Design Automation Conference 2012.

[26]  Nāgārjuna,et al.  A Secure Erasure Code-Based Cloud Storage System with Secure Data Forwarding , 2014 .

[27]  Karin Strauss,et al.  Use ECP, not ECC, for hard failures in resistive memories , 2010, ISCA.

[28]  Jin Li,et al.  Secure attribute-based data sharing for resource-limited users in cloud computing , 2018, Comput. Secur..

[29]  Ryousei Takano,et al.  RAMinate: Hypervisor-based Virtualization for Hybrid Main Memory Systems , 2016, SoCC.

[30]  Vijayalakshmi Srinivasan,et al.  Scalable high performance main memory system using phase-change memory technology , 2009, ISCA '09.

[31]  Jiwu Shu,et al.  Reconsidering Single Failure Recovery in Clustered File Systems , 2016, 2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).

[32]  Jiwu Shu,et al.  Cross-Rack-Aware Single Failure Recovery for Clustered File Systems , 2020, IEEE Transactions on Dependable and Secure Computing.

[33]  Julia Myint,et al.  Management of Data Replication for PC Cluster-based Cloud Storage System , 2011, CloudCom 2011.

[34]  Xiuhua Li,et al.  Data Offloading Techniques Through Vehicular Ad Hoc Networks: A Survey , 2018, IEEE Access.

[35]  Jun Yang,et al.  LLS: Cooperative integration of wear-leveling and salvaging for PCM main memory , 2011, 2011 IEEE/IFIP 41st International Conference on Dependable Systems & Networks (DSN).

[36]  Mianxiong Dong,et al.  ECCN: Orchestration of Edge-Centric Computing and Content-Centric Networking in the 5G Radio Access Network , 2018, IEEE Wireless Communications.