Replication Strategy for Spatiotemporal Data Based on Distributed Caching System

The replica strategy in distributed cache can effectively reduce user access delay and improve system performance. However, developing a replica strategy suitable for varied application scenarios is still quite challenging, owing to differences in user access behavior and preferences. In this paper, a replication strategy for spatiotemporal data (RSSD) based on a distributed caching system is proposed. By taking advantage of the spatiotemporal locality and correlation of user access, RSSD mines high popularity and associated files from historical user access information, and then generates replicas and selects appropriate cache node for placement. Experimental results show that the RSSD algorithm is simple and efficient, and succeeds in significantly reducing user access delay.

[1]  Abdelkader Hameurlain,et al.  Dynamic replication strategies in data grid systems: a survey , 2015, The Journal of Supercomputing.

[2]  Dan Feng,et al.  CDRM: A Cost-Effective Dynamic Replication Management Scheme for Cloud Storage Cluster , 2010, 2010 IEEE International Conference on Cluster Computing.

[3]  Shaoming Pan,et al.  A dynamic replication management strategy in distributed GIS , 2018, Comput. Geosci..

[4]  Zhengquan Xu,et al.  Prefetching Scheme for Massive Spatiotemporal Data in a Smart City , 2016, Int. J. Distributed Sens. Networks.

[5]  Pangfeng Liu,et al.  QoS-aware, access-efficient, and storage-efficient replica placement in grid environments , 2008, The Journal of Supercomputing.

[6]  Daniel Grosu,et al.  A Distributed Algorithm for the Replica Placement Problem , 2011, IEEE Transactions on Parallel and Distributed Systems.

[7]  Yuan Yao,et al.  From digital Earth to smart Earth , 2014 .

[8]  Peter C. J. Graham,et al.  Distributed Placement of Replicas in Hierarchical Data Grids with User and System QoS Constraints , 2011, 2011 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[9]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[10]  Xindong Wu,et al.  A Distributed Cache for Hadoop Distributed File System in Real-Time Cloud Services , 2012, 2012 ACM/IEEE 13th International Conference on Grid Computing.

[11]  X. Xu,et al.  Data Replica Placement Mechanism for Open Heterogeneous Storage Systems , 2017, ANT/SEIT.

[12]  Shang Gao,et al.  Modeling a Dynamic Data Replication Strategy to Increase System Availability in Cloud Computing Environments , 2012, Journal of Computer Science and Technology.

[13]  Vijaya Nagarajan,et al.  A prediction-based dynamic replication strategy for data-intensive applications , 2017, Comput. Electr. Eng..

[14]  Qin Xiu Progress and Challenges of Distributed Caching Techniques in Cloud Computing , 2013 .

[15]  I-Ling Yen,et al.  Distributed replica placement algorithms for correlated data , 2013, The Journal of Supercomputing.

[16]  Qunying Huang,et al.  A replication strategy for a distributed high-speed caching system based on spatiotemporal access patterns of geospatial data , 2017, Comput. Environ. Urban Syst..

[17]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[18]  Wei Wang,et al.  Progress and Challenges of Distributed Caching Techniques in Cloud Computing: Progress and Challenges of Distributed Caching Techniques in Cloud Computing , 2014 .

[19]  Ming Tang,et al.  The impact of data replication on job scheduling performance in the Data Grid , 2006, Future Gener. Comput. Syst..

[20]  Li Wen Graph-Based Optimal Cache Deployment Algorithm for Distributed Caching Systems , 2010 .

[21]  Ming Tang,et al.  Dynamic replication algorithms for the multi-tier Data Grid , 2005, Future Gener. Comput. Syst..

[22]  Ruay-Shiung Chang,et al.  A dynamic data replication strategy using access-weights in data grids , 2008, The Journal of Supercomputing.

[23]  Yuan Yao,et al.  Big data in smart cities , 2015, Science China Information Sciences.

[24]  Ciprian Dobre,et al.  Smart City Mobility Simulation and Monitoring Platform , 2017, 2017 21st International Conference on Control Systems and Computer Science (CSCS).

[25]  Zhuzhong Qian,et al.  QoS-aware placement of stream processing service , 2010, The Journal of Supercomputing.

[26]  J. Morris Chang,et al.  QoS-Aware Data Replication for Data-Intensive Applications in Cloud Computing Systems , 2013, IEEE Transactions on Cloud Computing.