A dynamic data dissemination mechanism for Cassandra NoSQL data store

Cassandra as a type of NoSQL databases has been put forward so as to surmount the hurdles of traditional relational databases in the scope of big data as well as real-time applications whose paramount traits are high-speed data production (volume) and miscellaneous data formats (variety). The dynamic nature of distributed data, distributed systems, and their concomitant applications results in skewed data access patterns, thereby causing imbalanced data issues, and in turn consecutive performance deterioration after all. In this study, we have proposed a dynamic data dissemination (D3) strategy well conforming to the dynamic behavior of distributed environment, including diversified as well as temporal popularity of data requests, and heterogeneous node capacity. The assessment results have shed light on performance improvement.

[1]  Stefan Jablonski,et al.  NoSQL evaluation: A use case oriented survey , 2011, 2011 International Conference on Cloud and Service Computing.

[2]  Paul Zikopoulos,et al.  Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data , 2011 .

[3]  João Paulo,et al.  MeT: workload aware elasticity for NoSQL , 2013, EuroSys '13.

[4]  Fatos Xhafa Special issue on advanced techniques for cloud data storage and collaborative systems , 2018, Concurr. Comput. Pract. Exp..

[5]  Rabi Prasad Padhy,et al.  RDBMS to NoSQL: Reviewing Some Next-Generation Non-Relational Database's , 2011 .

[6]  Erik Brynjolfsson,et al.  Big data: the management revolution. , 2012, Harvard business review.

[7]  Sathiamoorthy Manoharan,et al.  A performance comparison of SQL and NoSQL databases , 2013, 2013 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PACRIM).

[8]  Prashant Malik,et al.  Cassandra: a decentralized structured storage system , 2010, OPSR.

[9]  M. Anusha,et al.  Big Data-Survey , 2016 .

[10]  Alejandro Zunino,et al.  Persisting big-data: The NoSQL landscape , 2017, Inf. Syst..

[11]  Ioannis Konstantinou,et al.  Fast and Cost-Effective Online Load-Balancing in Distributed Range-Queriable Systems , 2011, IEEE Transactions on Parallel and Distributed Systems.

[12]  Mohsen Sharifi,et al.  Dynamic Multilevel Feedback-Based Searching Strategy in Unstructured Peer-to-Peer Systems , 2012, 2012 IEEE International Conference on Green Computing and Communications.

[13]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[14]  Konstantinos Tserpes,et al.  Load Balancing in In-Memory Key-Value Stores for Response Time Minimization , 2016, GECON.

[15]  Madhusudhan Govindaraju,et al.  An Evaluation of Cassandra for Hadoop , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[16]  Ioannis Konstantinou,et al.  DBalancer: distributed load balancing for NoSQL data-stores , 2013, SIGMOD '13.

[17]  Eben Hewitt Cassandra - The Definitive Guide: Distributed Data at Web Scale , 2011 .

[18]  Albert G. Greenberg,et al.  Scarlett: coping with skewed content popularity in mapreduce clusters , 2011, EuroSys '11.

[19]  Ayoub Ait Lahcen,et al.  Big Data technologies: A survey , 2017, J. King Saud Univ. Comput. Inf. Sci..

[20]  Deyu Qi,et al.  A general framework for big data knowledge discovery and integration , 2018, Concurr. Comput. Pract. Exp..

[21]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[22]  Kostas Magoutis,et al.  Incremental Elasticity for NoSQL Data Stores , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[23]  Manish Parashar,et al.  Incentivising Resource Sharing in Social Clouds , 2012, 2012 IEEE 21st International Workshop on Enabling Technologies: Infrastructure for Collaborative Enterprises.

[24]  Vijay V. Raghavan,et al.  NoSQL Systems for Big Data Management , 2014, 2014 IEEE World Congress on Services.

[25]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[26]  Francisco Maia,et al.  Workload-aware table splitting for NoSQL , 2014, SAC.

[27]  Konstantinos Tserpes,et al.  A Classification of NoSQL Data Stores Based on Key Design Characteristics , 2016, Cloud Forward.

[28]  Shivendra Kumar Pandey,et al.  An Approach to Improve Load Balancing in Distributed Storage Systems for NoSQL Databases: MongoDB , 2018 .

[29]  Cornelia Gyorodi,et al.  A Comparative Study of Relational and Non-Relational Database Models in a Web- Based Application , 2015 .

[30]  Nishant Neeraj Mastering Apache Cassandra , 2013 .

[31]  David J. DeWitt,et al.  Can the Elephants Handle the NoSQL Onslaught? , 2012, Proc. VLDB Endow..

[32]  Carlos Eduardo,et al.  An elasticity controller for applications orquestrated with Cloudify , 2016 .

[33]  Yousaf Muhammad Evaluation and Implementation of Distributed NoSQL Database for MMO Gaming Environment , 2011 .

[34]  Tilmann Rabl,et al.  Solving Big Data Challenges for Enterprise Application Performance Management , 2012, Proc. VLDB Endow..

[35]  Neil A. Ernst,et al.  Performance Evaluation of NoSQL Databases: A Case Study , 2015, PABS@ICPE.

[36]  Jianmin Wang,et al.  An experimental study on tuning the consistency of NoSQL systems , 2017, Concurr. Comput. Pract. Exp..