Availability Enhancement of Riak TS Using Resource-Aware Mechanism

The dependability and elasticity of various NoSQL stores in critical application are still worth studying. Currently, the cluster and backup technologies are commonly used for improving NoSQL availability, but these approaches do not consider the availability reduction when NoSQL stores encounter performance bottlenecks. In order to enhance the availability of Riak TS effectively, a resource-aware mechanism is proposed. Firstly, the data table is sampled according to time, the correspondence between time and data is acquired, and the real-time resource consumption is recorded by Prometheus. Based on the sampling results, the polynomial curve fitting algorithm is used to constructing prediction curve. Then the resources required for the upcoming operation are predicted by the time interval in the SQL statement, and the operation is evaluated by comparing with the remaining resources. Using the real hydrological sensor dataset as experimental data, the effectiveness of the mechanism is experimented in two aspects of sensitivity and specificity, respectively. The results show that through the availability enhancement mechanism, the average specificity is 80.55% and the sensitivity is 76.31% which use the initial sampling dataset. As training datasets increase, the specificity increases from 80.55% to 92.42%, and the sensitivity increases from 76.31% to 87.90%. Besides, the availability increases from 40.33% to 89.15% in hydrological application scenarios. Experimental results show that this resource-aware mechanism can effectively prevent potential availability problems and enhance the availability of Riak TS. Moreover, as the number of users and the size of the data collected grow, our method will become more accurate and perfect.

[1]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[2]  Jorge Bernardino,et al.  Choosing the right NoSQL database for the job: a quality attribute evaluation , 2015, Journal of Big Data.

[3]  Patricia Morreale,et al.  Scalable Agentless Cloud Network Monitoring , 2017, 2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud).

[4]  Huai Liu,et al.  Metamorphic Testing , 2018, ACM Comput. Surv..

[5]  Lei Li,et al.  MongoDB NoSQL Injection Analysis and Detection , 2016, 2016 IEEE 3rd International Conference on Cyber Security and Cloud Computing (CSCloud).

[6]  Taghi M. Khoshgoftaar,et al.  Deep learning applications and challenges in big data analytics , 2015, Journal of Big Data.

[7]  Amreen Khan,et al.  A multidimensional data storage model for location based application on Hbase , 2015, 2015 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS).

[8]  Shuhong Chen,et al.  Elastic Database Replication in the Cloud , 2015, ICA3PP.

[9]  Shen De,et al.  Survey on NoSQL for Management of Big Data , 2013 .

[10]  Guy Harrison,et al.  Next Generation Databases , 2015, Apress.

[11]  Paola Inverardi,et al.  Model-based performance prediction in software development: a survey , 2004, IEEE Transactions on Software Engineering.

[12]  Eduardo Huedo,et al.  Orchestrating the Deployment of High Availability Services on Multi-zone and Multi-cloud Scenarios , 2018, Journal of Grid Computing.

[13]  Ge Yu,et al.  Survey on NoSQL for Management of Big Data: Survey on NoSQL for Management of Big Data , 2014 .

[14]  Kevin Mobley,et al.  Software performance in the real world: personal lessons from the performance trauma team , 2007, WOSP '07.

[15]  M. Indiramma,et al.  A novel redis security extension for NoSQL database using authentication and encryption , 2015, 2015 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[16]  Qi Huang,et al.  Gorilla: A Fast, Scalable, In-Memory Time Series Database , 2015, Proc. VLDB Endow..

[17]  Sudhakar,et al.  Context based Cassandra query language , 2017, 2017 8th International Conference on Computing, Communication and Networking Technologies (ICCCNT).

[18]  Kostas Magoutis,et al.  A Measurement-Based Approach to Performance Prediction in NoSQL Systems , 2017, 2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS).

[19]  Teodor-Florin Fortis,et al.  Benchmarking Database Systems for the Requirements of Sensor Readings , 2009 .

[20]  Guy Harrison Distributed Database Patterns , 2015 .

[21]  Norbert Ritter,et al.  NoSQL database systems: a survey and decision guidance , 2017, Computer Science - Research and Development.

[22]  Rouven Krebs,et al.  Automated inference of goal-oriented performance prediction functions , 2012, 2012 Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering.

[23]  Kostas Magoutis,et al.  Managing Service Performance in the Cassandra Distributed Storage System , 2013, 2013 IEEE 5th International Conference on Cloud Computing Technology and Science.

[24]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.