MPT: Suite Tools to Support Performance Tuning in NoSQL Systems

NoSQL databases are considered as a serious alternative for processing data whose volume reaches limits that are difficult to manage by relational DBMS. So far, they are praised for the capability to scale, replication and their capability to deal with new flexible data models. Most of these systems are compared to read/write throughput and their ability to scale. However, there is a need to get more in depth to monitor more precise metrics related to RAM, CPU and disk usage. In this paper, we propose a benchmark suite tools that enables data generation, monitoring and comparison. It supports several NoSQL systems including: column-oriented, document-oriented as well as multistores. We present some experimental results that show its utility.

[1]  Lavanya Ramakrishnan,et al.  Performance evaluation of a MongoDB and hadoop platform for scientific data analysis , 2013, Science Cloud '13.

[2]  Abdullah Talha Kabakus,et al.  A performance evaluation of in-memory databases , 2017, J. King Saud Univ. Comput. Inf. Sci..

[3]  Yon Dohn Chung,et al.  Parallel data processing with MapReduce: a survey , 2012, SGMD.

[4]  Ruichun Hou,et al.  Cache and consistency in NOSQL , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[5]  Omar Boussaïd,et al.  Columnar NoSQL Star Schema Benchmark , 2014, MEDI.

[6]  Alejandro Zunino,et al.  Persisting big-data: The NoSQL landscape , 2017, Inf. Syst..

[7]  Adam Jacobs,et al.  The pathologies of big data , 2009, Commun. ACM.

[8]  Rick Cattell,et al.  Scalable SQL and NoSQL data stores , 2011, SGMD.

[9]  Max Chevalier,et al.  Benchmark for OLAP on NoSQL technologies comparing NoSQL multidimensional data warehousing solutions , 2015, 2015 IEEE 9th International Conference on Research Challenges in Information Science (RCIS).

[10]  Zhongmei Zhang,et al.  Application research of Hadoop resource monitoring system based on Ganglia and Nagios , 2013, 2013 IEEE 4th International Conference on Software Engineering and Service Science.

[11]  Patrick Valduriez,et al.  Multistore Big Data Integration with CloudMdsQL , 2016, Trans. Large Scale Data Knowl. Centered Syst..

[12]  Jennifer Widom,et al.  The Beckman Report on Database Research , 2014, SGMD.

[13]  Michael Stonebraker,et al.  SQL databases v. NoSQL databases , 2010, CACM.

[14]  Irena Holubová,et al.  Multi-model Data Management: What's New and What's Next? , 2017, EDBT.