Design of a More Scalable Database System

With the development of cloud computing and internet, e-Commerce, e-Business and corporate world revenue are increasing with high rate. These areas require scalable and consistent databases. NoSQL databases such as HBase has been proven to scalability and well performance on cloud computing platforms. However, the inevitable special data with few increment and frequent access leads to hotspot data and unbalanced accessing distribution between data storage servers. Due to their properties, these data often cannot be stored in multiple tables. Some storage nodes become the bottleneck of the distributed storage system, therefore, it becomes difficult to improve the performance by increasing the number of nodes which severely limits the scalability of the storage system. In order to make the performance of the cluster increases with the size of the cluster simultaneously, we devise a new distributed database storage framework to solve those issues mentioned above by changing the storage and read-write mode of the hotspot data. This structure guarantees that the hotspot data will not aggregate in the same storage node, as it guarantees that the data is not too hot in a single storage node. We implement the scalable database based on Apache HBase, which achieve almost double performance of throughput considering heavy read-write pressure situation only with double reading substites. Besides, heavy load node owing to hotspot data will no longer present in the new distributed database.

[1]  Prashant Malik,et al.  Cassandra: structured storage system on a P2P network , 2009, PODC '09.

[2]  Xue Liu,et al.  HBaseMQ: A distributed message queuing system on clouds with HBase , 2013, 2013 Proceedings IEEE INFOCOM.

[3]  Dhabaleswar K. Panda,et al.  High-Performance Design of HBase with RDMA over InfiniBand , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[4]  Divyakant Agrawal,et al.  MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware Services , 2011, 2011 IEEE 12th International Conference on Mobile Data Management.

[5]  Wilson C. Hsieh,et al.  Bigtable: A Distributed Storage System for Structured Data , 2006, TOCS.

[6]  Guan Le,et al.  Survey on NoSQL database , 2011, 2011 6th International Conference on Pervasive Computing and Applications.

[7]  Chiranjeev Kumar,et al.  A scalable generic transaction model scenario for distributed NoSQL databases , 2015, J. Syst. Softw..

[8]  Pangfeng Liu,et al.  HSQL: A Highly Scalable Cloud Database for Multi-user Query Processing , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[9]  Cristopher Moore,et al.  An n-Dimensional Generalization of the Rhombus Tiling , 2001, DM-CCG.