As web applications grow, the demands on query performance and scalability keep increasing. Relational databases struggle to meet these demands, and more and more applications are migrating to NoSQL databases such as HBase, whose high reliability, high performance, column-oriented storage, high scalability, and real-time reads and writes have made it popular. In this paper, we use two concepts, Nested and Multiple Nested, to describe the join-key reference relationships between tables in a relational database. Because HBase supports only one level of nesting, in which a column family contains column qualifiers, it is hard to handle Multiple Nested relationships after migrating data from a relational database while still ensuring high query performance in the resulting HBase schema. We propose a novel HBase schema for migrating from a relational database to HBase that supports Multiple Nested relationships. With this schema, any query needs to access only one table, and only once, to retrieve its data, which greatly improves query efficiency, especially for join queries. The schema also supports index migration, which speeds up queries further. Finally, this paper proposes a mapping algorithm that finds the HBase column name corresponding to an attribute in the relational database.
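To make the Nested and Multiple Nested idea concrete, the following is a minimal sketch, not the schema proposed in the paper. It assumes three hypothetical tables (`customers`, `orders`, `items`) linked by join keys, uses plain Python dicts in place of an HBase row, and invents a naming rule (`column_name`) that encodes the chain of nested keys in the column qualifier. All table names, family names, and the dotted qualifier format are assumptions made for illustration only.

```python
# Illustrative only: plain dicts stand in for relational tables and an HBase row.
# Table names, family names, and the qualifier format are assumptions.
customers = [{"id": "c1", "name": "Ann"}]
orders = [
    {"id": "o1", "customer_id": "c1", "total": 30},
    {"id": "o2", "customer_id": "c1", "total": 12},
]
items = [
    {"id": "i1", "order_id": "o1", "sku": "pen"},
    {"id": "i2", "order_id": "o1", "sku": "ink"},
    {"id": "i3", "order_id": "o2", "sku": "pad"},
]


def column_name(family, key_path, attribute):
    """Hypothetical mapping: encode the chain of nested keys in the qualifier."""
    return f"{family}:{'.'.join(key_path + [attribute])}"


def build_row(customer):
    """Fold a customer, its orders, and their items into one wide row."""
    row = {}
    for attr, value in customer.items():
        if attr != "id":
            row[column_name("customer", [], attr)] = value
    for order in orders:
        if order["customer_id"] != customer["id"]:
            continue
        # One level of nesting: orders under a customer.
        row[column_name("orders", [order["id"]], "total")] = order["total"]
        for item in items:
            if item["order_id"] == order["id"]:
                # Two levels of nesting: items under orders under a customer.
                path = [order["id"], item["id"]]
                row[column_name("items", path, "sku")] = item["sku"]
    return row


row = build_row(customers[0])
print(row["items:o1.i2.sku"])
```

In this sketch, a lookup that would need a three-way join in the relational form becomes a single read of one row, which is the kind of access pattern the abstract describes. How the paper's actual schema and mapping algorithm name columns is not stated here, so the dotted qualifier should be read as a stand-in.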