Implementation of Data Transform Method into NoSQL Database for Healthcare Data

Currently, most health care systems used among divisions in medical centers still adopt the Excel file format for a variety of scales statistics, such as the clinical self-care ability scale for Functional Independence Measure. Although people can further analyze Excel files using other statistical analysis software, such as SAS, SPSS, and STATA, they cannot effectively share the archived data in Excel among divisions. We propose to do format conversion on these data and store them in a database. As the collection of Excel files cannot be shared with ease, we plan to use HBase, a non-relational database, to further integrate data. The purpose of this paper is to construct complete import tools and solutions based on HBase to facilitate easy access of data in HBase. Besides, a visual interface is also used to manage HBase to implement user friendly client connection tools for the HBase database.

[1]  Chao-Tung Yang,et al.  Implementation of a Distributed Data Storage System with Resource Monitoring on Cloud Computing , 2012, GPC.

[2]  Eleni Stroulia,et al.  Enhancing Query Support in HBase via an Extended Coprocessors Framework , 2011, ServiceWave.

[3]  Divyakant Agrawal,et al.  $\mathcal{MD}$-HBase: design and implementation of an elastic data infrastructure for cloud-scale location services , 2012, Distributed and Parallel Databases.

[4]  Chao-Tung Yang,et al.  Implementation of a Cloud Computing Environment for Hiding Huge Amounts of Data , 2010, International Symposium on Parallel and Distributed Processing with Applications.

[5]  Olivier Curé,et al.  Data Integration over NoSQL Stores Using Access Path Based Mappings , 2011, DEXA.

[6]  Divyakant Agrawal,et al.  Database Scalability, Elasticity, and Autonomy in the Cloud - (Extended Abstract) , 2011, DASFAA.

[7]  Chao-Tung Yang,et al.  A Medical Image File Accessing System with Virtualization Fault Tolerance on Cloud , 2012, GPC.

[8]  Hai Jin,et al.  VESS: An Unstructured Data-Oriented Storage System for Multi-Disciplined Virtual Experiment Platform , 2011 .

[9]  Feng Zhu,et al.  A Fast and High Throughput SQL Query System for Big Data , 2012, WISE.

[10]  Jianling Sun,et al.  Scalable RDF store based on HBase and MapReduce , 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE).

[11]  Daniel J. Abadi,et al.  Column oriented Database Systems , 2009, Proc. VLDB Endow..

[12]  Hans De Sterck,et al.  Supporting multi-row distributed transactions with global snapshot isolation using bare-bones HBase , 2010, 2010 11th IEEE/ACM International Conference on Grid Computing.

[13]  Shan Wang,et al.  LinearDB: A Relational Approach to Make Data Warehouse Scale Like MapReduce , 2011, DASFAA.