An update strategy for the WaterFowl RDF data store
暂无分享,去创建一个
The WaterFowl RDF Store is characterized by its high compression rate and a self-indexing approach. Both of these characteristics are due to its underlying architecture. Intuitively, it is based on a stack composed of two forms of Succinct Data Structures, namely bitmaps and wavelet trees. The ability to efficiently retrieve information from these structures is performed via a set of operations, i.e., rank, select and access, which are used by our query processor. The nice properties, e.g. compactness and efficient data retrieval, we have observed on our first experimentations come at the price of poor performances when insertions or deletions are required. For instance, a naive approach has a dramatic impact on the capacity to handle ABox updates. In this paper, we address this issue by proposing an update strategy which uses an hybrid wavelet tree (using both pointer-based and pointerless sub-wavelet trees).
[1] Gonzalo Navarro,et al. Fast, Small, Simple Rank/Select on Bitmaps , 2012, SEA.
[2] Gonzalo Navarro. Wavelet trees for all , 2014, J. Discrete Algorithms.
[3] Olivier Curé,et al. WaterFowl: A Compact, Self-indexed and Inference-Enabled Immutable RDF Store , 2014, ESWC.
[4] Martin L. Kersten,et al. Database Cracking , 2007, CIDR.