With the rapid growth of users and data volume, the state-of-the-art remotely sensed databases have been posed on grand challenges by the online geospatial applications. They cannot provide low-latency retrieval of big remotely sensed data and fail to adapt for various access patterns from concurrent users. We propose an adaptive hierarchical caching scheme called RSCache for remotely sensed database to overcome the deficiencies. The big data are split into large amounts of small tiles and stored as key-value objects in NoSQL database with Hilbert order. Moreover, a distributed object placement model is designed for different access patterns with consideration of spatio-temporal proximity. Besides, RSCache forms a large memory cache by collecting the exclusive memory pools allocated by cluster nodes and provides global unified view of data manipulation from application's perspective. We implement RSCache middleware prototype on scale-out HBase cluster and confirm its effectiveness with comprehensive experiments in real applications.
[1]
Suprio Ray,et al.
Surveying the landscape: an in-depth analysis of spatial database workloads
,
2012,
SIGSPATIAL/GIS.
[2]
Komal Shringare,et al.
Apache Hadoop Goes Realtime at Facebook
,
2015
.
[3]
David R. Karger,et al.
Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web
,
1997,
STOC '97.
[4]
Lionel M. Ni,et al.
CloST: a hadoop-based storage system for big spatio-temporal data analytics
,
2012,
CIKM '12.
[5]
David J. DeWitt,et al.
Turbocharging DBMS buffer pool using SSDs
,
2011,
SIGMOD '11.