Generating polystore ingestion plans — A demonstration with the AWESOME system

AWESOME is a polystore system that enables a data analyst to create a data ingestion script that specifies how it should collect, organize, run a data-derivation pipeline and reports results of the analysis. The collected data can be stored in different component stores under AWESOME for subsequent secondary analysis. This paper demonstrates the process by which AWESOME analyzes the script to construct an efficient ingestion plan, an executable database-centric dataflow specification which populates the raw and all derived data into the polystore. The demonstration will show how changing the script will alter the ingestion plan using a combination of rules and ingestion cost estimation.

[1]  Patrick Valduriez,et al.  CloudMdsQL: querying heterogeneous cloud data stores with a common language , 2016, Distributed and Parallel Databases.

[2]  Subhasis Dasgupta,et al.  Analytics-driven data ingestion and derivation in the AWESOME polystore , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[3]  François Goasdoué,et al.  Mixed-instance querying: a lightweight integration architecture for data journalism , 2016, Proc. VLDB Endow..

[4]  Michael Stonebraker,et al.  The BigDAWG polystore system and architecture , 2016, 2016 IEEE High Performance Extreme Computing Conference (HPEC).

[5]  Dan Suciu,et al.  Demonstration of the Myria big data management service , 2014, SIGMOD Conference.