Intermediate Results Materialization Selection and Format for Data-Intensive Flows

[1]  Ashraf Aboulnaga,et al.  ReStore: Reusing Results of MapReduce Jobs , 2012, Proc. VLDB Endow..

[2]  Panos Vassiliadis,et al.  Deciding the physical implementation of ETL workflows , 2007, DOLAP '07.

[3]  Guoping Wang,et al.  Multi-Query Optimization in MapReduce Framework , 2013, Proc. VLDB Endow..

[4]  Jennifer Widom,et al.  Database Systems: The Complete Book , 2001 .

[5]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[6]  Sriram Padmanabhan,et al.  Determining Essential Statistics for Cost Based Optimization of an ETL Workflow , 2014, EDBT.

[7]  Anastasia Ailamaki,et al.  H2O: a hands-free adaptive store , 2014, SIGMOD Conference.

[8]  Stefan Deßloch,et al.  A Real-time Materialized View Approach for Analytic Flows in Hybrid Cloud Environments , 2014, Datenbank-Spektrum.

[9]  Yanpei Chen,et al.  Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads , 2012, Proc. VLDB Endow..

[10]  Amine Roukh,et al.  Eco-DMW: Eco-Design Methodology for Data warehouses , 2015, DOLAP.

[11]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[12]  Xiaoyong Du,et al.  Wide Table Layout Optimization based on Column Ordering and Duplication , 2017, SIGMOD Conference.

[13]  Ryan Johnson,et al.  Here are my Data Files. Here are my Queries. Where are my Results? , 2011, CIDR.

[14]  WenAn Tan,et al.  Trust Services-Oriented Multi-Objects Workflow Scheduling Model for Cloud Computing , 2012, ICPCA/SWS.

[15]  Jeffrey D. Ullman,et al.  Optimizing Multiway Joins in a Map-Reduce Environment , 2011, IEEE Transactions on Knowledge and Data Engineering.

[16]  Dimitri Theodoratos,et al.  A general framework for the view selection problem for data warehouse design and evolution , 2000, DOLAP '00.

[17]  Sam Lightstone,et al.  DB2 with BLU Acceleration: So Much More than Just a Column Store , 2013, Proc. VLDB Endow..

[18]  Timos K. Sellis,et al.  Data Warehouse Configuration , 1997, VLDB.

[19]  Wolfgang Lehner,et al.  ResilientStore: A Heuristic-Based Data Format Selector for Intermediate Results , 2016, MEDI.

[20]  Jorge-Arnulfo Quiané-Ruiz,et al.  Trojan data layouts: right shoes for a running elephant , 2011, SoCC.

[21]  Alberto Abelló,et al.  Incremental Consolidation of Data-Intensive Multi-Flows , 2016, IEEE Transactions on Knowledge and Data Engineering.

[22]  Kevin Wilkinson,et al.  Revisiting ETL Benchmarking: The Case for Hybrid Flows , 2012, TPCTC.

[23]  Wolfgang Lehner,et al.  SAP HANA database: data management for modern business applications , 2012, SGMD.

[24]  Inderpal Singh Mumick,et al.  Selection of Views to Materialize in a Data Warehouse , 2005, IEEE Trans. Knowl. Data Eng..

[25]  Laurent d'Orazio,et al.  Cost models for view materialization in the cloud , 2012, EDBT-ICDT '12.

[26]  Surajit Chaudhuri,et al.  Database Tuning Advisor for Microsoft SQL Server 2005 , 2004, VLDB.

[27]  Vladimir Vlassov,et al.  m2r2: A Framework for Results Materialization and Reuse in High-Level Dataflow Systems for Big Data , 2013, 2013 IEEE 16th International Conference on Computational Science and Engineering.

[28]  Anastasia Ailamaki,et al.  ReCache: Reactive Caching for Fast Analytics over Heterogeneous Data , 2017, Proc. VLDB Endow..

[29]  Georgia Kougka,et al.  Practical algorithms for execution engine selection in data flows , 2015, Future Gener. Comput. Syst..

[30]  David J. DeWitt,et al.  Split query processing in polybase , 2013, SIGMOD '13.

[31]  Oleksandr Romanko,et al.  Normalization and Other Topics in Multi­Objective Optimization , 2006 .

[32]  Astrid Rheinländer,et al.  Opening the Black Boxes in Data Flow Optimization , 2012, Proc. VLDB Endow..

[33]  Kevin Wilkinson,et al.  Engine independence for logical analytic flows , 2014, 2014 IEEE 30th International Conference on Data Engineering.

[34]  Alberto Abelló,et al.  A Unified View of Data-Intensive Flows in Business Intelligence Systems: A Survey , 2016, Trans. Large Scale Data Knowl. Centered Syst..

[35]  Alok Aggarwal,et al.  The input/output complexity of sorting and related problems , 1988, CACM.

[36]  Kevin Wilkinson,et al.  QoX-driven ETL design: reducing the cost of ETL consulting engagements , 2009, SIGMOD Conference.

[37]  Timos K. Sellis,et al.  Dynamic Data Warehouse Design , 1999, DaWaK.

[38]  Jeffrey D. Ullman,et al.  Implementing data cubes efficiently , 1996, SIGMOD '96.