Transformation Lifecycle Management with Nautilus

1. THE TRANSFORMATION LIFECYCLE When developing data transformations—a task omnipresent in applications like data integration, data migration, data cleaning, or scientific data processing—developers quickly face the need to verify the semantic correctness of the transformation. Declarative specifications of data transformations, e.g. SQL or ETL tools, increase developer productivity but usually provide limited or no means for inspection or debugging. In this situation, developers today have no choice but to manually analyze the transformation and, in case of an error, to (repeatedly) fix and test the transformation.

[1]  Quoc Trung Tran,et al.  How to ConQueR why-not questions , 2010, SIGMOD Conference.

[2]  Wang Chiew Tan,et al.  Debugging schema mappings with routes , 2006, VLDB.

[3]  Cong Yu,et al.  Semantic Adaptation of Schema Mappings when Schemas Evolve , 2005, VLDB.

[4]  Jeffrey F. Naughton,et al.  On the provenance of non-answers to queries over extracted data , 2008, Proc. VLDB Endow..

[5]  Juliana Freire,et al.  Provenance and scientific workflows: challenges and opportunities , 2008, SIGMOD Conference.

[6]  Adriane Chapman,et al.  Why Not? , 1965, SIGMOD Conference.

[7]  Carlo Curino,et al.  Graceful database schema evolution: the PRISM workbench , 2008, Proc. VLDB Endow..

[8]  Dan Suciu,et al.  The Complexity of Causality and Responsibility for Query Answers and non-Answers , 2010, Proc. VLDB Endow..

[9]  Joseph M. Hellerstein,et al.  Potter's Wheel: An Interactive Data Cleaning System , 2001, VLDB.

[10]  Torsten Grust,et al.  True language-level SQL debugging , 2011, EDBT/ICDT '11.

[11]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[12]  Renée J. Miller,et al.  Muse: Mapping Understanding and deSign by Example , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[13]  Christopher Olston,et al.  Generating example data for dataflow programs , 2009, SIGMOD Conference.

[14]  Umeshwar Dayal,et al.  On the correct translation of update operations on relational views , 1982, TODS.

[15]  Elke A. Rundensteiner,et al.  A Transparent Schema-Evolution System Based on Object-Oriented View Technology , 1997, IEEE Trans. Knowl. Data Eng..

[16]  Laura M. Haas,et al.  Data-driven understanding and refinement of schema mappings , 2001, SIGMOD '01.

[17]  S. Sudarshan,et al.  X-data: Generating test data for killing SQL mutants , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[18]  Melanie Herschel,et al.  Explaining missing answers to SPJUA queries , 2010, Proc. VLDB Endow..

[19]  James Cheney,et al.  Provenance in Databases: Why, How, and Where , 2009, Found. Trends Databases.

[20]  Parag Agrawal,et al.  Trio: a system for data, uncertainty, and lineage , 2006, VLDB.

[21]  Frank Wm. Tompa,et al.  Efficiently updating materialized views , 1986, SIGMOD '86.