Logical Foundations of Database Transformations for Complex-Value Databases

Database transformations consist of queries and updates which are two fundamental types of computations in any databases - the first provides the capability to retrieve data and the second is used to maintain databases in light of ever-changing application domains. With the rising popularity of web-based applications and service-oriented architectures, the development of database transformations must address new challenges, which frequently call for establishing a theoretical framework that unifies both queries and updates over complex-value databases. This dissertation aims to lay down the foundations for establishing a theoretical framework of database transformations in the context of complex-value databases. We shall use an approach that has successfully been used for the characterisation of sequential algorithms. The sequential Abstract State Machine (ASM) thesis captures semantics and behaviour of sequential algorithms. The thesis uses the similarity of general computations and database transformations for characterisation of the later by five postulates: sequential time postulate, abstract state postulate, bounded exploration postulate, background postulate, and the bounded non-determinism postulate. The last two postulates reflect the specific form of transformations for databases. The five postulates exactly capture database transformations. Furthermore, we provide a logical proof system for database transformations that is sound and complete.

[1]  Andreas Blass,et al.  Interactive Small-Step Algorithms I: Axiomatization , 2007, Log. Methods Comput. Sci..

[2]  Max J. Cresswell,et al.  A New Introduction to Modal Logic , 1998 .

[3]  Joachim Hammer,et al.  Updatex---an xquery-based language for processing updates in xml , 2004 .

[4]  Jan Paredaens,et al.  On the Expressive Power of the Relational Algebra , 1978, Inf. Process. Lett..

[5]  Bernhard Thalheim,et al.  Fundamental Concepts of Object Oriented Databases , 1993, Acta Cybern..

[6]  Nikolai Tillmann,et al.  Partial Updates Exploration II , 2003, Abstract State Machines.

[7]  Michael Kifer,et al.  An Overview of Transaction Logic , 1994, Theor. Comput. Sci..

[8]  A. Turing On Computable Numbers, with an Application to the Entscheidungsproblem. , 1937 .

[9]  Roel Wieringa,et al.  Dynamic Database Logic: the First-order Case , 1992, FMLDO.

[10]  Alin Deutsch,et al.  A Query Language for XML , 1999, Comput. Networks.

[11]  Andreas Blass,et al.  Abstract state machines capture parallel algorithms: Correction and extension , 2006, TOCL.

[12]  David Harel,et al.  Computable Queries for Relational Data Bases , 1980, J. Comput. Syst. Sci..

[13]  Serge Abiteboul,et al.  Object identity as a query language primitive , 1989, SIGMOD '89.

[14]  Roel Wieringa,et al.  Regular database update logics , 2001, Theor. Comput. Sci..

[15]  Egon Börger,et al.  Java and the Java Virtual Machine: Definition, Verification, Validation , 2001 .

[16]  Moshe Y. Vardi The complexity of relational query languages (Extended Abstract) , 1982, STOC '82.

[17]  Yuri Gurevich,et al.  Partially Ordered Runs: A Case Study , 2000, Abstract State Machines.

[18]  Andreas Blass,et al.  Persistent Queries , 2008, ArXiv.

[19]  Lauri Hella,et al.  Logics with aggregate operators , 2001, JACM.

[20]  Ronald Fagin,et al.  Finite-Model Theory - A Personal Perspective , 1990, Theor. Comput. Sci..

[21]  E. F. Codd,et al.  Further Normalization of the Data Base Relational Model , 1971, Research Report / RJ / IBM / San Jose, California.

[22]  Nikolai Tillmann,et al.  Partial updates , 2005, Theor. Comput. Sci..

[23]  Serge Abiteboul,et al.  A translation language complete for database update and specification , 1987, PODS '87.

[24]  José Maria Turull Torres Relational Databases and Homogeneity in Logics with Counting , 2002, Acta Cybern..

[25]  Yuri Gurevich Intra-step Interaction , 2004, Abstract State Machines.

[26]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[27]  Qing Wang,et al.  XML Machines , 2009, ER Workshops.

[28]  Andreas Blass,et al.  Abstract state machines capture parallel algorithms , 2003, TOCL.

[29]  Andreas Blass,et al.  Ordinary interactive small-step algorithms, III , 2007, TOCL.

[30]  Wolfgang Reisig On Gurevich's theorem on sequential algorithms , 2003, Acta Informatica.

[31]  Bernhard Thalheim,et al.  Validation of streaming XML documents with abstract state machines , 2008, iiWAS.

[32]  Thomas Wilke,et al.  An Algebraic Characterization of Frontier Testable Tree Languages , 1996, Theor. Comput. Sci..

[33]  John Doner,et al.  Tree Acceptors and Some of Their Applications , 1970, J. Comput. Syst. Sci..

[34]  Klaus-Dieter Schewe,et al.  A multi-level architecture for distributed object bases , 2007, Data Knowl. Eng..

[35]  Elvinia Riccobene,et al.  A Realistic Environment for Crypto-Protocol Analyses by ASMs , 1998, Workshop on Abstract State Machines.

[36]  Roel Wieringa,et al.  Aziomatization, Declarative Semantics and Operational Semantics of Passive and Active Updates in Logic Databases , 1995, J. Log. Comput..

[37]  Arnold Schönhage,et al.  Storage Modification Machines , 1979, SIAM J. Comput..

[38]  Serge Abiteboul,et al.  Computing with Infinitary Logic , 1995, Theor. Comput. Sci..

[39]  Stanislas Nanchen Verifying abstract state machines , 2007 .

[40]  David Beech,et al.  XML-Schema Part 1: Structures Second Edition , 2004 .

[41]  Bernhard Thalheim,et al.  XML Database Transformations with Tree Updates , 2008, ABZ.

[42]  Johann A. Makowsky,et al.  Query Languages for Hierarchic Databases , 1992, Inf. Comput..

[43]  Catriel Beeri,et al.  On genericity and parametricity (extended abstract) , 1996, PODS '96.

[44]  Marc Andries,et al.  On Instance-Completeness for Database Query Languages involving Object Creation , 1996, J. Comput. Syst. Sci..

[45]  James W. Thatcher,et al.  Generalized finite automata theory with an application to a decision problem of second-order logic , 1968, Mathematical systems theory.

[46]  Andreas Blass,et al.  Background, Reserve, and Gandy Machines , 2000, CSL.

[47]  Christos H. Papadimitriou,et al.  Reflective Relational Machines , 1998, Inf. Comput..

[48]  Saharon Shelah,et al.  Choiceless Polynomial Time , 1997, Ann. Pure Appl. Log..

[49]  J. Büchi Weak Second‐Order Arithmetic and Finite Automata , 1960 .

[50]  V. A. Uspenski,et al.  On the Definition of an Algorithm , 1963 .

[51]  Herbert B. Enderton,et al.  A mathematical introduction to logic , 1972 .

[52]  Michael Kifer,et al.  The State of Change: A Survey , 1996, Transactions and Change in Logic Databases.

[53]  Sara Cohen,et al.  User-defined aggregate functions: bridging theory and practice , 2006, SIGMOD Conference.

[54]  Balder ten Cate,et al.  XPath, transitive closure logic, and nested tree walking automata , 2008, PODS.

[55]  Balder ten Cate,et al.  The expressivity of XPath with transitive closure , 2006, PODS.

[56]  E. F. Codd,et al.  Relational Completeness of Data Base Sublanguages , 1972, Research Report / RJ / IBM / San Jose, California.

[57]  Akifumi Makinouchi,et al.  A Consideration on Normal Form of Not-Necessarily-Normalized Relation in the Relational Data Model , 1977, VLDB.

[58]  Wolfram Schulte,et al.  Semantic essence of AsmL , 2003, Theor. Comput. Sci..

[59]  Andreas Blass,et al.  Interactive Small-Step Algorithms II: Abstract State Machines and the Characterization Theorem , 2007, Log. Methods Comput. Sci..

[60]  Neil Immerman,et al.  Relational Queries Computable in Polynomial Time , 1986, Inf. Control..

[61]  Bernhard Thalheim,et al.  A theory of data-intensive software services , 2009, Service Oriented Computing and Applications.

[62]  Gerard R. Renardel de Lavalette,et al.  Reasoning About Dynamic Features in Specification Languages - A Modal View on Creation and Modification , 1993, Semantics of Specification Languages.

[63]  Noga Alon,et al.  XML with data values: typechecking revisited , 2003, J. Comput. Syst. Sci..

[64]  Serge Abiteboul,et al.  Generic Computation and its complexity , 1991, STOC '91.

[65]  Victor Vianu,et al.  Validating streaming XML documents , 2002, PODS.

[66]  Markus Kirchberg,et al.  Using Abstract State Machines to Model ARIES-based Transaction Processing , 2009, J. Univers. Comput. Sci..

[67]  Michael Kifer,et al.  Database Programming in Transaction Logic , 1993, DBPL.

[68]  Klaus-Dieter Schewe,et al.  Towards a Logic for Abstract MetaFinite State Machines , 2008, FoIKS.

[69]  Yannis Papakonstantinou,et al.  DTD inference for views of XML data , 2000, PODS.

[70]  Arvind Malhotra,et al.  XML Schema Part 2: Datatypes Second Edition , 2004 .

[71]  José Maria Turull Torres On the expressibility and the computability of untyped queries , 2001, Ann. Pure Appl. Log..

[72]  E. F. Codd Derivability, redundancy and consistency of relations stored in large data banks , 2009, SGMD.

[73]  G. R. D. Lavalette A Logic of Modification and Creation , 2001 .

[74]  Saharon Shelah,et al.  On polynomial time computation over unordered structures , 2001, Journal of Symbolic Logic.

[75]  Bernhard Thalheim,et al.  Abstract State Services , 2008, ER Workshops.

[76]  Bernhard Thalheim,et al.  Updates, Schema Updates and Validation of XML Documents - Using Abstract State Machines with Automata-Defined States , 2009 .

[77]  Victor Vianu,et al.  Database method schemas and object creation , 1993, PODS '93.

[78]  Michael Benedikt,et al.  Adding Updates to XQuery: Semantics, Optimization, and Static Analysis , 2005, XIME-P.

[79]  Michael Kifer,et al.  Transaction Logic Programming , 1993, ICLP.

[80]  Dean Rosenzweig,et al.  Privacy, Abstract Encryption and Protocols: An ASM Model - Part I , 2003, Abstract State Machines.

[81]  Jianwen Su,et al.  Algebraic and Calculus Query Languages for Recursively Typed Complex Objects , 1993, J. Comput. Syst. Sci..

[82]  Jan Van den Bussche,et al.  Abstract State Machines and Computationally Complete Query Languages , 2000, Abstract State Machines.

[83]  Klaus-Dieter Schewe,et al.  An abstract model for service provision, search and composition , 2009, 2009 IEEE Asia-Pacific Services Computing Conference (APSCC).

[84]  Erich Grädel,et al.  Finite Model Theory and Descriptive Complexity , 2007 .

[85]  Murali Mani,et al.  Taxonomy of XML schema languages using formal language theory , 2005, TOIT.

[86]  Marc Gyssens,et al.  A graph-oriented object database model , 1990, IEEE Trans. Knowl. Data Eng..

[87]  Zsolt Németh,et al.  A Formal Framework for Defining Grid Systems , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[88]  François Bancilhon,et al.  On the Completeness of Query Languages for Relational Data Bases , 1978, MFCS.

[89]  Alfred V. Aho,et al.  Universality of data retrieval languages , 1979, POPL.

[90]  Dieter Fensel,et al.  MLPM: Defining a Semantics and Axiomatization for Specifying the Reasoning Process of Knowleged-based Systems , 1996, ECAI.

[91]  Robert F. Stärk,et al.  A Logic for Abstract State Machines , 2001, J. Univers. Comput. Sci..

[92]  David Schach,et al.  XML Query Language (XQL) , 1998, QL.

[93]  Serge Abiteboul,et al.  Datalog Extensions for Database Queries and Updates , 1991, J. Comput. Syst. Sci..

[94]  Joost Engelfriet,et al.  Characterization of Properties and Relations defined in Monadic Second Order Logic on the Nodes of T , 1997 .

[95]  Tatiana Yavorskaya,et al.  On Bounded Exploration and Bounded Nondeterminism , 2006 .

[96]  Serge Abiteboul,et al.  Fixpoint logics, relational machines, and computational complexity , 1997, JACM.

[97]  Klaus-Dieter Schewe,et al.  Axiomatization of Database Transformations , 2007 .

[98]  Dongwon Lee,et al.  Comparative analysis of six XML schema languages , 2000, SGMD.

[99]  Jan Gilbert Van Den Bussche Formal aspects of object identity in database manipulation , 1993 .

[100]  Serge Abiteboul,et al.  The power of reflective relational machines , 1994, Proceedings Ninth Annual IEEE Symposium on Logic in Computer Science.

[101]  Per-Åke Larson,et al.  Eager Aggregation and Lazy Aggregation , 1995, VLDB.

[102]  Dan Suciu Domain-Independent Queries on Databases with External Functions , 1995, ICDT.

[103]  Serge Abiteboul,et al.  Computing with First-Order Logic , 1995, J. Comput. Syst. Sci..

[104]  Wolfgang Reisig,et al.  An ASM-Characterization of a Class of Distributed Algorithms , 2009, Rigorous Methods for Software Construction and Analysis.

[105]  Yuri Gurevich,et al.  Metafinite Model Theory , 1994, LCC.

[106]  Patrick C. Fischer,et al.  Nested Relational Structures , 1986, Adv. Comput. Res..

[107]  Neil Immerman,et al.  An optimal lower bound on the number of variables for graph identification , 1992, Comb..

[108]  Margus Veanes,et al.  Universal Plug and Play Machine Models , 2002, DIPES.

[109]  Andreas Blass,et al.  Ordinary interactive small-step algorithms, II , 2007, TOCL.

[110]  Jan Van den Bussche,et al.  Semi-determinism , 1992, PODS.

[111]  Yuri Gurevich,et al.  Evolving algebras 1993: Lipari guide , 1995, Specification and validation methods.

[112]  Dieter Fensel,et al.  A Survey of Languages for Specifying Dynamics: A Knowledge Engineering Perspective , 2001, IEEE Trans. Knowl. Data Eng..

[113]  Jan Van den Bussche Tree-structured object creation in database transformations , 2007 .

[114]  Jennifer Widom,et al.  The Lorel query language for semistructured data , 1997, International Journal on Digital Libraries.

[115]  Andreas Blass,et al.  Background of Computation , 2007, Bull. EATCS.

[116]  Jan Van den Bussche,et al.  Non-deterministic aspects of database transformations involving object creation , 1993 .

[117]  Martin Otto,et al.  The expressive power of fixed-point logic with counting , 1996, Journal of Symbolic Logic.

[118]  Victor Vianu,et al.  Procedural Languages for Database Queries and Updates , 1990, J. Comput. Syst. Sci..

[119]  Egon Börger,et al.  Abstract State Machines. A Method for High-Level System Design and Analysis , 2003 .

[120]  Yuri Gurevich Interactive Algorithms 2005 , 2005, MFCS.

[121]  Mark Levene,et al.  The Nested Universal Relation Database Model , 1992, Lecture Notes in Computer Science.

[122]  Leonid Libkin,et al.  Logics with counting and local properties , 2000, TOCL.

[123]  Bernhard Thalheim,et al.  Identification as a Primitive of Database Models , 1998, FMLDO.

[124]  E. F. CODD,et al.  A relational model of data for large shared data banks , 1970, CACM.

[125]  Andreas Blass,et al.  Ordinary interactive small-step algorithms, I , 2006, TOCL.

[126]  Gerard Renardel de Lavalette,et al.  A Formalisation of Evolving Algebras , 1995 .

[127]  Jan Van den Bussche,et al.  Expressiveness of Efficient Semi-Deterministic Choice Constructs , 1994, ICALP.

[128]  Stefano Ceri,et al.  Comparative analysis of five XML query languages , 1999, SGMD.

[129]  Christopher Ré,et al.  XQuery!: An XML Query Language with Side Effects , 2006, EDBT Workshops.

[130]  Jan Van den Bussche,et al.  On the completeness of object-creating database transformation languages , 1997, JACM.

[131]  Margus Veanes,et al.  Rich Sequential-Time ASMs , 2000 .

[132]  Hubert Comon,et al.  Tree automata techniques and applications , 1997 .

[133]  Arno Schonegge,et al.  Extending Dynamic Logic for Reasoning about Evolving Algebras , 1995 .

[134]  Martin Otto,et al.  Inductive Definability with Counting on Finite Structures , 1992, CSL.

[135]  Kenneth A. Ross,et al.  Efficient Incremental Evaluation of Queries with Aggregation , 1994, ILPS.

[136]  Mark Aron Roth Theory of non-first normal form relational databases (null values, nesting) , 1986 .

[137]  Neil Immerman,et al.  Expressibility as a complexity measure: results and directions , 1987, SCT.

[138]  Yuri Gurevich,et al.  Sequential abstract-state machines capture sequential algorithms , 2000, TOCL.

[139]  Nikolai Tillmann,et al.  Partial Updates: Exploration , 2001, J. Univers. Comput. Sci..

[140]  Richard Hull,et al.  The Format Model: A Theory of database Organization , 1984, J. ACM.

[141]  Jianwen Su,et al.  Untyped sets, invention, and computable queries , 1989, PODS '89.