Schema Evolution for XML: A Consistency-Preserving Approach

This paper deals with updates of XML documents that satisfy a given schema, e.g., a DTD. When an update violates the schema, the update may nonetheless be accepted, which requires changing the schema. Our method is intended for a data administrator who is an expert in the application domain of the database but who need not be a computer science expert. The approach proposes several candidate schemas derived from the original one. The method is consistency-preserving: documents that are valid with respect to the original schema remain valid with respect to each proposed schema. Schema evolution is implemented by an algorithm called GREC, which modifies the graph of a finite state automaton and generates regular expressions for the modified graphs. Each regular expression proposed by GREC is a schema option offered to the administrator.
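
To make the consistency-preserving requirement concrete, the following is a minimal sketch and not the GREC algorithm itself. It assumes a content model can be written as a regular expression over one-letter element names, and it replaces a true language-inclusion test with a bounded enumeration of words. The names `words_up_to`, `preserves_consistency`, and `acceptable_candidates`, as well as the sample content models, are hypothetical and chosen only for illustration.

```python
import re
from itertools import product


def words_up_to(alphabet, max_len):
    """Yield every word over `alphabet` of length 0..max_len."""
    for n in range(max_len + 1):
        for letters in product(alphabet, repeat=n):
            yield "".join(letters)


def preserves_consistency(old, new, alphabet, max_len=6):
    """Bounded check (an assumption, not a proof): every word of length
    <= max_len accepted by `old` must also be accepted by `new`."""
    old_re, new_re = re.compile(old), re.compile(new)
    return all(
        new_re.fullmatch(word)
        for word in words_up_to(alphabet, max_len)
        if old_re.fullmatch(word)
    )


def acceptable_candidates(old, candidates, updated_word, alphabet):
    """Keep candidates that accept the updated child sequence and still
    accept the (bounded) language of the original content model."""
    return [
        cand
        for cand in candidates
        if re.fullmatch(cand, updated_word)
        and preserves_consistency(old, cand, alphabet)
    ]


# Hypothetical example: the original content model is a b*, and an update
# inserts a new element c after a, producing the child sequence "acb".
OLD = "ab*"
UPDATED = "acb"
CANDIDATES = ["ac?b*", "a(b|c)*", "acb*"]

if __name__ == "__main__":
    print(acceptable_candidates(OLD, CANDIDATES, UPDATED, "abc"))
```

In this sketch the candidate `acb*` is rejected because it makes `c` mandatory and so invalidates documents such as `a` and `ab` that were valid before the update. The other two candidates accept the update and keep the old documents valid, so both could be shown to the administrator as options.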
