Validity-Sensitive Querying of XML Databases Extended Abstract †

We consider the problem of querying XML documents which are not valid with respect to given DTDs. We propose a framework for measuring the invalidity of XML documents and compactly representing minimal repairing scenarios. Furthermore, we present a validity-sensitive method of querying XML documents, which extracts more information from invalid XML documents than does the standard query evaluation. Finally, we provide experimental results which validate our approach.

[1]  Michael Benedikt,et al.  XPath satisfiability in the presence of DTDs , 2008, JACM.

[2]  Jennifer Widom,et al.  Change detection in hierarchically structured information , 1996, SIGMOD '96.

[3]  Martín Abadi,et al.  Security analysis of cryptographically controlled access to XML documents , 2005, PODS '05.

[4]  Steven J. DeRose,et al.  XML Path Language (XPath) , 1999 .

[5]  Sergio Greco,et al.  Repairs and Consistent Answers for XML Data with Functional Dependencies , 2003, Xsym.

[6]  Renée J. Miller,et al.  ConQuer: efficient management of inconsistent databases , 2005, SIGMOD '05.

[7]  Matthias Jarke,et al.  Advances in Database Technology — EDBT 2002 , 2002, Lecture Notes in Computer Science.

[8]  Sergio Greco,et al.  Querying and Repairing Inconsistent XML Data , 2005, WISE.

[9]  Stanley M. Selkow,et al.  The Tree-to-Tree Editing Problem , 1977, Inf. Process. Lett..

[10]  Cong Yu,et al.  Schema-Free XQuery , 2004, VLDB.

[11]  Frank Neven,et al.  Automata, Logic, and XML , 2002, CSL.

[12]  Maarten Marx,et al.  Conditional XPath, the first order complete XPath dialect , 2004, PODS.

[13]  Sudipto Guha,et al.  Approximate XML joins , 2002, SIGMOD '02.

[14]  Alex K. Simpson,et al.  Computational Adequacy in an Elementary Topos , 1998, CSL.

[15]  Jeffrey D. Ullman,et al.  Introduction to automata theory, languages, and computation, 2nd edition , 2001, SIGA.

[16]  Neoklis Polyzotis,et al.  Approximate XML query answers , 2004, SIGMOD '04.

[17]  Thomas Lukasiewicz Proceedings of the 7th International Symposium on the Foundations of Information and Knowledge Systems‚ FoIKS 2012‚ Kiel‚ Germany‚ March 5−9‚ 2012 , 2000 .

[18]  Rajeev Rastogi,et al.  A cost-based model and effective heuristic for repairing constraints by value modification , 2005, SIGMOD '05.

[19]  Alex Thomo,et al.  Query containment and rewriting using views for regular path queries under constraints , 2003, PODS.

[20]  Kaizhong Zhang,et al.  Approximate tree pattern matching , 1997 .

[21]  Moshe Y. Vardi The complexity of relational query languages (Extended Abstract) , 1982, STOC '82.

[22]  H. V. Jagadish,et al.  Evaluating Structural Similarity in XML Documents , 2002, WebDB.

[23]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[24]  Kaizhong Zhang,et al.  Tree pattern matching , 1997, Pattern Matching Algorithms.

[25]  Thomas Schwentick,et al.  XML: Model, Schemas, Types, Logics, and Queries , 2003, Logics for Emerging Applications of Databases.

[26]  Gunter Saake,et al.  Logics for Emerging Applications of Databases , 2003, Springer Berlin Heidelberg.

[27]  Graham Cormode,et al.  The string edit distance matching problem with moves , 2002, SODA '02.

[28]  Serge Abiteboul,et al.  Incremental Maintenance for Materialized Views over Semistructured Data , 1998, VLDB.

[29]  Alex Thomo,et al.  Query Answering and Containment for Regular Path Queries under Distortions , 2004, FoIKS.

[30]  Wee Hyong Tok,et al.  Data cleaning and XML: the DBLP experience , 2002, Proceedings 18th International Conference on Data Engineering.

[31]  Nobutaka Suzuki,et al.  Finding an optimum edit script between an XML document and a DTD , 2005, SAC '05.

[32]  Yannis Papakonstantinou,et al.  Incremental validation of XML documents , 2003, TODS.

[33]  Jan Chomicki,et al.  Consistent query answers in inconsistent databases , 1999, PODS '99.

[34]  Serge Abiteboul,et al.  Detecting changes in XML documents , 2002, Proceedings 18th International Conference on Data Engineering.

[35]  Michel de Rougemont,et al.  Correctors for XML Data , 2004, XSym.

[36]  Alfred V. Aho,et al.  A Minimum Distance Error-Correcting Parser for Context-Free Languages , 1972, SIAM J. Comput..

[37]  Sihem Amer-Yahia,et al.  Tree Pattern Relaxation , 2002, EDBT.

[38]  Alberto H. F. Laender,et al.  Automatic web news extraction using tree edit distance , 2004, WWW '04.

[39]  Georg Gottlob,et al.  XPath processing in a nutshell , 2003, SGMD.

[40]  Anne H. H Ngu,et al.  Web Information Systems Engineering - WISE 2005, 6th International Conference on Web Information Systems Engineering, New York, NY, USA, November 20-22, 2005, Proceedings , 2005, WISE.