Ordering and Path Constraints over Semistructured Data

Constraints are a valuable tool for managing information. Feature constraints have been used for describing records in constraint programming (Aït-Kaci and Podelski, 1993; Smolka and Treinen, 1994) and record like structures in computational linguistics (Kaplan and Bresnan, 1982; Shieber, 1986). In this paper, we consider how constraint-based technology can be used to query and reason about semistructured data. The constraint system FT≤ (Müller et al., 1997) provides information ordering constraints interpreted over feature trees. Here, we show how a generalization of FT≤ combined with path constraints can be used to formally represent, state constraints, and reason about semistructured data. The constraint languages we propose provide possibilities to straightforwardly capture, for example, what it means for a tree to be a subtree or subsumed by another, or what it means for two paths to be divergent. We establish a logical semantics for our constraints thanks to axiom schemes presenting our first-order theory constraint system. We propose using the constraint systems for querying semistructured data.

[1]  Dan Suciu,et al.  A query language for a Web-site management system , 1997, SGMD.

[2]  Gert Smolka,et al.  Records for Logic Programming , 1994, J. Log. Program..

[3]  Weimin Du,et al.  Integration strategies in Pegasus object oriented multidatabase system , 1992, Proceedings of the Twenty-Fifth Hawaii International Conference on System Sciences.

[4]  Larry Wall,et al.  Learning Perl , 1993 .

[5]  Hassan Aït-Kaci,et al.  An Algebraic Semantics Approach to the Effective Resolution of Type Equations , 1986, Theor. Comput. Sci..

[6]  Rolf Backofen Regular Path Expressions in Feature Logic , 1994, J. Symb. Comput..

[7]  Guido Moerkotte,et al.  Evaluating queries with generalized path expressions , 1996, SIGMOD '96.

[8]  Joachim Niehren,et al.  Ordering Constraints over Feature Trees Expressed in Second-Order Monadic Logic , 2000, Inf. Comput..

[9]  Gert Smolka,et al.  A Feature-Based Constraint System for Logic Programming with Entailment , 1992, FGCS.

[10]  Catriel Beeri,et al.  A Logical Query Language for Hypermedia Systems , 1994, Inf. Sci..

[11]  Mark Johnson,et al.  Attribute-value logic and the theory of grammar , 1988 .

[12]  Steven J. DeRose,et al.  XML Path Language (XPath) , 1999 .

[13]  Serge Abiteboul,et al.  Regular path queries with constraints , 1997, J. Comput. Syst. Sci..

[14]  Robert T. Kasper,et al.  A Logical Semantics for Feature Structures , 1986, ACL.

[15]  R. G. G. Cattell,et al.  The Object Database Standard: ODMG-93 , 1993 .

[16]  David J. DeWitt,et al.  The Object-Oriented Database System Manifesto , 1994, Building an Object-Oriented Database System, The Story of O2.

[17]  Werner Nutt,et al.  Queries with incomplete answers over semistructured data , 1999, PODS '99.

[18]  Serge Abiteboul,et al.  Querying Semi-Structured Data , 1997, Encyclopedia of Database Systems.

[19]  William C. Rounds,et al.  Feature Logics , 1997, Handbook of Logic and Language.

[20]  Serge Abiteboul,et al.  From structured documents to novel query facilities , 1994, SIGMOD '94.

[21]  Richard Durbin,et al.  Acedb --- a c. elegans database: syntactic definitions for the acedb data base manager , 1992 .

[22]  Peter Buneman,et al.  Semistructured data , 1997, PODS.

[23]  Wenfei Fan,et al.  Path constraints on semistructured and structured data , 1998, PODS '98.

[24]  J. W. Lloyd,et al.  Foundations of logic programming; (2nd extended ed.) , 1987 .

[25]  David Jordan,et al.  The Object Database Standard: ODMG 2.0 , 1997 .

[26]  Hassan Aït-Kaci,et al.  LOGIN: A Logic Programming Language with Built-In Inheritance , 1986, J. Log. Program..

[27]  David Konopnicki,et al.  W3QS: A Query System for the World-Wide Web , 1995, VLDB.

[28]  Joachim Niehren,et al.  Ordering Constraints over Feature Trees , 1997, Constraints.

[29]  Gert Smolka,et al.  A Feature Constraint System for Logic Programming with Entailment , 1994, Theor. Comput. Sci..

[30]  Robert Milton Underwood Object-Oriented Database Technology , 2002 .

[31]  N. Curteanu Book Reviews: Lecture on Contemporary Syntactic Theories: An Introduction to Unification-Based Approaches to Grammar , 1987, CL.

[32]  Werner Nutt,et al.  On the expressivity of feature logics with negation, functional uncertainty, and sort equations , 1993, J. Log. Lang. Inf..

[33]  Laks V. S. Lakshmanan,et al.  A declarative language for querying and restructuring the Web , 1996, Proceedings RIDE '96. Sixth International Workshop on Research Issues in Data Engineering.

[34]  Diego Calvanese,et al.  Modeling and Querying Semi-Structured data , 1999, Netw. Inf. Syst. J..

[35]  Alberto O. Mendelzon,et al.  Querying the World Wide Web , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[36]  Ronald M. Kaplan,et al.  Lexical Functional Grammar A Formal System for Grammatical Representation , 2004 .

[37]  Gert Smolka,et al.  Feature-Constraint Logics for Unification Grammars , 1989, J. Log. Program..

[38]  Dan Suciu,et al.  Adding Structure to Unstructured Data , 1997, ICDT.

[39]  Dan Suciu,et al.  A query language and optimization techniques for unstructured data , 1996, SIGMOD '96.

[40]  Gert Smolkaz,et al.  Deenite Relations over Constraint Languages , 1988 .

[41]  Roy Goldman,et al.  Lore: a database management system for semistructured data , 1997, SGMD.

[42]  Jennifer Widom,et al.  The Lorel query language for semistructured data , 1997, International Journal on Digital Libraries.

[43]  Alberto O. Mendelzon,et al.  Expressing structural hypertext queries in graphlog , 1989, Hypertext.

[44]  Kevin Chen-Chuan Chang,et al.  Boolean Query Mapping Across Heterogeneous Information Sources , 1996, IEEE Trans. Knowl. Data Eng..

[45]  Alberto O. Mendelzon,et al.  Finding Regular Simple Paths in Graph Databases , 1989, SIAM J. Comput..

[46]  Serge Abiteboul,et al.  Queries and computation on the web , 1997, Theor. Comput. Sci..

[47]  Wolfgang May,et al.  A Rule-Based Querying and Updating Language for XML , 2001, DBPL.

[48]  Charles F. Goldfarb,et al.  SGML handbook , 1990 .

[49]  Alberto O. Mendelzon,et al.  Formal models of Web queries , 1997, Inf. Syst..

[50]  Frank Wm. Tompa,et al.  Text / Relational Database Management Systems: Harmonizing SQL and SGML , 1994, ADB.

[51]  E. F. Codd,et al.  Extending the database relational model to capture more meaning , 1979, ACM Trans. Database Syst..

[52]  Andreas Podelski,et al.  Towards a Meaning of LIFE , 1991, J. Log. Program..

[53]  Mohand-Said Hacid,et al.  A Database Approach for Modeling and Querying Video Data , 2000, IEEE Trans. Knowl. Data Eng..

[54]  Hans-Jürgen Bürckert,et al.  A Resolution Principle for Constrained Logics , 1994, Artif. Intell..

[55]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.