Modeling and querying web data: a constraint-based logic approach

An efficient and expressive representation of the structure of documents circulated over the Internet enables effective querying and reasoning over them, which is a major goal for large information resources such as the World Wide Web (WWW). Constraints are a valuable tool for managing information. In this work, we consider how constraint-based technology can be used to query and reason about semistructured data represented using models based on constraint logic. The constraint system FT≤ provides information-ordering constraints interpreted over feature trees. Building on this system, we show how a generalization of FT≤ combined with path constraints can be used to formally represent semistructured data, state constraints on it, and reason over it. The proposed query language is extended to support query relaxation when an exact answer to a query cannot be obtained from the data repository. The applicability of this framework, proposed for semistructured data, is examined for the particular case of XML documents circulated and stored on the Web.
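
To make the idea of information ordering over feature trees concrete, the following is a minimal sketch in Python. It is not the paper's constraint system. It assumes a simplified reading in which a feature tree is a node with an optional label and feature-named edges to subtrees, and in which one tree is below another when all of its labels and edges also occur in the other. The names FeatureTree and leq, and the sample data, are hypothetical and chosen only for illustration. Path constraints and query relaxation are not modeled.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class FeatureTree:
    """Hypothetical feature tree: optional node label plus feature-named edges."""
    label: Optional[str] = None
    children: Dict[str, "FeatureTree"] = field(default_factory=dict)


def leq(t1: FeatureTree, t2: FeatureTree) -> bool:
    """Return True if t1 carries no more information than t2 (assumed ordering)."""
    # A label present in t1 must appear unchanged at the same node of t2.
    if t1.label is not None and t1.label != t2.label:
        return False
    # Every feature edge of t1 must exist in t2, and the subtrees must be ordered too.
    for feature, sub1 in t1.children.items():
        sub2 = t2.children.get(feature)
        if sub2 is None or not leq(sub1, sub2):
            return False
    return True


# Hypothetical data: a partial description and a fuller document tree.
query = FeatureTree("book", {"title": FeatureTree()})
document = FeatureTree("book", {
    "title": FeatureTree("Foundations of Logic Programming"),
    "year": FeatureTree("1987"),
})
```

Under these assumptions, leq(query, document) holds because the document has every label and edge the partial description asks for, while leq(document, query) does not. Treating a query as a partial description that a stored tree must extend is one way to read querying with ordering constraints; the actual system in the paper is more general.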
