1 The World Wide Web is a large, heterogeneous , distributed collection of documents connected by hypertext links. The most common technology currently used for searching the Web depends on sending information retrieval requests to "index servers" that index as many documents as they can nd by navigating the network. One problem with this is that users must be aware of the various index servers (over a dozen of them are currently deployed on the Web), of their strengths and weaknesses, and of the peculiarities of their query interfaces. A more serious problem is that these queries cannot exploit the structure and topology of the document network. In this paper we propose a query language, WebSQL, that takes advantage of multiple index servers without requiring users to know about them, and that integrates textual retrieval with structure and topology-based queries. We give a formal semantics for WebSQL using a calculus based on a novel \virtual graph" model of a document network. We propose a new theory of query cost based on the idea of \query locality," that is, how much of the network must be visited to answer a particular query. We give an algorithm for characterizing WebSQL queries with respect to query locality. Finally , we describe a prototype implementation of Web-SQL written in Java.
[1]
Alberto O. Mendelzon,et al.
Expressing structural hypertext queries in graphlog
,
1989,
Hypertext.
[2]
Catriel Beeri,et al.
A Logical Query Language for Hypertext Systems
,
1992,
ECHT.
[3]
Serge Abiteboul,et al.
Querying and Updating the File
,
1993,
VLDB.
[4]
Serge Abiteboul,et al.
From structured documents to novel query facilities
,
1994,
SIGMOD '94.
[5]
Oren Etzioni,et al.
Multi-Service Search and Comparison Using the MetaCrawler
,
1995
.
[6]
Alberto O. Mendelzon,et al.
Finding Regular Simple Paths in Graph Databases
,
1989,
SIAM J. Comput..
[7]
Jennifer Widom,et al.
Querying Semistructured Heterogeneous Information
,
1995,
J. Syst. Integr..
[8]
David Konopnicki,et al.
W3QS: A Query System for the World-Wide Web
,
1995,
VLDB.
[9]
George A. Mihaila.
WebSQL - An SQL-like Query Language for the World Wide Web
,
1996
.
[10]
Laks V. S. Lakshmanan,et al.
A declarative language for querying and restructuring the Web
,
1996,
Proceedings RIDE '96. Sixth International Workshop on Research Issues in Data Engineering.
[11]
Alberto O. Mendelzon,et al.
Formal models of Web queries
,
1997,
Inf. Syst..