Web warehousing: an algebra for web information

Conventional keyword indexes maintained by web search engines such as Yahoo, Lycos, and World Wide Web Worm work well for most simple keyword searches, but they are inadequate for more complex, structured queries that involve the underlying hypertext structure of the World Wide Web. Existing work that approaches such queries from a database perspective focuses on constructing SQL-like query languages that assume a relational abstraction of the WWW. However, the WWW is a directed graph, and imposing a relational abstraction filters out its inherent topological structure. We propose a data model for the WWW that retains this topological structure, and we construct a web algebra to manipulate objects in the model. The web algebra establishes a formal foundation on which different web query languages can be designed.
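As a rough illustration of a query that depends on hypertext structure rather than on keywords alone, the sketch below treats a handful of pages as a directed graph and asks for pages that contain a keyword and lie within a bounded number of links from a starting page. It is not the web algebra proposed here: the `WEB` dictionary, the page names, and the function `reachable_with_keyword` are all hypothetical, chosen only to show the kind of topological information a flat keyword index discards.

```python
from collections import deque

# Hypothetical toy web: URL -> (page text, list of outgoing link targets).
WEB = {
    "a.html": ("database research group", ["b.html", "c.html"]),
    "b.html": ("query languages for hypertext", ["d.html"]),
    "c.html": ("campus map", []),
    "d.html": ("web algebra draft", ["a.html"]),
}


def reachable_with_keyword(web, start, keyword, max_hops):
    """Return URLs within max_hops links of start whose text contains keyword.

    Pages are visited breadth-first, so results are ordered by link distance.
    """
    seen = {start}
    queue = deque([(start, 0)])
    hits = []
    while queue:
        url, dist = queue.popleft()
        text, links = web[url]
        if keyword in text:
            hits.append(url)
        # Follow outgoing links only while the hop budget allows it.
        if dist < max_hops:
            for target in links:
                if target in web and target not in seen:
                    seen.add(target)
                    queue.append((target, dist + 1))
    return hits
```

A keyword index could report that `d.html` mentions "web", but not that it sits two links away from `a.html`; that distance is exactly the structural information the query above relies on.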
