Π-web join in a web warehouse

With the enormous amount of data stored on the World Wide Web, it is increasingly important to design and develop powerful web warehousing tools. The key objective of our web warehousing project, called WHOWEDA (Warehouse of Web Data), is to design and implement a web warehouse that materializes and manages useful information from the web. We introduce the concept of the Π-web join in the context of WHOWEDA. The Π-web join operator is a web information manipulation operator that combines relevant web information residing in two web tables. Informally, it is the combination of the web join and web project operators, which together filter out irrelevant information from a joined web table. We show how to construct the Π-joined web table and its schema. We also highlight the benefits of the Π-web join operator.
