Catching the boat with Strudel: experiences with a Web-site management system

The Strudel system applies concepts from database management systems to the process of building Web sites. Strudel's key idea is separating the management of the site's data, the creation and management of the site's structure, and the visual presentation of the site's pages. First, the site builder creates a uniform model of all data available at the site. Second, the builder uses this model to declaratively define the Web site's structure by applying a “site-definition query” to the underlying data. The result of evaluating this query is a “site graph”, which represents both the site's content and structure. Third, the builder specifies the visual presentation of pages in Strudel's HTML-template language. The data model underlying Strudel is a semi-structured model of labeled directed graphs. We describe Strudel's key characteristics, report on our experiences using Strudel, and present the technical problems that arose from our experience. We describe our experience constructing several Web sites with Strudel and discuss the impact of potential users' requirements on Strudel's design. We address two main questions: (1) when does a declarative specification of site structure provide significant benefits, and (2) what are the main advantages provided by the semi-structured data model.

[1]  Michael R. Genesereth,et al.  Query planning in infomaster , 1997, SAC '97.

[2]  SuciuDan,et al.  A query language and optimization techniques for unstructured data , 1996 .

[3]  Paolo Merialdo,et al.  To Weave the Web , 1997, VLDB.

[4]  K. Selçuk Candan,et al.  Query caching and optimization in distributed mediator systems , 1996, SIGMOD '96.

[5]  Marc Friedman,et al.  Efficient execution of information gatheriug plans , 1997, IJCAI 1997.

[6]  Moshé M. Zloof Query-by-Example: A Data Base Language , 1977, IBM Syst. J..

[7]  Dan Suciu,et al.  Reasoning About Web-Site Structure , 1998, KRDB.

[8]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[9]  Peter Buneman,et al.  Semistructured data , 1997, PODS.

[10]  Serge Abiteboul,et al.  Querying Semi-Structured Data , 1997, Encyclopedia of Database Systems.

[11]  Michael Benedikt,et al.  Experience with a Domain Specific Language for Form-based Services , 1997, DSL.

[12]  Dan Suciu,et al.  Warehousing and incremental evaluation for Web Site management , 1998, BDA.

[13]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[14]  Peter T. Wood,et al.  Queries on graphs , 1989 .

[15]  Laura M. Haas,et al.  Optimizing Queries Across Diverse Data Sources , 1997, VLDB.

[16]  Sophie Cluet,et al.  Your mediators need data conversion! , 1998, SIGMOD '98.

[17]  Patrick Valduriez,et al.  A Methodology for Query Reformulation in CIS Using Semantic Knowledge , 1996, Int. J. Cooperative Inf. Syst..

[18]  Paolo Paolini,et al.  A Conceptual Model and a Tool Environment for Developing More Scalable, Dynamic, and Customizable Web Applications , 1998, EDBT.

[19]  Daniel S. Weld,et al.  Planning to Gather Information , 1996, AAAI/IAAI, Vol. 1.

[20]  Dan Suciu,et al.  Adding Structure to Unstructured Data , 1997, ICDT.

[21]  Dan Suciu,et al.  A query language and optimization techniques for unstructured data , 1996, SIGMOD '96.

[22]  Alberto O. Mendelzon,et al.  WebOQL: restructuring documents, databases and Webs , 1998, Proceedings 14th International Conference on Data Engineering.

[23]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[24]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.

[25]  Dan Suciu,et al.  A query language for a Web-site management system , 1997, SGMD.

[26]  Michael R. Genesereth,et al.  Answering recursive queries using views , 1997, PODS '97.

[27]  Dan Suciu,et al.  STRUDEL: a Web site management system , 1997, SIGMOD '97.