A workload-aware approach for optimizing the XML schema design trade-off

In general, the design of XML schemas involves translating conceptual schemas into XML schemas which aim to be: (i) normalized schemas, and (ii) connected structures in order to achieve good performance on queries. However, these requirements address a trade-off because highly connected XML structures allow data redundancy, and normalized schemas generate disconnected XML structures. This paper describes a workload-based approach which balances this trade-off on translating conceptual schemas into XML structures. An experimental study on an XML database shows that our XML schemas provide high query performance on the relevant elements for the workload and, at the same time, low cost of data redundancy on elements that are not relevant for update operations.

[1]  Ronaldo dos Santos Mello,et al.  Designing XML documents from conceptual schemas and workload information , 2009, Multimedia Tools and Applications.

[2]  Terry A. Halpin,et al.  Object Role Modelling and XML-Schema , 2000, ER.

[3]  Joseph Fong,et al.  Translating Relational Schema with Constraints into Xml Schema , 2006, Int. J. Softw. Eng. Knowl. Eng..

[4]  Harald Schöning Tamino - A DBMS designed for XML , 2001, ICDE.

[5]  Cong Yu,et al.  XML schema refinement through redundancy detection and normalization , 2008, The VLDB Journal.

[6]  David W. Embley,et al.  Enterprise Modeling with Conceptual XML , 2004, ER.

[7]  Yuan-Chi Chang,et al.  Schema advisor for hybrid relational-XML DBMS , 2007, SIGMOD '07.

[8]  Michael N. Huhns,et al.  Constructing Consensus Ontologies for the Semantic Web: A Conceptual Approach , 2004, World Wide Web.

[9]  David W. Embley,et al.  Generating compact redundancy-free XML documents from conceptual-model hypergraphs , 2006, IEEE Transactions on Knowledge and Data Engineering.

[10]  Laks V. S. Lakshmanan,et al.  Making Designer Schemas with Colors , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[11]  Denilson Barbosa,et al.  ToXgene: a template-based data generator for XML , 2002, SIGMOD '02.

[12]  Murali Mani,et al.  EReX: A Conceptual Model for XML , 2004, XSym.

[13]  Elisa Quintarelli,et al.  An algorithm for generating XML Schemas from ER Schemas , 2005, SEBD.

[14]  Shamkant B. Navathe,et al.  Conceptual Database Design: An Entity-Relationship Approach , 1991 .

[15]  Ramez Elmasri,et al.  The Category Concept: An Extension to the Entity-Relationship Model , 1985, Data Knowl. Eng..

[16]  Ronaldo dos Santos Mello,et al.  Improving query performance on XML documents: a workload-driven design approach , 2008, DocEng '08.

[17]  Marcelo Arenas,et al.  A normal form for XML documents , 2002, PODS '02.

[18]  Thomas Kudrass,et al.  Rule-Based Generation of XML Schemas from UML Class Diagrams , 2003, Berliner XML Tage.

[19]  Andrew Goodchild,et al.  UML and XML Schema , 2002, Australasian Database Conference.

[20]  Carlo Curino,et al.  Schism , 2010, Proc. VLDB Endow..