When conceptual model meets grammar: A dual approach to XML data modeling

In this paper we introduce a novel approach to conceptual modeling for XML schemas. Compared to other approaches, it allows for modeling of a whole family of XML schemas related to a particular application domain. It is integrated in a well-established way of software-engineering, namely Model-Driven Development (MDD). It allows software-engineers to naturally model their application domain using a conceptual schema at the platform-independent level of the MDD hierarchy. From there they can design the desired XML schemas in a form of conceptual schemas at the platform-specific level of MDD hierarchy. Schemas at the platform-specific level are then automatically translated to particular XML schemas. Beside this forward-engineering direction, reverse-engineering direction integrating existing XML schemas into the MDD hierarchy is supported as well. We provide several theoretical results which ensure correctness of the introduced approach. We exploit regular tree grammars to formalize XML schemas. We formalize the bindings between the schemas at the two MDD levels and between schemas at the platform-specific level and XML schemas. We prove that conceptual schemas specify the target XML schemas unambiguously. We also prove the expressive power of the conceptual schemas. And, finally, we prove correctness of the introduced translation algorithms between platform-specific and XML schema levels.

[1]  J. Euzenat,et al.  Ontology Matching , 2007, Springer Berlin Heidelberg.

[2]  David W. Embley,et al.  Extracting a largest redundancy-free XML storage structure from an acyclic hypergraph in polynomial time , 2010, Inf. Syst..

[3]  Robert Steele,et al.  An overview of research on reverse engineering XML schemas into UML diagrams , 2005, Third International Conference on Information Technology and Applications (ICITA'05).

[4]  Torben Bach Pedersen,et al.  Converting XML DTDs to UML diagrams for conceptual data integration , 2001, Data Knowl. Eng..

[5]  Zongmin Ma,et al.  Fuzzy XML data modeling with the UML and relational data models , 2007, Data Knowl. Eng..

[6]  Sven Hartmann,et al.  Expressive, yet tractable XML keys , 2009, EDBT '09.

[7]  Dongwon Lee,et al.  Semantic Data Modeling Using XML Schemas , 2001, ER.

[8]  G. Kappel,et al.  Representing XML Schema in UML-An UML Profile for XML Schema , 2004 .

[9]  Gerti Kappel,et al.  Representing XML Schema in UML - A Comparison of Approaches , 2004, ICWE.

[10]  Gunter Saake,et al.  Improving XML schema matching performance using Prüfer sequences , 2009, Data Knowl. Eng..

[11]  Silvana Castano,et al.  Proceedings of the 15th International Conference on Conceptual Modeling , 2000 .

[12]  Krish Narayanan,et al.  Specifications for Mapping UML Models to XML Schemas , 2005 .

[13]  Carlo Curino,et al.  PRIMA: archiving and querying historical data with evolving schemas , 2009, SIGMOD Conference.

[14]  Rainer Eckstein,et al.  XML Conceptual Modeling Using UML , 2000, ER.

[15]  William C. Chu,et al.  Proceedings of the 2011 ACM Symposium on Applied Computing (SAC), TaiChung, Taiwan, March 21 - 24, 2011 , 2011, SAC.

[16]  Sourav S. Bhowmick,et al.  An XML Schema integration and query mechanism system , 2008, Data Knowl. Eng..

[17]  Ilkka Melleri Using Object Role Modeling in a Service-Oriented Data Integration Project , 2010, OTM Workshops.

[18]  C. M. Sperberg-McQueen,et al.  W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures , 2012 .

[19]  Schahram Dustdar,et al.  End-to-End Versioning Support for Web Services , 2008, 2008 IEEE International Conference on Services Computing.

[20]  Erhard Rahm,et al.  Comparison of Schema Matching Evaluations , 2002, Web, Web-Services, and Database Systems.

[21]  Antinisca Di Marco,et al.  Interoperability mapping from XML schemas to ER diagrams , 2006, Data Knowl. Eng..

[22]  Martin Necaský Conceptual Modeling for XML: A Survey , 2006, DATESO.

[23]  Sonia Bergamaschi,et al.  Schema Normalization for Improving Schema Matching , 2009, ER.

[24]  Irena Holubová,et al.  A Framework for Efficient Design, Maintaining, and Evolution of a System of XML Applications , 2010, DATESO.

[25]  Joseph Fong,et al.  The XML Tree Model - toward an XML conceptual schema reversed from XML Schema Definition , 2008, Data Knowl. Eng..

[26]  Giuseppe Psaila,et al.  ERX: a conceptual model for XML documents , 2000, SAC '00.

[27]  Giovanna Guerrini,et al.  Updating XML schemas and associated documents through exup , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[28]  Thomas Kudrass,et al.  Rule-Based Generation of XML Schemas from UML Class Diagrams , 2003, Berliner XML Tage.

[29]  Alex Thomo,et al.  Evolving schemas for streaming XML , 2010, Theor. Comput. Sci..

[30]  Wai Yin Mok,et al.  Design Non-recursive and Redundant-Free XML Conceptual Schema with Hypergraph (Extended Abstract) , 2011, DASFAA Workshops.

[31]  Carlo Batini,et al.  A Data Quality Methodology for Heterogeneous Data , 2011 .

[32]  Andrew Goodchild,et al.  UML and XML Schema , 2002, Australasian Database Conference.

[33]  Eladio Domínguez,et al.  Evolution of XML schemas and documents from stereotyped UML class models: A traceable approach , 2011, Inf. Softw. Technol..

[34]  Wenfei Fan,et al.  Reasoning about Keys for XML , 2001, DBPL.

[35]  Richard Chbeir,et al.  Extensible User-Based XML Grammar Matching , 2009, ER.

[36]  Eladio Domínguez,et al.  A Survey of UML Models to XML Schemas Transformations , 2007, WISE.

[37]  Martín Abadi,et al.  Security analysis of cryptographically controlled access to XML documents , 2005, PODS '05.

[38]  Meike Klettke Conceptual XML Schema Evolution - the CoDEX Approach for Design and Redesign , 2007, BTW Workshops.

[39]  Heiner Stuckenschmidt,et al.  Ontology Alignment Evaluation Initiative: Six Years of Experience , 2011, J. Data Semant..

[40]  K. C. Morris A framework for XML schema naming and design rules development tools , 2010, Comput. Stand. Interfaces.

[41]  Angelo Montanari,et al.  From Entity Relationship to XML Schema: a Graph-Theoretic Approach (Extended Abstract) , 2009, SEBD.

[42]  Shi Baile,et al.  Reverse Engineering XML , 2006, First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06).

[43]  Wenfei Fan,et al.  On the Complexity of Verifying Consistency of XML Specifications , 2008, SIAM J. Comput..

[44]  Zongmin Ma,et al.  Knowledge representation and reasoning of XML with ontology , 2011, SAC '11.

[45]  Georges Gardarin,et al.  Deriving Ontologies from XML Schema , 2010, ArXiv.

[46]  Martin Necaský,et al.  Conceptual Modeling of IS-A Hierarchies for XML , 2008, EJC.

[47]  Filip Murlak,et al.  XML schema mappings , 2009, PODS.

[48]  Young-Koo Lee,et al.  DTD2OWL: automatic transforming XML documents into OWL ontology , 2009, ICIS '09.

[49]  Georg Gottlob,et al.  Distributed XML Design , 2011, J. Comput. Syst. Sci..

[50]  Ramez Elmasri,et al.  Conceptual modeling for customized XML schemas , 2002, Data Knowl. Eng..

[51]  Joaquin Miller,et al.  MDA Guide Version 1.0.1 , 2003 .

[52]  Murali Mani,et al.  EReX: A Conceptual Model for XML , 2004, XSym.

[53]  Martin Necaský Conceptual modeling for XML , 2009, DISDBIS.

[54]  Antonio Badia,et al.  Conceptual modeling for semistructured data , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering (Workshops), 2002..

[55]  Reynold Cheng,et al.  Managing uncertainty of XML schema matching , 2010, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010).

[56]  B. Lóscio,et al.  Conceptual modeling of XML schemas , 2003, WIDM '03.

[57]  Husheng Liao,et al.  A survey to conceptual modeling for XML , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[58]  Wolfgang Klas,et al.  A survey of techniques for achieving metadata interoperability , 2010, CSUR.

[59]  Richi Nayak,et al.  Element similarity measures in XML schema matching , 2010, Inf. Sci..

[60]  Hisham M. Haddad Proceedings of the 2006 ACM symposium on Applied computing , 2006, SAC.

[61]  Murali Mani,et al.  Taxonomy of XML schema languages using formal language theory , 2005, TOIT.

[62]  Erhard Rahm,et al.  Revised Papers from the NODe 2002 Web and Database-Related Workshops on Web, Web-Services, and Database Systems , 2002 .

[63]  Irena Holubová,et al.  When Conceptual Model Meets Grammar: A Formal Approach to Semi-structured Data Modeling , 2010, WISE.

[64]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[65]  Tok Wang Ling,et al.  Normal Form ORA-SS Schema Diagrams , 2009, Encyclopedia of Database Systems.

[66]  Elke A. Rundensteiner,et al.  XEM: managing the evolution of XML documents , 2001, Proceedings Eleventh International Workshop on Research Issues in Data Engineering. Document Management for Data Intensive Business and Scientific Applications. RIDE 2001.

[67]  Narasimha Bolloju,et al.  Assisting novice analysts in developing quality conceptual models with UML , 2006, CACM.

[68]  Willem Jonker,et al.  Formalizing the XML Schema Matching Problem as a Constraint Optimization Problem , 2005, DEXA.

[69]  Carlo Curino,et al.  Managing and querying transaction-time databases under schema evolution , 2008, Proc. VLDB Endow..

[70]  Sven Hartmann,et al.  Solving the Implication Problem for XML Functional Dependencies with Properties , 2010, WoLLIC.

[71]  Tharam S. Dillon,et al.  Modeling Dynamic Properties in the Layered View Model for XML Using XSemantic Nets , 2006, APWeb Workshops.

[72]  Ling Liu,et al.  Encyclopedia of Database Systems , 2009, Encyclopedia of Database Systems.

[73]  Laurian M. Chirica,et al.  The entity-relationship model: toward a unified view of data , 1975, SIGF.

[74]  Ronaldo dos Santos Mello,et al.  Designing XML documents from conceptual schemas and workload information , 2009, Multimedia Tools and Applications.

[75]  Martin Necaský XSEM - A Conceptual Model for XML , 2007, APCCM.

[76]  Martin Necaský Reverse Engineering of XML Schemas to Conceptual Diagrams , 2009, APCCM.

[77]  Jacques Fayolle,et al.  XCDL: an XML-oriented visual composition definition language , 2010, iiWAS.

[78]  Jakub Malý,et al.  XML Document Versioning and Revalidation , 2011, DATESO.

[79]  Zachary G. Ives,et al.  Bidirectional Mappings for Data and Update Exchange , 2008, WebDB.

[80]  Irena Holubová,et al.  Five-Level Multi-Application Schema Evolution , 2009, DATESO.

[81]  Terry A. Halpin,et al.  Information Modelling and Relational Databases , 2001 .

[82]  Carlo Combi,et al.  Conceptual modeling of XML data , 2006, SAC '06.

[83]  Martin Necaský,et al.  Semi-automatic Integration of Web Service Interfaces , 2010, 2010 IEEE International Conference on Web Services.

[84]  Sven Hartmann,et al.  Numerical constraints on XML data , 2010, Inf. Comput..

[85]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[86]  David W. Embley,et al.  Augmenting Traditional Conceptual Models to Accommodate XML Structural Constructs , 2007, ER.