Conceptual XML for systems analysis

Because XML has become a new standard for data representation, there is a need for a simple conceptual model that works well with XML-based development. In this research we present a conceptual model for XML, called C-XML, which meets this need for systems analysts who store their data in XML. We describe our implementation of an automatic conversion from XML Schema to C-XML that preserves information and constraints. With this conversion, we can view an XML Schema instance graphically at a higher level of abstraction. We also describe our implementation of an automatic conversion from C-XML to XML Schema. This conversion preserves information and constraints, provided that we count the special C-XML comments that we insert in an XML Schema instance to capture the C-XML constraints that are not representable in XML Schema. In defining C-XML and implementing the conversions between C-XML and XML Schema, we also make several observations. We point out ways in which C-XML is more expressive than XML Schema, and we recommend extensions to XML Schema. We also point out ways in which XML Schema is more expressive than conceptual models, and we recommend augmentations to traditional conceptual models so that they better accommodate XML. This work establishes a basis for several fundamental activities in systems analysis, design, development, and evolution.
