Querying Heterogeneous XML Sources through a Conceptual Schema

XML is a widespread W3C standard used by several kinds of applications for data representation and exchange over the web. In the context of a system that provides semantic integration of heterogeneous XML sources, the same information at a semantic level may have different representations in XML. However, the syntax of an XML query depends on the structure of the specific XML source. Therefore, in order to obtain the same query result, one must write a specific query for each XML source. To deal with such problem, a much better solution is to state queries against a global conceptual schema and then translate them into an XML query against each specific data source. This paper presents CXPath (Conceptual XPath), a language for querying XML sources at the conceptual level, as well as a translation mechanism that converts a CXPath query to an XPath query against a specific XML source.

[1]  Steven J. DeRose,et al.  XML Path Language (XPath) , 1999 .

[2]  Jennifer Widom,et al.  The Lorel query language for semistructured data , 1997, International Journal on Digital Libraries.

[3]  Silvana Castano,et al.  A Method for the Unification of XML Schemata , 2002, Inf. Softw. Technol..

[4]  Torben Bach Pedersen,et al.  Converting XML DTDs to UML diagrams for conceptual data integration , 2001, Data Knowl. Eng..

[5]  Geert-Jan Houben,et al.  RDF-Based Architecture for Semantic Integration of Heterogeneous Information Sources , 2001, Workshop on Information Integration on the Web.

[6]  Stefano Spaccapietra,et al.  An entity-relationship algebra , 1984, 1984 IEEE First International Conference on Data Engineering.

[7]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[8]  Peter Bernus,et al.  Handbook on Architectures of Information Systems , 1999 .

[9]  Venkataraman Ramesh,et al.  Management of Heterogeneous and Autonomous Database Systems , 1999 .

[10]  David W. Embley,et al.  A Relationally Complete Query Language for an Entity-Relationship Model , 1985, ER.

[11]  Guido Moerkotte,et al.  Querying documents in object databases , 1997, International Journal on Digital Libraries.

[12]  Ronaldo dos Santos Mello,et al.  A Bottom-Up Approach for Integration of XML Sources , 2001, Workshop on Information Integration on the Web.

[13]  Neil Bradley The XML Companion , 1998 .

[14]  Ronaldo dos Santos Mello,et al.  A Rule-Based Conversion of a DTD to a Conceptual Schema , 2001, ER.

[15]  Bodo Rieger,et al.  Semantic Integration of Heterogeneous Information Sources , 2000, EFIS.

[16]  John Mylopoulos,et al.  A Semantic Approach to XML-based Data Integration , 2001, ER.