Piazza: data management infrastructure for semantic web applications

The Semantic Web envisions a World Wide Web in which data is described with rich semantics and applications can pose complex queries. To this point, researchers have defined new languages for specifying meanings for concepts and developed techniques for reasoning about them, using RDF as the data model. To flourish, the Semantic Web needs to be able to accommodate the huge amounts of existing data and the applications operating on them. To achieve this, we are faced with two problems. First, most of the world's data is available not in RDF but in XML; XML and the applications consuming it rely not only on the domain structure of the data, but also on its document structure. Hence, to provide interoperability between such sources, we must map between both their domain structures and their document structures. Second, data management practitioners often prefer to exchange data through local point-to-point data translations, rather than mapping to common mediated schemas or ontologies.This paper describes the Piazza system, which addresses these challenges. Piazza offers a language for mediating between data sources on the Semantic Web, which maps both the domain structure and document structure. Piazza also enables interoperation of XML data with RDF data that is accompanied by rich OWL ontologies. Mappings in Piazza are provided at a local scale between small sets of nodes, and our query answering algorithm is able to chain sets mappings together to obtain relevant data from across the Piazza network. We also describe an implemented scenario in Piazza and the lessons we learned from it.

[1]  Frank van Harmelen,et al.  Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema , 2002, SEMWEB.

[2]  Jan Chomicki,et al.  Consistent query answers in inconsistent databases , 1999, PODS '99.

[3]  Catriel Beeri,et al.  Ontology-Based Integration of XML Web Resources , 2002, SEMWEB.

[4]  Alon Y. Halevy,et al.  An XML query engine for network-bound data , 2002, The VLDB Journal.

[5]  Serge Abiteboul,et al.  Complexity of answering queries using materialized views , 1998, PODS.

[6]  Pedro M. Domingos,et al.  Learning to map between ontologies on the semantic web , 2002, WWW '02.

[7]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[8]  Dan Suciu,et al.  Schema mediation for large-scale semantic data sharing , 2005, The VLDB Journal.

[9]  Mark A. Musen,et al.  PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment , 2000, AAAI/IAAI.

[10]  Michael Rys,et al.  Bringing the Internet to Your Database: Using SQLServer 2000 and XML to Build Loosely-Coupled Systems , 2001, BTW.

[11]  James A. Hendler,et al.  Owl web ontology language 1 , 2002 .

[12]  Tim Berners-Lee,et al.  Agent Technology on the Internet. 3. Integrating Applications on the Semantic Web. , 2002 .

[13]  S. Decker The Semantic Web-on the respective Roles of XML and RDF , 2000 .

[14]  Pedro M. Domingos,et al.  Reconciling schemas of disparate data sources: a machine-learning approach , 2001, SIGMOD '01.

[15]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[16]  Dan Suciu,et al.  The Piazza peer data management project , 2003, SGMD.

[17]  Paul Westerman Data Warehousing: Using the Wal-Mart Model , 2000 .

[18]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[19]  Alin Deutsch,et al.  A Query Language for XML , 1999, Comput. Networks.

[20]  Alon Y. Halevy,et al.  Combining Horn Rules and Description Logics in CARIN , 1998, Artif. Intell..

[21]  Vipul Kashyap,et al.  Imprecise Answers in Distributed Environments: Estimation of Information Loss for Multi-Ontology Based Query Processing , 2000, Int. J. Cooperative Inf. Syst..

[22]  Maurizio Lenzerini,et al.  Source inconsistency and incompleteness in data integration , 2002, KRDB.

[23]  Tova Milo,et al.  Views in a large-scale XML repository , 2002, The VLDB Journal.

[24]  Amit P. Sheth,et al.  Towards Peer-to-Peer Semantic Web: A Distributed Environment for Sharing Semantic Knowledge on the Web , 2002 .

[25]  Dan Suciu,et al.  SilkRoute: trading between relations and XML , 2000, Comput. Networks.

[26]  James A. Hendler,et al.  The semantic grid: The grid meets the semantic web , 2002 .

[27]  Oren Etzioni,et al.  Crossing the Structure Chasm , 2003, CIDR.

[28]  Deborah L. McGuinness,et al.  The Chimaera Ontology Environment , 2000, AAAI/IAAI.

[29]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[30]  Peter F. Patel-Schneider,et al.  Building the Semantic Web on XML , 2002, SEMWEB.

[31]  Mark A. Musen,et al.  PROMPT: Algorithm and tool for ontology merging and alignment , 2000, AAAI 2000.

[32]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[33]  James A. Hendler,et al.  Web ontology language (OWL) reference version 1 , 2002 .

[34]  Tore Risch,et al.  EDUTELLA: a P2P networking infrastructure based on RDF , 2002, WWW.

[35]  Ian Horrocks,et al.  The Semantic Web: The Roles of XML and RDF , 2000, IEEE Internet Comput..

[36]  Joann J. Ordille,et al.  Querying Heterogeneous Information Sources Using Source Descriptions , 1996, VLDB.