Mediators over taxonomy-based information sources

Abstract. We propose a mediator model for providing integrated and unified access to multiple taxonomy-based sources. Each source comprises a taxonomy and a database that indexes objects under the terms of the taxonomy. A mediator comprises a taxonomy and a set of relations between the mediator’s and the sources’ terms, called articulations. By combining different modes of query evaluation at the sources and the mediator with different types of query translation, we obtain a flexible, efficient scheme of mediator operation that can accommodate various application needs and levels of answer quality. We adopt a simple conceptual modeling approach (taxonomies and inter-taxonomy mappings) and illustrate its advantages in terms of ease of use, uniformity, scalability, and efficiency. These characteristics make the proposal appropriate for a large-scale network of sources and mediators.
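The structure described in the abstract can be pictured with a minimal sketch. Everything in it is an illustrative assumption rather than the paper's formal model: the class names `Source` and `Mediator`, the dictionary layout, the sample terms and object ids, and the choice to answer a term with the objects indexed under it or under any narrower term. Articulations are simplified here to pairs stating that a mediator term subsumes a source term.

```python
class Source:
    """A taxonomy-based source: subsumption edges plus an object index."""

    def __init__(self, narrower, index):
        # narrower: term -> set of directly narrower terms (assumed layout)
        # index: term -> set of object ids indexed directly under that term
        self.narrower = narrower
        self.index = index

    def descendants(self, term):
        """Return the term together with every term it subsumes."""
        seen, stack = set(), [term]
        while stack:
            t = stack.pop()
            if t not in seen:
                seen.add(t)
                stack.extend(self.narrower.get(t, ()))
        return seen

    def answer(self, term):
        """Objects indexed under the term or under any narrower term."""
        result = set()
        for t in self.descendants(term):
            result |= self.index.get(t, set())
        return result


class Mediator(Source):
    """A mediator: its own taxonomy plus articulations to source terms."""

    def __init__(self, narrower, articulations, sources):
        # The mediator stores no objects itself, so its index is empty.
        super().__init__(narrower, index={})
        # articulations: mediator term -> list of (source name, source term)
        # pairs that the mediator term is assumed to subsume
        self.articulations = articulations
        self.sources = sources

    def answer(self, term):
        """Translate the term through the articulations and merge answers."""
        result = set()
        for t in self.descendants(term):
            for name, source_term in self.articulations.get(t, ()):
                result |= self.sources[name].answer(source_term)
        return result


# Hypothetical sample data, invented for this sketch.
s1 = Source({"Cameras": {"SLR", "Digital"}},
            {"Cameras": {"o3"}, "SLR": {"o1"}, "Digital": {"o2"}})
s2 = Source({"Photo": {"DSLR"}},
            {"Photo": {"o5"}, "DSLR": {"o4"}})
mediator = Mediator(
    narrower={"Electronics": {"Photography"}},
    articulations={"Photography": [("s1", "Cameras"), ("s2", "DSLR")]},
    sources={"s1": s1, "s2": s2},
)

print(sorted(mediator.answer("Electronics")))
```

In this sketch the query on `Electronics` reaches both sources through the narrower mediator term `Photography`; object `o5` is not returned because the articulation covers only `DSLR`, not the broader source term `Photo`.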
