Query reformulation for dynamic information integration

The standard approach to integrating heterogeneous information sources is to build a global schema that relates all of the information in the different sources, and to pose queries directly against it. The problem is that schema integration is usually difficult, and as soon as any of the information sources change or a new source is added, the process may have to be repeated.The SIMS system uses an alternative approach. A domain model of the application domain is created, establishing a fixed vocabulary for describing data sets in the domain. Using this language, each available information source is described. Queries to SIMS against the collection of available information sources are posed using terms from the domain model, and reformulation operators are employed to dynamically select an appropriate set of information sources and to determine how to integrate the available information to satisfy a query. This approach results in a system that is more flexible than existing ones, more easily scalable, and able to respond dynamically to newly available or unexpectedly missing information sources.This paper describes the query reformulation process in SIMS and the operators used in it. We provide precise definitions of the reformulation operators and explain the rationale behind choosing the specific ones SIMS uses. We have demonstrated the feasibility and effectiveness of this approach by applying SIMS in the domains of transportation planning and medical trauma care.

[1]  Craig A. Knoblock,et al.  Rule Induction for Semantic Query Optimization , 1994, KDD Workshop.

[2]  Craig A. Knoblock,et al.  Query Processing in an Information Mediator , 1994 .

[3]  Waiman Cheung The model-assisted global query system , 1992 .

[4]  Ronald J. Brachman,et al.  An overview of the KL-ONE Knowledge Representation System , 1985 .

[5]  John F. Sowa,et al.  Principles of semantic networks , 1991 .

[6]  Craig A. Knoblock Generating Parallel Execution Plans with a Partial-order Planner , 1994, AIPS.

[7]  Peter F. Patel-Schneider,et al.  Living wiht Classic: When and How to Use a KL-ONE-Like Language , 1991, Principles of Semantic Networks.

[8]  Terry A. Landers,et al.  An Overview of MULTIBASE , 1986, DDB.

[9]  Tomasz Ksiezyk,et al.  Integrating enterprise information models in Carnot , 1993, [1993] Proceedings International Conference on Intelligent and Cooperative Information Systems.

[10]  Amar Gupta,et al.  Integration of Information Systems: Bridging Heterogeneous Databases , 1989 .

[11]  Timothy W. Finin,et al.  The Intelligent Database Interface: Integrating AI and Database Systems , 1990, AAAI.

[12]  Stuart E. Madnick,et al.  A Metadata Approach to Resolving Semantic Conflicts , 2011, VLDB.

[13]  Donald D. Chamberlin,et al.  Access Path Selection in a Relational Database Management System , 1989 .

[14]  Ramanathan V. Guha,et al.  Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project , 1990 .

[15]  Ronald J. Brachman,et al.  An Overview of the KL-ONE Knowledge Representation System , 1985, Cogn. Sci..

[16]  Craig A. Knoblock Planning, Executing, Sensing, and Replanning for Information Gathering , 1995, IJCAI.

[17]  Divesh Srivastava,et al.  Data model and query evaluation in global information systems , 1995, Journal of Intelligent Information Systems.

[18]  Mark E. Deppe,et al.  Distributed Data Bases; A Summary of Research , 1976, Comput. Networks.

[19]  Bharat Bhasker Query processing in heterogeneous distributed database management systems , 1992 .

[20]  Craig A. Knoblock,et al.  Retrieving and Integrating Data from Multiple Information Sources , 1993, Int. J. Cooperative Inf. Syst..

[21]  Tim Finin,et al.  View-Concepts: Knowledge-Based Access to Databases , 1992, CIKM 1992.

[22]  Robert M. MacGregor,et al.  The Evolving Technology of Classification-Based Knowledge Representation Systems , 1991, Principles of Semantic Networks.

[23]  Alan Bundy,et al.  Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence - IJCAI-95 , 1995 .

[24]  Craig A. Knoblock,et al.  Using Inductive Learning To Generate Rules for Semantic Query Optimization , 1996, Advances in Knowledge Discovery and Data Mining.

[25]  Craig A. Knoblock,et al.  Estimating the Robustness of Discovered Knowledge , 1995, KDD.

[26]  Christine Collet,et al.  Resource integration using a large knowledge base in Carnot , 1991, Computer.

[27]  Craig A. Knoblock,et al.  Reformulating query plans for multidatabase systems , 1993, CIKM '93.

[28]  Louiqa Raschid,et al.  Query Transformation Techniques for Interoperable Query Processing in Cooperative Information Systems , 1994, CoopIS.

[29]  Charles J. Petrie Model Driven Enterprise Information Management in Carnot , 1992 .

[30]  Charles J. Petrie Enterprise integration modeling : proceedings of the first international conference , 1992 .

[31]  Jungyun Seo,et al.  Classifying schematic and data heterogeneity in multidatabase systems , 1991, Computer.

[32]  Robert Mac Gregor,et al.  THE EVOLVING TECHNOLOGY OF CLASSIFICATION-BASED KNOWLEDGE REPRESENTATION SYSTEMS , 1991 .

[33]  Weimin Du,et al.  The Pegasus heterogeneous multidatabase system , 1991, Computer.

[34]  Paliath Narendran,et al.  An algorithm for finding canonical sets of ground rewrite rules in polynomial time , 1993, JACM.