The TSIMMIS Project: Integration of Heterogeneous Information Sources

The goal of the Tsimmis Project is to develop tools that facilitate the rapid integration of heterogeneous information sources that may include both structured and unstructured data. This paper gives an overview of the project, describing components that extract properties from unstructured objects, that translate information into a common object model, that combine information from several sources, that allow browsing of information, and that manage constraints across heterogeneous sites. Tsimmis is a joint project between Stanford and the IBM Almaden Research Center. 1 Overview A common problem facing many organizations today is that of multiple, disparate information sources and repositories, including databases, object stores, knowledge bases, file systems, digital libraries, information retrieval systems, and electronic mail systems. Decision makers often need information from multiple sources, but are unable to get and fuse the required information in a timely fashion due to the diffculties of accessing the different systems, and due to the fact that the information obtained can be inconsistent and contradictory. Research sponsored by the Wright Laboratory, Aeronautical Systems Center, Air Force Material Command, USAF, under Grant Number F33615-93-1-1339. The US Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation thereon. The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the offcial policies or endorsements, either express or implied, of Wright Laboratory or the US Government. This work was also supported by the Reid and Polly Anderson Faculty Scholar Fund, the Center for Integrated Systems at Stanford University, and by Equipment Grants from Digital Equipment Corporation and IBM Corporation. The goal of the TSIMMIS 1 project is to provide tools for accessing, in an integrated fashion, multiple informati

[1]  C. Batini,et al.  A comparative analysis of methodologies for database schema integration , 1986, CSUR.

[2]  R. Snodgrass Temporal Databases , 1986, Computer.

[3]  Henry F. Korth,et al.  Query Languages for Nested Relational Databases , 1987, NF².

[4]  G. Weiderhold File organization for database design , 1987 .

[5]  Abraham Silberschatz,et al.  Extended algebra and calculus for nested relational databases , 1988, TODS.

[6]  Amar Gupta,et al.  Integration of Information Systems: Bridging Heterogeneous Databases , 1989 .

[7]  Martín Abadi,et al.  Temporal Logic Programming , 1989, J. Symb. Comput..

[8]  Nick Roussopoulos,et al.  Interoperability of multiple autonomous databases , 1990, CSUR.

[9]  Stephen Fox,et al.  Heterogeneous distributed database systems for production use , 1990, ACM Comput. Surv..

[10]  Amit P. Sheth,et al.  Specifying interdatabase dependencies in a multidatabase environment , 1991, Computer.

[11]  Weimin Du,et al.  The Pegasus heterogeneous multidatabase system , 1991, Computer.

[12]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[13]  Michael Kifer,et al.  Querying object-oriented databases , 1992, SIGMOD '92.

[14]  Paul W. P. J. Grefen Combining Theory and Practice in Integrity Control: A Declarative Approach to the Specification of a Transaction Modification Subsystem , 1993, VLDB.

[15]  Jennifer Widom,et al.  Constraint Management in Loosely Coupled Distributed Databases , 1993 .

[16]  Peter M. Schwarz,et al.  The Rufus System: Information Organization for Semi-Structured Data , 1993, VLDB.

[17]  Jennifer Widom,et al.  Managing Semantic Heterogeneity with Production Rules and Persistent Queues , 1993, VLDB.

[18]  Dale Skeen,et al.  The Information Bus: an architecture for extensible distributed systems , 1994, SOSP '93.

[19]  R. G. G. Cattell,et al.  The Object Database Standard: ODMG-93 , 1993 .

[20]  Larry Rosenbaum Lotus Notes, Release 3 , 1994 .

[21]  Jennifer Widom,et al.  Object exchange across heterogeneous information sources , 1995, Proceedings of the Eleventh International Conference on Data Engineering.