Architecture and Implementation of an XQuery-based Information Integration Platform.

An increasing number of business users and software applications need to process information that is accessible via multiple diverse information systems, such as database systems, file systems, legacy applications, or web services. We describe the Enosys XML Integration Platform (EXIP), a commercial XQuery-based data integration software platform that provides a queryable integrated view of such information. We present the platform architecture and the main principles and challenges of the query engine. In particular, we discuss the query engine architecture and the underlying semistructured algebra, which is tuned to enable query plan optimizations.
