How foreign function integration conquers heterogeneous query processing

With the emergence of application systems that encapsulate databases and related application components, pure data integration using, for example, a federated database system is no longer possible. Instead, predefined functions are the only way to retrieve data from an application system. As a result, retrieval from such heterogeneous and encapsulated data sources requires a combination of generic query access and predefined function access. In this paper, we present a middleware approach that supports this novel, extended kind of integration. Starting with the overall architecture, we explain the functionality and cooperation of its core components: a federated database system and a workflow management system connected via a wrapper. We then concentrate on essential aspects of query processing across these heterogeneous components, focusing on the impact of the functions involved. We discuss the operations the wrapper should provide in order to extend the workflow system's native functionality. In addition to selection and projection, these operations could include aggregation and support for subqueries. Moreover, we point out the modifications to the traditional cost model that are needed to account for the cost estimates of the function calls as well.
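To make the wrapper's role concrete, the following is a minimal sketch, not the implementation described in the paper. It assumes a predefined function can be modeled as a Python callable that returns rows, and that the function-call cost can be approximated by a fixed per-call overhead plus a per-row charge. All names (`ForeignFunction`, `wrapped_call`, `estimated_cost`, `get_parts_by_supplier`) and all cost constants are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Optional, Sequence

Row = Dict[str, object]


@dataclass
class ForeignFunction:
    """A predefined function exposed by an application system (hypothetical model)."""
    name: str
    call: Callable[..., Iterable[Row]]
    cost_per_call: float  # assumed fixed overhead of one invocation
    cost_per_row: float   # assumed cost of transferring one result row


def wrapped_call(
    fn: ForeignFunction,
    args: Sequence[object],
    predicate: Optional[Callable[[Row], bool]] = None,
    columns: Optional[Sequence[str]] = None,
) -> List[Row]:
    """Invoke the function, then apply selection and projection in the wrapper."""
    rows: Iterable[Row] = fn.call(*args)
    if predicate is not None:
        # Selection: the function itself cannot filter on this condition.
        rows = (r for r in rows if predicate(r))
    if columns is not None:
        # Projection: keep only the requested attributes.
        rows = ({c: r[c] for c in columns} for r in rows)
    return list(rows)


def estimated_cost(
    fn: ForeignFunction,
    calls: int,
    rows_per_call: int,
    local_cost_per_row: float,
) -> float:
    """Traditional per-row processing cost plus an assumed function-call term."""
    function_cost = calls * (fn.cost_per_call + rows_per_call * fn.cost_per_row)
    local_cost = calls * rows_per_call * local_cost_per_row
    return function_cost + local_cost


# Hypothetical application-system data, reachable only through the function.
_PARTS: List[Row] = [
    {"part_no": 1, "supplier": "A", "price": 10.0},
    {"part_no": 2, "supplier": "A", "price": 25.0},
    {"part_no": 3, "supplier": "B", "price": 5.0},
]


def get_parts_by_supplier(supplier: str) -> List[Row]:
    return [p for p in _PARTS if p["supplier"] == supplier]


fn = ForeignFunction("get_parts_by_supplier", get_parts_by_supplier,
                     cost_per_call=5.0, cost_per_row=0.5)

expensive_parts = wrapped_call(
    fn, ("A",),
    predicate=lambda r: r["price"] > 20,
    columns=["part_no"],
)
total_cost = estimated_cost(fn, calls=2, rows_per_call=10, local_cost_per_row=0.25)
```

In this sketch the function supports only lookup by supplier, so the price condition and the projection onto `part_no` are evaluated in the wrapper. The cost estimate shows how a per-call term could be added to a conventional per-row estimate. How the actual cost model weighs these terms is not specified here.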