User defined aggregates in object-relational systems

User-defined aggregates are essential in many advanced database applications, particularly in expressing data mining functions, but they find little support in current systems including object-relational databases. Three serious limitations of current systems are (i) the inability to introduce new aggregates (e.g., by coding them in procedural language as originally proposed in SQL3), (ii) the inability to return partial results during the computation (e.g. to support online aggregation), and (iii) the inability to use aggregates in recursive queries (e.g. to express bill of materials and optimized graph searches). In this paper, we presents a unified solution to these problems which realizes the SQL3 original proposal for user-defined aggregates (U-DAs), and adds significant improvements in terms of expressive power and ease of use: in fact our SQL-AG system also supports online aggregation, monotonic aggregation, and a high-level aggregate definition language named SADL. We focus on applications of UDAs and SADL.

[1]  Michael Stonebraker,et al.  The Implementation of Postgres , 1990, IEEE Trans. Knowl. Data Eng..

[2]  Christos Faloutsos,et al.  Advanced Database Systems , 1997, Lecture Notes in Computer Science.

[3]  Allen Van Gelder,et al.  Foundations of Aggregation in Deductive Databases , 1993, DOOD.

[4]  Rakesh Agrawal,et al.  SPRINT: A Scalable Parallel Classifier for Data Mining , 1996, VLDB.

[5]  Carlo Zaniolo,et al.  Metaqueries for Data Mining , 1996, Advances in Knowledge Discovery and Data Mining.

[6]  Kenneth A. Ross,et al.  Querying Multiple Features of Groups in Relational Databases , 1996, VLDB.

[7]  Donald D. Chamberlin,et al.  Using the New DB2: IBM's Object-Relational Database System , 1996 .

[8]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[9]  Helen J. Wang,et al.  Online aggregation , 1997, SIGMOD '97.

[10]  Kenneth A. Ross,et al.  Monotonic Aggregation in Deductive Database , 1997, J. Comput. Syst. Sci..

[11]  Carlo Zaniolo,et al.  Temporal aggregation in active database rules , 1997, SIGMOD '97.

[12]  Charles Elkan,et al.  Boosting and Naive Bayesian learning , 1997 .

[13]  Sunita Sarawagi,et al.  Integrating association rule mining with relational database systems: alternatives and implications , 1998, SIGMOD '98.

[14]  Carlo Zaniolo,et al.  Logic-Based User-Defined Aggregates for the Next Generation of Database Systems , 1999, The Logic Programming Paradigm.

[15]  Carlo Zaniolo,et al.  Universal temporal extensions for database languages , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[16]  Surajit Chaudhuri,et al.  Scalable classification over SQL databases , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[17]  Carlo Zaniolo,et al.  User-Defined Aggregates for Datamining , 1999, 1999 ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.