Database Summarization: The SaintEtiQ System

SaintEtiQ (Saint-Paul et al., 2005) is an online linguistic summarization system for database tables and views. Our approach takes a first normal form relation R(A<sub>1</sub>,...,A<sub>n</sub>) in the relational database model and constructs a new relation R*(A<sub>1</sub>,...,A<sub>n</sub>) in which each tuple z is a summary whose attribute values are linguistic labels describing R<sub>z</sub>, a set of tuples that forms a sub-table of R. The SaintEtiQ system thus identifies statements of the form "Most tuples of R are (a<sub>1</sub><sup>1</sup> or a<sub>1</sub><sup>2</sup> ... or a<sub>1</sub><sup>m<sub>1</sub></sup>) and (a<sub>2</sub><sup>1</sup> ... or a<sub>2</sub><sup>m<sub>2</sub></sup>)".
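To make the shape of such a statement concrete, the following is a minimal toy sketch, not the SaintEtiQ algorithm itself. It assumes crisp, hand-written labeling functions (`age_label`, `income_label`), an invented four-tuple relation, and a reading of "most" as "more than half"; all of these names, thresholds, and data are hypothetical and chosen only for illustration. A summary z is modeled as a mapping from each attribute to a set of linguistic labels, and R<sub>z</sub> is the set of tuples of R whose labels fall inside those sets.

```python
from typing import Callable, Dict, List, Set

Row = Dict[str, float]
Summary = Dict[str, Set[str]]

# Hypothetical crisp labelings; the thresholds are illustrative assumptions.
def age_label(v: float) -> str:
    return "young" if v < 30 else "adult" if v < 60 else "old"

def income_label(v: float) -> str:
    return "low" if v < 20000 else "medium" if v < 60000 else "high"

LABELERS: Dict[str, Callable[[float], str]] = {
    "age": age_label,
    "income": income_label,
}

def covered(R: List[Row], z: Summary) -> List[Row]:
    """Return R_z: the tuples of R whose label on every attribute of z
    belongs to the label set that z holds for that attribute."""
    return [
        t for t in R
        if all(LABELERS[a](t[a]) in labels for a, labels in z.items())
    ]

def statement(z: Summary) -> str:
    """Render z as a 'Most tuples of R are (...) and (...)' sentence."""
    parts = ["(" + " or ".join(sorted(labels)) + ")" for labels in z.values()]
    return "Most tuples of R are " + " and ".join(parts)

# Invented toy relation R(age, income).
R: List[Row] = [
    {"age": 25, "income": 15000},
    {"age": 35, "income": 30000},
    {"age": 45, "income": 18000},
    {"age": 70, "income": 90000},
]

# One summary tuple z of R*: a set of labels per attribute.
z: Summary = {"age": {"young", "adult"}, "income": {"low", "medium"}}

R_z = covered(R, z)
support = len(R_z) / len(R)   # fraction of R described by z
is_most = support > 0.5       # assumed reading of "most"

print(statement(z), f"[support={support:.2f}, most={is_most}]")
```

In this sketch the summary z covers three of the four tuples, so the sentence it renders holds under the assumed "more than half" threshold. A real system would derive the labels and the summaries from the data and from domain knowledge rather than from a hand-written z.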

[1] Noureddine Mouaddib, et al. General Purpose Database Summarization, 2005, VLDB.

[2] G. Raschia, et al. Mining a commercial banking data set: the SaintEtiQ approach, 2002, IEEE International Conference on Systems, Man and Cybernetics.