A rule-based tool for gradual granular data aggregation

In order to keep more detailed data available for longer periods, old data has to be reduced gradually to save space and improve query performance, especially on resource-constrained systems with limited storage and query processing capabilities. In this regard, some hand-coded data aggregation solutions have been developed; however, their actual usage have been limited, for the reason that hand-coded data aggregation solutions have proven themselves too complex to maintain. Maintenance need to occur as requirements change frequently and the existing data aggregation techniques lack flexibility with regards to efficient requirements change management. This paper presents an effective rule-based tool for data reduction based on gradual granular data aggregation. With the proposed solution, data can be maintained at different levels of granularity. The solution is based on high-level data aggregation rules. Based on these rules, data aggregation code can be auto-generated. The solution is effective, easy-to-use and easy-to-maintain. In addition, the paper also demonstrates the use of the proposed tool based on a farming case study using standard database technologies. The results show productivity of the proposed tool-based solution in terms of initial development time, maintenance time and alteration time as compared to a hand-coded solution.

[1]  Anne Laurent,et al.  Multidimensional Data Stream Summarization Using Extended Tilted-Time Windows , 2009, 2009 International Conference on Advanced Information Networking and Applications Workshops.

[2]  Torben Bach Pedersen,et al.  Schema Design Alternatives for Multi-granular Data Warehousing , 2010, DEXA.

[3]  Aliou Boly,et al.  Forgetting data intelligently in data warehouses , 2007, 2007 IEEE International Conference on Research, Innovation and Vision for the Future.

[4]  Torben Bach Pedersen,et al.  An Embedded Database Application for the Aggregation of Farming Device Data , 2010 .

[5]  Christian S. Jensen,et al.  A foundation for vacuuming temporal databases , 2003, Data Knowl. Eng..

[6]  John F. Roddick,et al.  Schema Vacuuming in Temporal Databases , 2009, IEEE Transactions on Knowledge and Data Engineering.

[7]  Torben Bach Pedersen,et al.  Gradual Data Aggregation in Multi-granular Fact Tables on Resource-Constrained Systems , 2010, KES.

[8]  Yixin Chen,et al.  Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams , 2005, Distributed and Parallel Databases.

[9]  Nadeem Iftikhar Integration, aggregation and exchange of farming device data: A high level perspective , 2009, 2009 Second International Conference on the Applications of Digital Information and Web Technologies.

[10]  Alfredo Cuzzocrea,et al.  CAMS: OLAPing Multidimensional Data Streams Efficiently , 2009, DaWaK.

[11]  Torben Bach Pedersen,et al.  Specification-based data reduction in dimensional data warehouses , 2008, Inf. Syst..

[12]  Kjetil Nørvåg Granularity reduction in temporal document databases , 2006, Inf. Syst..

[13]  Dimitrios Gunopulos,et al.  Temporal and spatio-temporal aggregations over data streams using multiple time granularities , 2003, Inf. Syst..