An Efficient Hybrid Algorithm to Reduce Latency in Ad-Hoc Aggregation

A data warehouse is a collection of data gathered and organized so that it can easily be analyzed, extracted, synthesized and also be used for the purpose of further understanding data. Peer to Peer networks are used for distribution and sharing of documents. In traditional techniques, when aggregate functions like average, sum and count are encountered, the aggregate operation is performed by considering all the nodes and tuples, which reduces the efficiency of the query processing system. Exact solutions can be time consuming and difficult to implement, given the distributed and dynamic nature of P2P networks. The problem is overcome in this project by selecting random peers and random tuples from P2P networks and then performing the aggregation operation, thus the speed is increased and latency is reduced. Though accuracy is compromised to small extent, efficiency is achieved. Thus, this kind of approximate query processing will be beneficial to the areas where efficiency plays a main role than accuracy. Adaptive Hybrid approach based on random walk is used to achieve the efficiency in the performance of aggregation operation.

[1]  Deborah Estrin,et al.  Computing aggregates for monitoring wireless sensor networks , 2003, Proceedings of the First IEEE International Workshop on Sensor Network Protocols and Applications, 2003..

[2]  Divyakant Agrawal,et al.  Approximate Range Selection Queries in Peer-to-Peer Systems , 2003, CIDR.

[3]  Dimitrios Gunopulos,et al.  Efficient Approximate Query Processing in Peer-to-Peer Networks , 2007, IEEE Transactions on Knowledge and Data Engineering.

[4]  Surajit Chaudhuri,et al.  Dynamic sample selection for approximate query processing , 2003, SIGMOD '03.

[5]  Dan Suciu,et al.  What Can Peer-to-Peer Do for Databases, and Vice Versa? , 2001 .

[6]  Jared Saia,et al.  Choosing a random peer , 2004, PODC '04.

[7]  Johannes Gehrke,et al.  Gossip-based computation of aggregate information , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..