Dynamic load balancing of large-scale distributed association rule mining

The focus of this paper is to propose a dynamic load balancing strategy for parallel association rule mining algorithms in the context of a Grid computing environment. This strategy is built upon a distributed model which necessitates small overheads in the communication costs for load updates and for both data and work transfers. It also supports the heterogeneity of the system and it is fault tolerant.

[1]  María S. Pérez-Hernández,et al.  Design and implementation of a data mining grid-aware architecture , 2007, Future Gener. Comput. Syst..

[2]  R. V. van Nieuwpoort,et al.  The Grid 2: Blueprint for a New Computing Infrastructure , 2003 .

[3]  J. D. Teresco,et al.  New challanges in dynamic load balancing , 2005 .

[4]  Zhiling Lan,et al.  A Survey of Load Balancing in Grid Computing , 2004, CIS.

[5]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[6]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[7]  Rakesh Agrawal,et al.  Parallel Mining of Association Rules , 1996, IEEE Trans. Knowl. Data Eng..

[8]  Walter A. Kosters,et al.  Apriori, A Depth First Implementation , 2003, FIMI.

[9]  Mohammed J. Zaki Parallel and distributed association mining: a survey , 1999, IEEE Concurr..

[10]  Srinivasan Parthasarathy,et al.  New Algorithms for Fast Discovery of Association Rules , 1997, KDD.

[11]  Fabrizio Silvestri,et al.  A Scalable Multi-Strategy Algorithm for Counting Frequent Sets , 2002 .

[12]  Anthony P. Reeves,et al.  Strategies for Dynamic Load Balancing on Highly Parallel Computers , 1993, IEEE Trans. Parallel Distributed Syst..

[13]  Franck Cappello,et al.  Grid'5000: a large scale and highly reconfigurable grid experimental testbed , 2005, The 6th IEEE/ACM International Workshop on Grid Computing, 2005..

[14]  Bernard Toursel,et al.  Distributed Data Mining , 2001, Scalable Comput. Pract. Exp..

[15]  Karen D. Devinea,et al.  New Challenges in Dynamic Load Balancing , 2004 .

[16]  Thomas L. Casavant,et al.  A Taxonomy of Scheduling in General-Purpose Distributed Computing Systems , 1988, IEEE Trans. Software Eng..

[17]  Ke Wang,et al.  Top Down FP-Growth for Association Rule Mining , 2002, PAKDD.