Grid-based Data Mining in Real-life Business Scenario

This paper presents a Grid-based distributed and parallel data mining system targeting a real-life application scenario typical in the business realm - franchise supermarket basket analysis. Following a layered design of three tiers, this system enables parallel association rule mining on a farm of Grid servers, offers a standard service interface for custom applications, and provides a friendly user portal. The work presented in this paper reveals specific requirements for applying Grid-based data mining in the business realm, which is helpful for the design and implementation of a generic Grid-based data mining system.

[1]  Ian Foster,et al.  The Globus toolkit , 1998 .

[2]  Anthony Rowe,et al.  Discovery net: towards a grid of knowledge discovery , 2002, KDD.

[3]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[4]  Mario Cannataro,et al.  The knowledge grid , 2003, CACM.

[5]  Jason Novotny,et al.  Data mining on NASA's Information Power Grid , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[6]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[7]  Toni Bollinger,et al.  Distributed and Parallel Data Mining on the Grid , 2004, ARCS Workshops.