An Efficient Algorithm for Real-Time Frequent Pattern Mining for Real-Time Business Intelligence Analytics

Finding frequent patterns from databases has been the most time consuming process in data mining tasks, like association rule mining. Frequent pattern mining in real-time is of increasing thrust in many business applications such as e-commerce, recommender systems, and supply-chain management and group decision support systems, to name a few. A plethora of efficient algorithms have been proposed till date, among which, vertical mining algorithms have been found to be very effective, usually outperforming the horizontal ones. However, with dense datasets, the performances of these algorithms significantly degrade. Moreover, these algorithms are not suited to respond to the real-time need. In this paper, we describe BDFS(b)-diff-sets, an algorithm to perform real-time frequent pattern mining using diff-sets and limited computing resources. Empirical evaluations show that our algorithm can make a fair estimation of the probable frequent patterns and reaches some of the longest frequent patterns much faster than the existing algorithms.

[1]  Chun Zhang,et al.  Storing and querying ordered XML using a relational database system , 2002, SIGMOD '02.

[2]  Bart Goethals,et al.  Survey on Frequent Pattern Mining , 2003 .

[3]  Ulrich Güntzer,et al.  Algorithms for association rule mining — a general survey and comparison , 2000, SKDD.

[4]  Salvatore J. Stolfo,et al.  Real time data mining-based intrusion detection , 2001, Proceedings DARPA Information Survivability Conference and Exposition II. DISCEX'01.

[5]  Dimitris N. Chorafas The Real-Time Enterprise , 2004 .

[6]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[7]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD '00.

[8]  Paul S. Bradley,et al.  Mathematical Programming for Data Mining: Formulations and Challenges , 1999, INFORMS J. Comput..

[9]  Bart Goethals,et al.  Memory issues in frequent itemset mining , 2004, SAC '04.

[10]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[11]  Katya Scheinberg,et al.  Efficient SVM Training Using Low-Rank Kernel Representations , 2002, J. Mach. Learn. Res..

[12]  Eric Bibelnieks,et al.  Optimizing Customer Mail Streams at Fingerhut , 2001, Interfaces.

[13]  Robert L. Grossman,et al.  Data Mining for Scientific and Engineering Applications , 2001, Massive Computing.

[14]  Mohammed J. Zaki,et al.  Fast vertical mining using diffsets , 2003, KDD '03.

[15]  Hongjun Lu,et al.  Mining the Customer's Up-To-Moment Preferences for E-commerce Recommendation , 2003, PAKDD.

[16]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[17]  Asim K. Pal,et al.  A high-performance limited-memory admissible and real time search algorithm for networks , 1992 .

[18]  A. K. Pujari,et al.  Data Mining Techniques , 2006 .

[19]  Wen-Yang Lin,et al.  CBW: an efficient algorithm for frequent itemset mining , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[20]  Ambuj Mahanti,et al.  An Efficient Technique for Frequent Pattern Mining in Real-Time Business Applications , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[21]  Gregory Piatetsky-Shapiro,et al.  Advances in Knowledge Discovery and Data Mining , 2004, Lecture Notes in Computer Science.

[22]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[23]  David C. Yen,et al.  Data mining techniques for customer relationship management , 2002 .

[24]  Nils J. Nilsson,et al.  Artificial Intelligence: A New Synthesis , 1997 .

[25]  Henk Sol,et al.  Proceedings of the 54th Hawaii International Conference on System Sciences , 1997, HICSS 2015.

[26]  Devavrat Shah,et al.  Turbo-charging vertical mining of large databases , 2000, SIGMOD 2000.

[27]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[28]  Ramasamy Uthurusamy,et al.  EVOLVING DATA MINING INTO SOLUTIONS FOR INSIGHTS , 2002 .