A Running Pattern Recognition Algorithm for Publish/Subscribe Distributed Systems

Publish/subscribe distributed systems are often used in critical applications. It is necessary to monitor their running patterns in real time to detect abnormal status. Therefore, identifying the normal running pattern is the precondition of monitoring publish/subscribe distributed systems. Based on Apriori algorithm, this paper presents a weighted frequent itemset mining algorithm for running pattern recognition of publish/subscribe distributed systems. By introducing the transaction matrix, the algorithm only needs to scan the transaction database once. By weighting the items from two aspects of influence and frequency, the support of the items with few occurrences but much importance can be improved, so that the running pattern containing small frequency events can be mined out. Experimental results show that the algorithm can effectively mine the running patterns, and has better performance than Apriori algorithm and FP-growth algorithm.

[1]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[2]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[3]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules and sequential patterns , 1996 .

[4]  Gerardo Pardo-Castellote,et al.  OMG Data-Distribution Service: architectural overview , 2003, 23rd International Conference on Distributed Computing Systems Workshops, 2003. Proceedings..

[5]  Cong Wang,et al.  A Matrix-Based Apriori Algorithm Improvement , 2018, 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC).

[6]  Yang Wei Discovery of Weighted Association Rules in Databases , 2001 .

[7]  Manziba Akanda Nishi,et al.  An efficient approach to mine flexible periodic patterns in time series databases , 2015, Eng. Appl. Artif. Intell..

[8]  Jian Pei,et al.  Pattern-Growth Methods , 2014, Frequent Pattern Mining.