Frequent pattern mining has been used in many applications of data mining. One of the reasons for the effectiveness of frequent pattern methods is that frequently occurring patterns can capture crucial aspects of the underlying semantics of the data. Thus, a good set of frequent patterns obtained from a data set can serve as a high level representation for the data. Hence, an interesting question is that of quantifying the similarity between sets of patterns. Such a similarity measure allows us to compare different data sets by comparing the sets of patterns mined from the data. In this paper we address this problem of quantifying similarity between two sets of patterns.
[1]
Avinash Achar,et al.
A unified view of the apriori-based algorithms for frequent episode discovery
,
2011,
Knowledge and Information Systems.
[2]
Shivakumar Sastry,et al.
Discovering compressing serial episodes from event sequences
,
2015,
Knowledge and Information Systems.
[3]
Tony Jebara,et al.
Probability Product Kernels
,
2004,
J. Mach. Learn. Res..
[4]
Siwei Lyu,et al.
A Kernel Between Unordered Sets of Data: The Gaussian Mixture Approach
,
2005,
ECML.
[5]
Jiawei Han,et al.
Mining Compressed Frequent-Pattern Sets
,
2005,
VLDB.