论文信息 - A New Method to Find Top K Items in Data Streams at Arbitrary Time Granularities

A New Method to Find Top K Items in Data Streams at Arbitrary Time Granularities

Finding top K items in data streams means finding K items whose frequence are larger than other items in data streams. There are some methods to find most frequent K items in the whole data streams, but they can't be used in arbitrary time interval. This paper proposes a new method-MMF(K)_MS to find most frequent K items based on Hierarchical Synopsis. MMF(K)_MS supports query in arbitrary time interval through using HFVN framework with variable number of node in every layer and using Count Stretch data structure to maintain Synopsis in each layer. At Last, Proving MMF(K)_MS rational and available by experiment.

Huahui Chen | Pingda Shu

[1] Won Suk Lee,et al. A Sliding Window Method for Finding Recently Frequent Itemsets over Online Data Streams , 2004, J. Inf. Sci. Eng..

[2] Suh-Yin Lee,et al. An Efficient Algorithm for Mining Frequent Itemests over the Entire History of Data Streams , 2004 .

[3] Dong Yi-sheng,et al. Mining Frequent Closed Patterns from a Sliding Window over Data Streams , 2006 .

[4] Moses Charikar,et al. Finding frequent items in data streams , 2004, Theor. Comput. Sci..