Stream Mining of Frequent Patterns from Delayed Batches of Uncertain Data

Streams of data can be continuously generated by sensors in various real-life applications such as environment surveillance. Partially due to the inherited limitation of the sensors, data in these streams can be uncertain. To discover useful knowledge in the form of frequent patterns from streams of uncertain data, a few algorithms have been developed. They mostly use the sliding window model for processing and mining data streams. However, for some applications, other stream processing models such as the time-fading model are more appropriate. Moreover, batches of data in the stream may be delayed and not arrived in the intended order. In this paper, we propose mining algorithms that use the time-fading model to mine frequent patterns when these batches in the streams of uncertain data were delayed and arrived out of order.

[1]  Alfredo Cuzzocrea,et al.  Discovering Frequent Patterns from Uncertain Data Streams with Time-Fading and Landmark Models , 2013, Trans. Large Scale Data Knowl. Centered Syst..

[2]  Carson Kai-Sang Leung,et al.  Frequent Pattern Mining from Time-Fading Streams of Uncertain Data , 2011, DaWaK.

[3]  Hongjun Lu,et al.  False Positive or False Negative: Mining Frequent Itemsets from High Speed Transactional Data Streams , 2004, VLDB.

[4]  Longbing Cao,et al.  Mining Frequent Patterns from Human Interactions in Meetings Using Directed Acyclic Graphs , 2013, PAKDD.

[5]  Carson Kai-Sang Leung,et al.  Mining uncertain data , 2011, WIREs Data Mining Knowl. Discov..

[6]  Carson Kai-Sang Leung,et al.  Mining Popular Patterns from Transactional Databases , 2012, DaWaK.

[7]  Philip S. Yu,et al.  Mining Frequent Patterns in Data Streams at Multiple Time Granularities , 2002 .

[8]  Charu C. Aggarwal,et al.  Frequent pattern mining with uncertain data , 2009, KDD.

[9]  Carson Kai-Sang Leung,et al.  Mining probabilistic datasets vertically , 2012, IDEAS '12.

[10]  Nan Jiang,et al.  Research issues in data stream association rule mining , 2006, SGMD.

[11]  Carson Kai-Sang Leung,et al.  A Tree-Based Approach for Frequent Pattern Mining from Uncertain Data , 2008, PAKDD.

[12]  Carson Kai-Sang Leung,et al.  Finding Diverse Friends in Social Networks , 2013, APWeb.

[13]  Mengchi Liu,et al.  A Fast Algorithm for Frequent Itemset Mining Using Patricia* Structures , 2012, DaWaK.

[14]  Dan Zhang,et al.  TidFP: Mining Frequent Patterns in Different Databases with Transaction ID , 2009, DaWaK.

[15]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[16]  Carson Kai-Sang Leung,et al.  PUF-Tree: A Compact Tree Structure for Frequent Pattern Mining of Uncertain Data , 2013, PAKDD.

[17]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[18]  Carson Kai-Sang Leung,et al.  Mining of Frequent Itemsets from Streams of Uncertain Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[19]  Kyuseok Shim,et al.  Web Technologies and Applications , 2014, Lecture Notes in Computer Science.

[20]  Abdelkader Hameurlain,et al.  Transactions on Large-Scale Data- and Knowledge-Centered Systems VIII , 2013, Lecture Notes in Computer Science.

[21]  Toon Calders,et al.  Efficient Pattern Mining of Uncertain Data with Sampling , 2010, PAKDD.

[22]  Alfredo Cuzzocrea,et al.  Vertical Frequent Pattern Mining from Uncertain Data , 2012, KES.

[23]  Gillian Dobbie,et al.  Rare Pattern Mining on Data Streams , 2012, DaWaK.

[24]  Carson Kai-Sang Leung,et al.  Mining Frequent Patterns from Uncertain Data with MapReduce for Big Data Analytics , 2013, DASFAA.

[25]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.