An adaptive memory conscious approach for mining frequent trees: implications for multi-core architectures
暂无分享,去创建一个
We consider the problem of frequent tree mining and present algorithms targeting emerging single-chip multiprocessor (CMP) architectures. We explore algorithmic designs that improve the memory performance of such algorithms, both in terms of alleviating latency to memory as well as in terms of reducing the off-chip traffic. We then explore adaptive task-parallel and data-parallel design strategies which facilitate effective parallelization even in the presence of data and workload skew while minimizing parallelization overheads. We show that our optimized algorithms achieve orders of magnitude improvement both in run time and memory usage, when compared to state-of-the-art algorithms. Also, we show that our adaptive parallelization strategy achieves near-linear speedups on a modern dual quad-core system.
[1] Shirish Tatikonda,et al. LCS-TRIM: Dynamic Programming Meets XML Indexing and Querying , 2007, VLDB.
[2] Shirish Tatikonda,et al. TRIPS and TIDES: new algorithms for tree mining , 2006, CIKM '06.
[3] Mohammed J. Zaki. Efficiently mining frequent trees in a forest , 2002, KDD.
[4] Tharam S. Dillon,et al. IMB3-Miner: Mining Induced/Embedded Subtrees by Constraining the Level of Embedding , 2006, PAKDD.