Using SAS for Mining Indirect Associations in Data
暂无分享,去创建一个
Association rules are often used to capture significant dependencies among attributes in a variety of datasets. A typical association rule for market basket data might be of the form, "Purchasers who buy an item x will very likely buy another item y ." We have introduced a new type of dependence relationship called indirect association. A pair of items, x and y, are indirectly associated if they rarely occur together in the dataset, and there exists a set Z such that the presence of x or y in a transaction is highly dependent on the occurrence of items in Z. Indirect association has found its applications in retail, textual and stock market domains. In this paper, we describe the implementation of a prototype system for discovering indirect associations using SAS. We have tested the prototype system using masked data from Hewlett-Packard's online purchase transactions to discover interesting product purchase patterns.
[1] Jian Pei,et al. Mining frequent patterns without candidate generation , 2000, SIGMOD '00.
[2] Tomasz Imielinski,et al. Database Mining: A Performance Perspective , 1993, IEEE Trans. Knowl. Data Eng..