Activity data accumulated in real life, such as terrorist activities and governmental customer contacts, present special structural and semantic complexities. Activity data may lead to or be associated with significant business impacts, and result in important actions and decision making leading to business advantage. For instance, a series of terrorist activities may trigger a disaster to society, and large amounts of fraudulent activities in social security programs may result in huge government customer debt. Uncovering these activities or activity sequences can greatly evidence and/or enhance corresponding actions in business decisions. However, mining such data challenges the existing KDD research in aspects such as unbalanced data distribution and impact-targeted pattern mining. This paper investigates the characteristics and challenges of activity data, and the methodologies and tasks of activity mining based on case-study experience in the area of social security. Activity mining aims to discover high impact activity patterns in huge volumes of unbalanced activity transactions. Activity patterns identified can be used to prevent disastrous events or improve business decision making and processes. We illustrate the above issues and prospects in mining governmental customer contacts data to recover customer debt.
[1]
David Taniar,et al.
Domain-Driven, Actionable Knowledge Discovery
,
2007,
IEEE Intelligent Systems.
[2]
Wil M. P. van der Aalst,et al.
Process mining: a research agenda
,
2004,
Comput. Ind..
[3]
Zhengxin Chen,et al.
From Data Mining to Behavior Mining
,
2006,
Int. J. Inf. Technol. Decis. Mak..
[4]
Xindong Wu,et al.
10 Challenging Problems in Data Mining Research
,
2006,
Int. J. Inf. Technol. Decis. Mak..
[5]
Markus Hammori,et al.
Interactive workflow mining - requirements, concepts and implementation
,
2006,
Data Knowl. Eng..
[6]
Chengqi Zhang,et al.
Domain-Driven Data Mining: A Practical Methodology
,
2006,
Int. J. Data Warehous. Min..
[7]
Abraham Silberschatz,et al.
What Makes Patterns Interesting in Knowledge Discovery Systems
,
1996,
IEEE Trans. Knowl. Data Eng..
[8]
Michael J. Pazzani,et al.
A Computational Theory of Learning Causal Relationships
,
1991,
Cogn. Sci..