Modeling interactions based on consistent patterns

Providers of Web based services are interested in monitoring the usage of their services in combination with those of other providers. The identification of services frequently accessed together may be valuable as a basis for strategic collaboration among their owners. We propose data mining to discover services of different providers which could complement one another, based on their usage. In particular we model the activities of a user as a sequence of service invocations recorded in a log, on which pattern discovery techniques can be applied. However we claim that conventional sequence mining is not adequate for this type of application. This is because, conventional mining concentrates on frequent (or infrequent) patterns of access, while we also require a notion of the consistency of these access patterns as a basis for collaboration. We present a model for constructing patterns that depict consistently used sequences of activities. This model is general enough to be applied to any system of autonomous entities, where relationships between entities are dynamic. For testing our model, we have analyzed the behavior of users in a news group, in order to determine consistent patterns in the way users respond to questions posed to the group.

[1]  Ke Wang,et al.  Discovering Patterns from Large and Dynamic Sequential Data , 1997, Journal of Intelligent Information Systems.

[2]  Kevin Crowston,et al.  The interdisciplinary study of coordination , 1994, CSUR.

[3]  Michele H. Jackson Assessing the Structure of Communication on the World Wide Web , 2006, J. Comput. Mediat. Commun..

[4]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[5]  Jaideep Srivastava,et al.  Grouping Web page references into transactions for mining World Wide Web browsing patterns , 1997, Proceedings 1997 IEEE Knowledge and Data Engineering Exchange Workshop.

[6]  Mohammed J. Zaki,et al.  PlanMine: Sequence Mining for Plan Failures , 1998, KDD.

[7]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[8]  Sunita Sarawagi,et al.  Mining Surprising Patterns Using Temporal Description Length , 1998, VLDB.

[9]  Myra Spiliopoulou,et al.  The Laborious Way From Data Mining to Web Log Mining , 1999 .

[10]  Michael H. Goldhaber,et al.  The Attention Economy and the Net , 1997, First Monday.

[11]  Caroline Haythornthwaite,et al.  A Social Network Study of the Growth of Community Among Distance Learners , 1998, Inf. Res..

[12]  Myra Spiliopoulou,et al.  WUM: A tool for Web Utilization analysis , 1999 .

[13]  Christos Faloutsos,et al.  Efficient Similarity Search In Sequence Databases , 1993, FODO.

[14]  Jaideep Srivastava,et al.  Data Preparation for Mining World Wide Web Browsing Patterns , 1999, Knowledge and Information Systems.

[15]  Tad Hogg,et al.  Communities of practice: Performance and evolution , 1994, Comput. Math. Organ. Theory.

[16]  Heikki Mannila,et al.  Discovering Generalized Episodes Using Minimal Occurrences , 1996, KDD.