Coupled Behavior Analysis with Applications

Coupled behaviors refer to the activities of one to many actors who are associated with each other in terms of certain relationships. With increasing network and community-based events and applications, such as group-based crime and social network interactions, behavior coupling contributes to the causes of eventual business problems. Effective approaches for analyzing coupled behaviors are not available, since existing methods mainly focus on individual behavior analysis. This paper discusses the problem of Coupled Behavior Analysis (CBA) and its challenges. A Coupled Hidden Markov Model (CHMM)-based approach is illustrated to model and detect abnormal group-based trading behaviors. The CHMM models cater for: 1) multiple behaviors from a group of people, 2) behavioral properties, 3) interactions among behaviors, customers, and behavioral properties, and 4) significant changes between coupled behaviors. We demonstrate and evaluate the models on order-book-level stock tick data from a major Asian exchange and demonstrate that the proposed CHMMs outperforms HMM-only for modeling a single sequence or combining multiple single sequences, without considering coupling relationships to detect anomalies. Finally, we discuss interaction relationships and modes between coupled behaviors, which are worthy of substantial study.

[1]  Sanjay Ranka,et al.  Statistical change detection for multi-dimensional data , 2007, KDD '07.

[2]  Ka Keung Lee,et al.  Human behavior learning and transfer , 2005 .

[3]  Carl D. Cheney,et al.  Behavior Analysis and Learning , 1998 .

[4]  Chengqi Zhang,et al.  Efficient Mining of Event-Oriented Negative Sequential Rules , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[5]  Xiaoli Li,et al.  Learning to Classify Texts Using Positive and Unlabeled Data , 2003, IJCAI.

[6]  Johannes Gehrke,et al.  Sequential PAttern mining using a bitmap representation , 2002, KDD.

[7]  Andrew McCallum,et al.  Introduction to Statistical Relational Learning , 2007 .

[8]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[9]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Qiming Chen,et al.  PrefixSpan,: mining sequential patterns efficiently by prefix-projected pattern growth , 2001, Proceedings 17th International Conference on Data Engineering.

[11]  Kevin Chen-Chuan Chang,et al.  PEBL: positive example based learning for Web page classification using SVM , 2002, KDD.

[12]  Shai Ben-David,et al.  Detecting Change in Data Streams , 2004, VLDB.

[13]  Chengqi Zhang,et al.  Combined Mining: Discovering Informative Knowledge in Complex Data , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14]  Xiaohui Liu,et al.  Variable grouping in multivariate time series via correlation , 2001, IEEE Trans. Syst. Man Cybern. Part B.

[15]  Jerold B. Warner,et al.  Using daily stock returns: The case of event studies , 1985 .

[16]  You-Jin Park,et al.  Individual and group behavior-based customer profile model for personalized product recommendation , 2009, Expert Syst. Appl..

[17]  Daniel R. Ilgen,et al.  Computational Modeling of Behavior in Organizations: The Third Scientific Discipline , 2000 .

[18]  Nikos Mamoulis,et al.  Discovery of Periodic Patterns in Spatiotemporal Sequences , 2007, IEEE Transactions on Knowledge and Data Engineering.

[19]  Eamonn J. Keogh,et al.  Clustering of time-series subsequences is meaningless: implications for previous and future research , 2004, Knowledge and Information Systems.

[20]  Chellu Chandra Sekhar,et al.  A density based method for multivariate time series clustering in kernel feature space , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[21]  Cyrus Shahabi,et al.  Feature subset selection and feature ranking for multivariate time series , 2005, IEEE Transactions on Knowledge and Data Engineering.

[22]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[23]  Philip S. Yu,et al.  Detecting abnormal coupled sequences and sequence changes in group-based manipulative trading behaviors , 2010, KDD.

[24]  D. Seborg,et al.  Clustering multivariate time‐series data , 2005 .

[25]  D. Hinkle,et al.  Applied statistics for the behavioral sciences , 1979 .

[26]  Raj Bhatnagar,et al.  Discovery of Temporal Dependencies between Frequent Patterns in Multivariate Time Series , 2007, 2007 IEEE Symposium on Computational Intelligence and Data Mining.

[27]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[28]  Kristian Kersting,et al.  Boosting Relational Sequence Alignments , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[29]  Longbing Cao,et al.  Agent Mining: The Synergy of Agents and Data Mining , 2009, IEEE Intelligent Systems.

[30]  Tad Hogg,et al.  Diversity of online community activities , 2008, Hypertext.

[31]  Milos Hauskrecht,et al.  Multivariate Time Series Classification with Temporal Abstractions , 2009, FLAIRS.

[32]  Huan Liu,et al.  Social Computing, Behavioral Modeling, and Prediction , 2008 .

[33]  Philip S. Yu,et al.  Behavior Informatics: An Informatics Perspective for Behavior Studies , 2009, IEEE Intell. Informatics Bull..

[34]  Longbing Cao,et al.  In-depth behavior understanding and use: The behavior informatics approach , 2010, Inf. Sci..

[35]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[36]  Alexander Zien,et al.  Semi-Supervised Learning , 2006 .

[37]  Chengqi Zhang,et al.  Mining Impact-Targeted Activity Patterns in Imbalanced Data , 2008, IEEE Transactions on Knowledge and Data Engineering.