Coupling learning of complex interactions

Abstract Complex applications such as big data analytics involve different forms of coupling relationships that reflect interactions between factors related to technical, business (domain-specific) and environmental (including socio-cultural and economic) aspects. There are diverse forms of couplings embedded in poor-structured and ill-structured data. Such couplings are ubiquitous, implicit and/or explicit, objective and/or subjective, heterogeneous and/or homogeneous, presenting complexities to existing learning systems in statistics, mathematics and computer sciences, such as typical dependency, association and correlation relationships. Modeling and learning such couplings thus is fundamental but challenging. This paper discusses the concept of coupling learning, focusing on the involvement of coupling relationships in learning systems. Coupling learning has great potential for building a deep understanding of the essence of business problems and handling challenges that have not been addressed well by existing learning theories and tools. This argument is verified by several case studies on coupling learning, including handling coupling in recommender systems, incorporating couplings into coupled clustering, coupling document clustering, coupled recommender algorithms and coupled behavior analysis for groups.

[1]  Longbing Cao,et al.  Coupled clustering ensemble: Incorporating coupling relationships both between base clusterings and objects , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[2]  Michael R. Lyu,et al.  SoRec: social recommendation using probabilistic matrix factorization , 2008, CIKM '08.

[3]  Robert Hooke,et al.  Uncertainty, Calibration and Probability. , 1974 .

[4]  Ute St. Clair,et al.  Fuzzy Set Theory: Foundations and Applications , 1997 .

[5]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[6]  Philip S. Yu,et al.  Link Mining: Models, Algorithms, and Applications , 2014, Link Mining.

[7]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[8]  Chengqi Zhang,et al.  Mining for combined association rules on multiple datasets , 2007, DDDM '07.

[9]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[10]  Saěso Dězeroski Relational Data Mining , 2001, Encyclopedia of Machine Learning and Data Mining.

[11]  Bracha Shapira,et al.  Recommender Systems Handbook , 2015, Springer US.

[12]  John Riedl,et al.  ClustKNN: A Highly Scalable Hybrid Model- & Memory-Based CF Algorithm , 2006 .

[13]  Jennifer Neville,et al.  Relational Dependency Networks , 2007, J. Mach. Learn. Res..

[14]  A. Arenas,et al.  Community analysis in social networks , 2004 .

[15]  R. Wardrop Simpson's Paradox and the Hot Hand in Basketball , 1995 .

[16]  Marcello Pelillo,et al.  Similarity-Based Pattern Analysis and Recognition , 2013, Advances in Computer Vision and Pattern Recognition.

[17]  Yang Gao,et al.  A Coupled Clustering Approach for Items Recommendation , 2013, PAKDD.

[18]  Longbing Cao,et al.  Model the complex dependence structures of financial variables by using canonical vine , 2012, CIKM.

[19]  Peter Wiemer-Hastings,et al.  Latent semantic analysis , 2004, Annu. Rev. Inf. Sci. Technol..

[20]  Chengqi Zhang,et al.  Combined Pattern Mining: From Learned Rules to Actionable Knowledge , 2008, Australasian Conference on Artificial Intelligence.

[21]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[22]  Cornelius Frank Dietrich,et al.  Uncertainty, calibration and probability: The statistics of scientific and industrial measurement , 1973 .

[23]  Xianggui Qu,et al.  Multivariate Data Analysis , 2007, Technometrics.

[24]  Hongjun Lu,et al.  Beyond intratransaction association analysis: mining multidimensional intertransaction association rules , 2000, TOIS.

[25]  Can Wang,et al.  Modeling and Analysis of Social Activity Process , 2012 .

[26]  Longbing Cao,et al.  Coupled Attribute Analysis on Numerical Data , 2013, IJCAI.

[27]  Veronique Kiermer,et al.  Six degrees of separation , 2006, Nature Methods.

[28]  Suresh Sood Book Review: Behavior Computing: Modeling, Analysis, Mining and Decision , 2012, IEEE Intell. Informatics Bull..

[29]  Maria L. Rizzo,et al.  Brownian distance covariance , 2009, 1010.0297.

[30]  Philip S. Yu,et al.  Coupled Behavior Analysis with Applications , 2012, IEEE Transactions on Knowledge and Data Engineering.

[31]  Philip S. Yu,et al.  Efficient Selection of Globally Optimal Rules on Large Imbalanced Data Based on Rule Coverage Relationship Analysis , 2013, SDM.

[32]  Xindong Wu,et al.  Coupled behavior analysis for capturing coupling relationships in group-based market manipulations , 2012, KDD.

[33]  Chengqi Zhang,et al.  Combined Mining: Discovering Informative Knowledge in Complex Data , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[34]  Longbing Cao,et al.  Combined mining: Analyzing object and pattern relations for discovering and constructing complex yet actionable patterns , 2013, WIREs Data Mining Knowl. Discov..

[35]  Eddy Mayoraz,et al.  Improved Pairwise Coupling Classification with Correcting Classifiers , 1998, ECML.

[36]  John Scott What is social network analysis , 2010 .

[37]  Chengqi Zhang,et al.  Mining Impact-Targeted Activity Patterns in Imbalanced Data , 2008, IEEE Transactions on Knowledge and Data Engineering.

[38]  Fangfang Li,et al.  CGMF: Coupled Group-Based Matrix Factorization for Recommender System , 2013, WISE.

[39]  Masud Mansuripur,et al.  Introduction to information theory , 1986 .

[40]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[41]  Michael A. Newton Introducing the discussion paper by Sz\'{e}kely and Rizzo , 2010 .

[42]  Michael R. Lyu,et al.  Learning to recommend with social trust ensemble , 2009, SIGIR.

[43]  A. John MINING GRAPH DATA , 2022 .

[44]  Andrew McCallum,et al.  Introduction to Statistical Relational Learning , 2007 .

[45]  Longbing Cao,et al.  Coupled nominal similarity in unsupervised learning , 2011, CIKM '11.

[46]  Guandong Xu,et al.  Personalized recommendation via cross-domain triadic factorization , 2013, WWW.

[47]  Longbing Cao,et al.  USpan: an efficient algorithm for mining high utility sequential patterns , 2012, KDD.

[48]  HanJiawei,et al.  Beyond intratransaction association analysis , 2000 .

[49]  Thomas L. Griffiths,et al.  Nonparametric Latent Feature Models for Link Prediction , 2009, NIPS.

[50]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[51]  Philip S. Yu,et al.  Detecting abnormal coupled sequences and sequence changes in group-based manipulative trading behaviors , 2010, KDD.

[52]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[53]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[54]  Guandong Xu,et al.  Cross-Domain Collaborative Filtering via Bilinear Multilevel Analysis , 2013, IJCAI.

[55]  Longbing Cao,et al.  Non-IIDness Learning in Behavioral and Social Data , 2014, Comput. J..

[56]  P. Holland Statistics and Causal Inference , 1985 .

[57]  Ruslan Salakhutdinov,et al.  Probabilistic Matrix Factorization , 2007, NIPS.

[58]  Gerhard Friedrich,et al.  Recommender Systems - An Introduction , 2010 .

[59]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[60]  Chih-Jen Lin,et al.  Probability Estimates for Multi-class Classification by Pairwise Coupling , 2003, J. Mach. Learn. Res..

[61]  Wei Cao,et al.  Deep Modeling of Group Preferences for Group-Based Recommendation , 2014, AAAI.

[62]  Longbing Cao,et al.  In-depth behavior understanding and use: The behavior informatics approach , 2010, Inf. Sci..

[63]  Longbing Cao,et al.  Modeling Asymmetry and Tail Dependence among Multiple Variables by Using Partial Regular Vine , 2014, SDM.

[64]  Harald Steck,et al.  Circle-based recommendation in online social networks , 2012, KDD.

[65]  Philip S. Yu,et al.  Domain Driven Data Mining , 2015 .

[66]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[67]  Maria L. Rizzo,et al.  Measuring and testing dependence by correlation of distances , 2007, 0803.4101.

[68]  Longbing Cao,et al.  Optimal Allocation of High Dimensional Assets through Canonical Vines , 2013, PAKDD.

[69]  John F. Roddick,et al.  Association mining , 2006, CSUR.

[70]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[71]  Chengqi Zhang,et al.  Ubiquitous Intelligence in Agent Mining , 2009, ADMI.

[72]  Longbing Cao,et al.  Coupled term-term relation analysis for document clustering , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[73]  Wei Cao,et al.  Coupled market behavior based financial crisis detection , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).