Detecting community patterns capturing exceptional link trails

We present a new method for detecting descriptive community patterns that capture exceptional (sequential) link trails. To this end, we provide a novel problem formalization: we model sequential data as first-order Markov chain models, which are mapped to an attributed weighted network represented as a graph. We then detect subgraphs (communities) using exceptional model mining techniques: we target subsets of sequential transitions between nodes that are exceptional in the sense that they either conform strongly to a specific reference or deviate significantly from it, as estimated by a quality measure. Each such community is described by a community pattern composed of descriptive features (of the attributed graph) that cover the respective community. We present a comprehensive modeling approach and discuss the results of a case study analyzing data from two real-world social networks.
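
The idea of scoring a subset of transitions against a reference can be illustrated with a minimal sketch. It is not the method of the paper: the function names, the toy trails, the attachment of attributes to whole trails, and the use of the mean total variation distance as a quality measure are all assumptions made for illustration only.

```python
from collections import Counter


def transition_counts(trails):
    """Count first-order transitions (src, dst) over a list of node sequences."""
    counts = Counter()
    for trail in trails:
        for src, dst in zip(trail, trail[1:]):
            counts[(src, dst)] += 1
    return counts


def row_normalize(counts):
    """Turn transition counts into first-order Markov transition probabilities."""
    row_totals = Counter()
    for (src, _), c in counts.items():
        row_totals[src] += c
    return {(s, d): c / row_totals[s] for (s, d), c in counts.items()}


def deviation(sub_counts, ref_counts):
    """Hypothetical quality measure: mean total variation distance between the
    subgroup's and the reference's transition rows, taken over the source
    nodes that occur in the subgroup."""
    p = row_normalize(sub_counts)
    q = row_normalize(ref_counts)
    sources = {s for (s, _) in sub_counts}
    total = 0.0
    for s in sources:
        dsts = {d for (src, d) in list(p) + list(q) if src == s}
        total += 0.5 * sum(abs(p.get((s, d), 0.0) - q.get((s, d), 0.0)) for d in dsts)
    return total / len(sources) if sources else 0.0


# Toy data (hypothetical): each trail carries descriptive attributes.
data = [
    ({"role": "student"}, ["A", "B", "C"]),
    ({"role": "student"}, ["A", "B", "A"]),
    ({"role": "staff"}, ["A", "C", "B"]),
    ({"role": "staff"}, ["A", "C", "A"]),
]

# Reference model: transitions of all trails.
reference = transition_counts(trail for _, trail in data)

# Candidate pattern "role = student": transitions of the covered trails only.
covered = transition_counts(t for attrs, t in data if attrs["role"] == "student")

score = deviation(covered, reference)
print(score)
```

A higher score marks a pattern whose covered transitions deviate more from the reference; a score near zero marks a pattern that conforms to it. An actual exceptional model mining approach would search the space of attribute patterns and use a quality measure that also accounts for subgroup size and statistical significance.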
