Generic Multilayer Network Data Analysis with the Fusion of Content and Structure

Multi-feature data analysis (e.g., on Facebook, LinkedIn) is challenging especially if one wants to do it efficiently and retain the flexibility by choosing features of interest for analysis. Features (e.g., age, gender, relationship, political view etc.) can be explicitly given from datasets, but also can be derived from content (e.g., political view based on Facebook posts). Analysis from multiple perspectives is needed to understand the datasets (or subsets of it) and to infer meaningful knowledge. For example, the influence of age, location, and marital status on political views may need to be inferred separately (or in combination). In this paper, we adapt multilayer network (MLN) analysis, a nontraditional approach, to model the Facebook datasets, integrate content analysis, and conduct analysis, which is driven by a list of desired application based queries. Our experimental analysis shows the flexibility and efficiency of the proposed approach when modeling and analyzing datasets with multiple features.

[1]  Jae-Gil Lee,et al.  Community Detection in Multi-Layer Graphs: A Survey , 2015, SGMD.

[2]  Mason A. Porter,et al.  Multilayer networks , 2013, J. Complex Networks.

[3]  Sanjukta Bhowmick,et al.  Holistic Analysis of Multi-source, Multi-feature Data: Modeling and Computation Challenges , 2017, BDA.

[4]  J. Bridge,et al.  Epidemiology of youth suicide and suicidal behavior , 2009, Current opinion in pediatrics.

[5]  Fabio Pianesi,et al.  Workshop on Computational Personality Recognition: Shared Task , 2013, Proceedings of the International AAAI Conference on Web and Social Media.

[6]  Iryna Gurevych,et al.  Lexical-semantic resources: yet powerful resources for automatic personality classification , 2017, GWC.

[7]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[8]  Pierre Azoulay,et al.  Age and High-Growth Entrepreneurship , 2018, American Economic Review: Insights.

[9]  Sanjukta Bhowmick,et al.  Efficient Community Re-creation in Multilayer Networks Using Boolean Operations , 2017, ICCS.

[10]  Dai Quoc Nguyen,et al.  NIHRIO at SemEval-2018 Task 3: A Simple and Accurate Neural Network Model for Irony Detection in Twitter , 2018, *SEMEVAL.

[11]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[12]  Massimiliano Zanin,et al.  Emergence of network features from multiplexity , 2012, Scientific Reports.

[13]  P. Mucha,et al.  Communities in multislice voting networks. , 2010, Chaos.

[14]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[15]  Zhiyuan Liu,et al.  A C-LSTM Neural Network for Text Classification , 2015, ArXiv.

[16]  Lili Jiang,et al.  Self-adaptive Privacy Concern Detection for User-generated Content , 2018, CICLing.

[17]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[18]  Mark Rowan,et al.  Observed Gender Differences in Privacy Concerns and Behaviors of Mobile Device End Users , 2014, EUSPN/ICTH.

[19]  Ludvig Bohlin,et al.  Community detection and visualization of networks with the map equation framework , 2014 .

[20]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[21]  Marilyn A. Walker,et al.  Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text , 2007, J. Artif. Intell. Res..

[22]  P. Costa,et al.  The revised NEO personality inventory (NEO-PI-R) , 2008 .

[23]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[24]  Z. Wang,et al.  The structure and dynamics of multilayer networks , 2014, Physics Reports.

[25]  Sanjukta Bhowmick,et al.  HUBify: Efficient Estimation of Central Entities Across Multiplex Layer Compositions , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[26]  Arash Heydarian Pashakhanlou Fully integrated content analysis in International Relations , 2017 .

[27]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.