ExEm: Expert Embedding using dominating set theory with deep learning approaches

A collaborative network is a social network that is comprised of experts who cooperate with each other to fulfill a special goal. Analyzing the graph of this network yields meaningful information about the expertise of these experts and their subject areas. To perform the analysis, graph embedding techniques have emerged as a promising tool. Graph embedding attempts to represent graph nodes as low-dimensional vectors. In this paper, we propose a graph embedding method, called ExEm, which using dominating-set theory and deep learning approaches. In the proposed method, the dominating set theory is applied to the collaborative network and dominating nodes of this network are found. After that, a set of random walks is created which starts from dominating nodes (experts). The main condition for constricting these random walks is the existence of another dominating node. After making the walks that satisfy the stated conditions, they are stored as a sequence in a corpus. In the next step, the corpus is fed to the SKIP-GRAM neural network model. Word2vec, fastText and their combination are employed to train the neural network of the SKIP-GRAM model. Finally, the result is the low dimensional vectors of experts, called expert embeddings. Expert embeddings can be used for various purposes including accurately modeling experts' expertise or computing experts' scores in expert recommendation systems. Hence, we also introduce a novel strategy to calculate experts' scores by using the extracted expert embedding vectors. The effectiveness of ExEm is validated through assessing its performance on multi-label classification, link prediction, and recommendation tasks. We conduct extensive experiments on common datasets. Moreover in this study, we present data related to a co-author network formed by crawling the vast author profiles from Scopus.

[1]  Jian Pei,et al.  Asymmetric Transitivity Preserving Graph Embedding , 2016, KDD.

[2]  Li Guo,et al.  SSE: Semantically Smooth Embedding for Knowledge Graphs , 2017, IEEE Transactions on Knowledge and Data Engineering.

[3]  Tim Roughgarden,et al.  CS167: Reading in Algorithms Counting Triangles , 2014 .

[4]  Feiping Nie,et al.  Nonlinear Dimensionality Reduction with Local Spline Embedding , 2009, IEEE Transactions on Knowledge and Data Engineering.

[5]  Arif Mahmood,et al.  Using Geodesic Space Density Gradients for Network Community Detection , 2017, IEEE Transactions on Knowledge and Data Engineering.

[6]  Dorothea Wagner,et al.  Finding, Counting and Listing All Triangles in Large Graphs, an Experimental Study , 2005, WEA.

[7]  Britta Ruhnau,et al.  Eigenvector-centrality - a node-centrality? , 2000, Soc. Networks.

[8]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[9]  Jure Leskovec,et al.  Representation Learning on Graphs: Methods and Applications , 2017, IEEE Data Eng. Bull..

[10]  Ivor W. Tsang,et al.  A Unified Feature Selection Framework for Graph Embedding on High Dimensional Data , 2015, IEEE Transactions on Knowledge and Data Engineering.

[11]  eon BottouAT Stochastic Gradient Learning in Neural Networks , 2022 .

[12]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[13]  Ding-Zhu Du,et al.  Connected Dominating Set: Theory and Applications , 2012 .

[14]  Yueting Zhuang,et al.  Expert Finding for Community-Based Question Answering via Ranking Metric Network Learning , 2016, IJCAI.

[15]  Wei Zhang,et al.  Dynamic Graph Representation Learning via Self-Attention Networks , 2018, ArXiv.

[16]  Sampo Pyysalo,et al.  Neural networks for link prediction in realistic biomedical graphs: a multi-dimensional evaluation of graph embedding-based approaches , 2018, BMC Bioinformatics.

[17]  Tomas Mikolov,et al.  Bag of Tricks for Efficient Text Classification , 2016, EACL.

[18]  Zhendong Mao,et al.  Knowledge Graph Embedding: A Survey of Approaches and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[19]  Ryan A. Rossi,et al.  The Network Data Repository with Interactive Graph Analytics and Visualization , 2015, AAAI.

[20]  Qiongkai Xu,et al.  GraRep: Learning Graph Representations with Global Structural Information , 2015, CIKM.

[21]  Jatinder Singh,et al.  Finding Communities in Social Networks with Node Attribute and Graph Structure using Jaya Optimization Algorithm , 2018 .

[22]  Boleslaw K. Szymanski,et al.  Dominating Scale-Free Networks Using Generalized Probabilistic Methods , 2014, Scientific reports.

[23]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[24]  Fadi Dornaika,et al.  Joint Graph Based Embedding and Feature Weighting for Image Classification , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[25]  Wenwu Zhu,et al.  Structural Deep Network Embedding , 2016, KDD.

[26]  Palash Goyal,et al.  Graph Embedding Techniques, Applications, and Performance: A Survey , 2017, Knowl. Based Syst..

[27]  Evangelos E. Milios,et al.  A multi-centrality index for graph-based keyword extraction , 2019, Inf. Process. Manag..

[28]  Kevin Chen-Chuan Chang,et al.  A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications , 2017, IEEE Transactions on Knowledge and Data Engineering.

[29]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[30]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[31]  Kara Dolinski,et al.  The BioGRID Interaction Database: 2008 update , 2008, Nucleic Acids Res..

[32]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[33]  Xiangnan He,et al.  Attributed Social Network Embedding , 2017, IEEE Transactions on Knowledge and Data Engineering.

[34]  Jie Wu,et al.  Extended Dominating Set and Its Applications in Ad Hoc Networks Using Cooperative Communication , 2006, IEEE Transactions on Parallel and Distributed Systems.

[35]  Chengqi Zhang,et al.  Tri-Party Deep Network Representation , 2016, IJCAI.

[36]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.

[37]  Wei Lu,et al.  Deep Neural Networks for Learning Graph Representations , 2016, AAAI.

[38]  Palash Goyal,et al.  dyngraph2vec: Capturing Network Dynamics using Dynamic Graph Representation Learning , 2018, Knowl. Based Syst..

[39]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[40]  Bolun Chen,et al.  Link Prediction on Directed Networks Based on AUC Optimization , 2018, IEEE Access.

[41]  Pascal Poupart,et al.  Relational Representation Learning for Dynamic (Knowledge) Graphs: A Survey , 2019, ArXiv.

[42]  Xiaoming Fu,et al.  Building and Analyzing a Global Co-Authorship Network Using Google Scholar Data , 2017, WWW.

[43]  Dong Zhang,et al.  Interpreting the formation of co-author networks via utility analysis , 2017, Inf. Process. Manag..

[44]  M. Balafar,et al.  The state-of-the-art in expert recommendation systems , 2019, Eng. Appl. Artif. Intell..

[45]  Vishal Bhatnagar,et al.  Data Preprocessing for Dynamic Social Network Analysis , 2013 .

[46]  Aijun An,et al.  dynnode2vec: Scalable Dynamic Network Embedding , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[47]  Jie Wu,et al.  Local construction of connected dominating sets in wireless ad hoc networks , 2005 .

[48]  Jian Pei,et al.  A Survey on Network Embedding , 2017, IEEE Transactions on Knowledge and Data Engineering.

[49]  Chengqi Zhang,et al.  Network Representation Learning: A Survey , 2017, IEEE Transactions on Big Data.

[50]  Masoud Asadpour,et al.  Community Aware Random Walk for Network Embedding , 2017, Knowl. Based Syst..

[51]  Pascal Poupart,et al.  Representation Learning for Dynamic Graphs: A Survey , 2020, J. Mach. Learn. Res..

[52]  Xindong Wu,et al.  Detecting and Assessing Anomalous Evolutionary Behaviors of Nodes in Evolving Social Networks , 2019, ACM Trans. Knowl. Discov. Data.

[53]  Aynaz Taheri,et al.  Learning to Represent the Evolution of Dynamic Graphs with Recurrent Models , 2019, WWW.

[54]  Zi Huang,et al.  Self-taught dimensionality reduction on the high-dimensional small-sized data , 2013, Pattern Recognit..

[55]  Weiguo Fan,et al.  ExpertRank: A topic-aware expert finding algorithm for online knowledge communities , 2013, Decis. Support Syst..

[56]  Jingyuan Zhang,et al.  Knowledge Graph Embedding Based Question Answering , 2019, WSDM.

[57]  Guangyuan Fu,et al.  A new method to construct co-author networks , 2015 .

[58]  Jian Pei,et al.  High-Order Proximity Preserved Embedding for Dynamic Networks , 2018, IEEE Transactions on Knowledge and Data Engineering.