Link Prediction in Social Network Using Co-clustering Based Approach

This paper introduces an approach to derive whether an individual is related to an item or not. In our approach, the well-known DBLP dataset is used and we try to find some skills that are related to an author that we were not aware of before. To realize our objective, we cluster authors and skills using Spectral Graph Clustering algorithm, then simultaneously obtain user and movie clusters via Bipartite Graph (Bigraph) Spectral Co-clustering approach, and then generate predictions based on the outputs of clustering and co-clustering steps. Accordingly, we utilize clustering and co-clustering advantages to predict the probability of link existing between an author and a skill. Experimental results on DBLP dataset show that our approach works well in the specified task.

[1]  Arindam Banerjee,et al.  Bayesian Co-clustering , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[2]  John Riedl,et al.  GroupLens: an open architecture for collaborative filtering of netnews , 1994, CSCW '94.

[3]  Ting Liu,et al.  A Two-Phase Spectral Bigraph Co-clustering Approach for the “Who Rated What” Task in KDD Cup 2007 , 2007 .

[4]  Carla E. Brodley,et al.  Solving cluster ensemble problems by bipartite graph partitioning , 2004, ICML.

[5]  Benjamin Auffarth Spectral Graph Clustering , 2007 .

[6]  Sahin Albayrak,et al.  The Link Prediction Problem in Bipartite Networks , 2010, IPMU.

[7]  Inderjit S. Dhillon,et al.  Information-theoretic co-clustering , 2003, KDD '03.

[8]  U. Feige,et al.  Spectral Graph Theory , 2015 .

[9]  Ben Taskar,et al.  Link Prediction in Relational Data , 2003, NIPS.

[10]  William E. Winkler,et al.  Advanced Methods For Record Linkage , 1994 .

[11]  Mohammad Al Hasan,et al.  Link prediction using supervised learning , 2006 .

[12]  Inderjit S. Dhillon,et al.  Co-clustering documents and words using bipartite spectral graph partitioning , 2001, KDD '01.

[13]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[14]  Avi Pfeffer,et al.  Probabilistic Frame-Based Systems , 1998, AAAI/IAAI.

[15]  Theodoros Lappas,et al.  Finding a team of experts in social networks , 2009, KDD.

[16]  Lyle H. Ungar,et al.  Statistical Relational Learning for Link Prediction , 2003 .

[17]  Padhraic Smyth,et al.  EventRank: a framework for ranking time-varying networks , 2005, LinkKDD '05.

[18]  B. Mohar Some applications of Laplace eigenvalues of graphs , 1997 .

[19]  M. Richman,et al.  Euclidean Distance as a Similarity Metric for Principal Component Analysis , 2001 .