On the Formation of Circles in Co-authorship Networks

The availability of an overwhelmingly large amount of bibliographic information including citation and co-authorship data makes it imperative to have a systematic approach that will enable an author to organize her own personal academic network profitably. An effective method could be to have one's co-authorship network arranged into a set of ``circles'', which has been a recent practice for organizing relationships (e.g., friendship) in many online social networks. In this paper, we propose an unsupervised approach to automatically detect circles in an ego network such that each circle represents a densely knit community of researchers. Our model is an unsupervised method which combines a variety of node features and node similarity measures. The model is built from a rich co-authorship network data of more than 8 hundred thousand authors. In the first level of evaluation, our model achieves 13.33% improvement in terms of overlapping modularity compared to the best among four state-of-the-art community detection methods. Further, we conduct a task-based evaluation -- two basic frameworks for collaboration prediction are considered with the circle information (obtained from our model) included in the feature set. Experimental results show that including the circle information detected by our model improves the prediction performance by 9.87% and 15.25% on average in terms of AUC (Area under the ROC) and Prec@20 (Precision at Top 20) respectively compared to the case, where the circle information is not present.

[1]  Krishna P. Gummadi,et al.  You are who you know: inferring user profiles in online social networks , 2010, WSDM '10.

[2]  Weimao Ke,et al.  Studying the emerging global brain: Analyzing and visualizing the impact of co-authorship teams , 2005, Complex..

[3]  Weimao Ke,et al.  Studying the emerging global brain: Analyzing and visualizing the impact of co-authorship teams: Research Articles , 2005 .

[4]  Kon Shing Kenneth Chung,et al.  Egocentric analysis of co-authorship network structure, position and performance , 2012, Inf. Process. Manag..

[5]  William W. Cohen,et al.  Block-LDA: Jointly Modeling Entity-Annotated Text and Entity-Entity Links , 2014, Handbook of Mixed Membership Models and Their Applications.

[6]  Santo Fortunato,et al.  Finding Statistically Significant Communities in Networks , 2010, PloS one.

[7]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[8]  Jure Leskovec,et al.  Supervised random walks: predicting and recommending links in social networks , 2010, WSDM '11.

[9]  R. Wigand,et al.  Measuring social capital through network analysis and its influence on individual performance , 2014 .

[10]  Mao-Bin Hu,et al.  Detect overlapping and hierarchical community structure in networks , 2008, ArXiv.

[11]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[12]  Jure Leskovec,et al.  Discovering social circles in ego networks , 2012, ACM Trans. Knowl. Discov. Data.

[13]  Steve Gregory,et al.  Finding overlapping communities in networks by label propagation , 2009, ArXiv.

[14]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[15]  Boleslaw K. Szymanski,et al.  Towards Linear Time Overlapping Community Detection in Social Networks , 2012, PAKDD.

[16]  Jure Leskovec,et al.  Overlapping community detection at scale: a nonnegative matrix factorization approach , 2013, WSDM.

[17]  Christopher McCarty,et al.  Predicting author h-index using characteristics of the co-author network , 2013, Scientometrics.

[18]  Ying Ding,et al.  Applying centrality measures to impact analysis: A coauthorship network analysis , 2009, J. Assoc. Inf. Sci. Technol..

[19]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[20]  Ajith Kumar,et al.  Social-structural foundations of publication productivity in the Journal of Consumer Research , 2002 .

[21]  Stanford,et al.  Learning to Discover Social Circles in Ego Networks , 2012 .

[22]  Niloy Ganguly,et al.  Citation interactions among computer science fields: a quantitative route to the rise and fall of scientific research , 2014, Social Network Analysis and Mining.

[23]  Jian Liu,et al.  Detecting community structure in complex networks using simulated annealing with k-means algorithms , 2010 .