Coloured graphlet profiles as a predictor of career length in scientific co-authorship networks

Graphlets, or induced motifs, have long been used to find important medium-scale structures in directed networks. We present a method using the composition of coloured graphlets in ego-networks to characterise nodes. We give an example application using our technique to predict the numbers of years researchers are active from their collaboration networks, and compare our success with simpler metrics; particularly, we find that the use of coloured graphlets improves predictive performance compared to colour-blind graphlets; that 4-star graphlets centred on an author are predictors of a long career, and that this effect is not degenerate to centralities.

[1]  Brendan D. McKay,et al.  Practical graph isomorphism, II , 2013, J. Symb. Comput..

[2]  S. Strogatz Exploring complex networks , 2001, Nature.

[3]  Cailing Dong,et al.  Analysis of Computer Science Communities Based on DBLP , 2010, ECDL.

[4]  Niloy Ganguly,et al.  An author is known by the context she keeps: significance of network motifs in scientific collaborations , 2015, Social Network Analysis and Mining.

[5]  Padraig Cunningham,et al.  Characterizing ego-networks using motifs , 2013, Network Science.

[6]  R. Burt Models of Network Structure , 1980 .

[7]  Sahar Asadi,et al.  Kavosh: a new algorithm for finding network motifs , 2009, BMC Bioinformatics.

[8]  Carlo Bonferroni Sulle medie multiple di potenze , 1950 .

[9]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[10]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[11]  Fernando M. A. Silva,et al.  Comparison of Co-authorship Networks across Scientific Fields Using Motifs , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.

[12]  A Masoudi-Nejad,et al.  Building blocks of biological networks: a review on major network motif discovery algorithms. , 2012, IET systems biology.

[13]  L. Krumov,et al.  Motifs in co-authorship networks and their relation to the impact of scientific publications , 2011 .

[14]  J. Brian Gray,et al.  Introduction to Linear Regression Analysis , 2002, Technometrics.

[15]  Sameer Kumar,et al.  Co-authorship networks: a review of the literature , 2015, Aslib J. Inf. Manag..

[16]  Natasa Przulj,et al.  Biological network comparison using graphlet degree distribution , 2007, Bioinform..

[17]  Padraig Cunningham,et al.  Network Analysis of Recurring YouTube Spam Campaigns , 2012, ICWSM.

[18]  Eldon Y. Li,et al.  Co-authorship networks and research impact: A social capital perspective , 2013 .

[19]  Jie Tang,et al.  Influence Maximization in Dynamic Social Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.

[20]  Filippo Radicchi,et al.  Changing demographics of scientific careers: The rise of the temporary workforce , 2018, Proceedings of the National Academy of Sciences.