Rational Erdös number and maximum flow as measurement models for scientific social network analysis

In social network analysis, the detection of communities—composed of people with common interests—is a classical problem. Moreover, people can somehow influence any other in the community, i.e., they can spread information among them. In this paper, two models are proposed considering information diffusion strategies and the identification of communities in a scientific social network built through these two model concepts. The maximum flow-based and the Erdös number-based models are proposed as a measurement to weigh all the relationships between elements. A clustering algorithm (k-medoids) was used for the identification of communities of closely connected people in order to evaluate the proposed models in a scientific social network. Detailed analysis of the obtained scientific communities was conducted to compare the structure of formed groups and to demonstrate the feasibility of the solution. The results demonstrate the viability and effectiveness of the proposed solution, showing that information reaches elements that are not directly related to the element that produces it.

[1]  I. Gutman,et al.  Resistance distance and Laplacian spectrum , 2003 .

[2]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[3]  Éva Tardos,et al.  Maximizing the Spread of Influence through a Social Network , 2015, Theory Comput..

[4]  Leysia Palen,et al.  Twitter adoption and use in mass convergence and emergency events , 2009 .

[5]  E. B. Wilson PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES. , 1919, Science.

[6]  Luming Zhang,et al.  Multiple Social Network Learning and Its Application in Volunteerism Tendency Prediction , 2015, SIGIR.

[7]  R. Yin Case Study Research: Design and Methods , 1984 .

[8]  Piet Van Mieghem,et al.  Finding Critical Regions and Region-Disjoint Paths in a Network , 2015, IEEE/ACM Transactions on Networking.

[9]  M. Aldenderfer Cluster Analysis , 1984 .

[10]  Taxiarchis Botsis,et al.  Exploring the community structure of a diabetes forum. , 2012, Studies in health technology and informatics.

[11]  Christopher J. Fariss,et al.  Inferring Tie Strength from Online Directed Behavior , 2013, PloS one.

[12]  Iraklis Varlamis,et al.  A Study on Social Network Metrics and Their Application in Trust Networks , 2010, 2010 International Conference on Advances in Social Networks Analysis and Mining.

[13]  Jian Ma,et al.  A Multilevel Information Mining Approach for Expert Recommendation in Online Scientific Communities , 2015, Comput. J..

[14]  Erjia Yan,et al.  Predicting and recommending collaborations: An author-, institution-, and country-level analysis , 2014, J. Informetrics.

[15]  N. Milburn,et al.  Mobilizing homeless youth for HIV prevention: a social network analysis of the acceptability of a face-to-face and online social networking intervention. , 2012, Health education research.

[16]  Richard M. Karp,et al.  Theoretical Improvements in Algorithmic Efficiency for Network Flow Problems , 1972, Combinatorial Optimization.

[17]  Jon Kleinberg,et al.  The link prediction problem for social networks , 2003, CIKM '03.

[18]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[19]  Liaquat Hossain,et al.  Social networks and coordination performance of distributed software development teams , 2009 .

[20]  R. Bapat,et al.  A Simple Method for Computing Resistance Distance , 2003 .

[21]  Tat-Seng Chua,et al.  Micro Tells Macro: Predicting the Popularity of Micro-Videos via a Transductive Model , 2016, ACM Multimedia.

[22]  A. Cayley A theorem on trees , 2009 .

[23]  M. Newman Coauthorship networks and patterns of scientific collaboration , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Jano Moreira de Souza,et al.  Group and link analysis of multi-relational scientific social networks , 2013, J. Syst. Softw..

[25]  Renaud Lambiotte,et al.  Community structure and patterns of scientific collaboration in Business and Management , 2011, Scientometrics.

[26]  Esteban Moro,et al.  Social Features of Online Networks: The Strength of Intermediary Ties in Online Social Media , 2011, PloS one.

[27]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[28]  Mark Newman,et al.  Detecting community structure in networks , 2004 .

[29]  Jure Leskovec,et al.  Defining and Evaluating Network Communities Based on Ground-Truth , 2012, ICDM.

[30]  D. R. Fulkerson,et al.  Maximal Flow Through a Network , 1956 .

[31]  Adrien Guille,et al.  Information diffusion in online social networks , 2013, SIGMOD'13 PhD Symposium.

[32]  Zhe Zhang,et al.  Exploiting sentiment homophily for link prediction , 2014, RecSys '14.

[33]  Ravindra B. Bapat,et al.  Resistance matrix of a weighted graph , 2004 .

[34]  Ujjwal Maulik,et al.  Validity index for crisp and fuzzy clusters , 2004, Pattern Recognit..

[35]  D. Kaufman,et al.  Introduction to the lipoprotein series , 2006, Intensive Care Medicine.

[36]  Lada A. Adamic,et al.  The role of social networks in information diffusion , 2012, WWW.

[37]  Jason J. Jung Integrating Social Networks for Context Fusion in Mobile Service Platforms , 2010, J. Univers. Comput. Sci..

[38]  David Liben-Nowell,et al.  The link-prediction problem for social networks , 2007 .

[39]  Ryutaro Ichise,et al.  Community mining tool using bibliography data , 2005, Ninth International Conference on Information Visualisation (IV'05).

[40]  Luming Zhang,et al.  Interest Inference via Structure-Constrained Multi-Source Multi-Task Learning , 2015, IJCAI.

[41]  Raquel Recuero,et al.  Taking tweets to the streets: A spatial analysis of the Vinegar Protests in Brazil , 2014, First Monday.

[42]  Nicholas A. Cumpsty,et al.  Some Lessons Learned , 2010 .

[43]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[44]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[45]  M. Newman 1 Who is the best connected scientist ? A study of scientific coauthorship networks , 2004 .

[46]  P. Howard,et al.  Opening Closed Regimes: What Was the Role of Social Media During the Arab Spring? , 2011 .

[47]  Dino Pedreschi,et al.  Human mobility, social ties, and link prediction , 2011, KDD.

[48]  Jano Moreira de Souza,et al.  Information Extraction to improve Link Prediction in scientific social networks , 2016, 2016 IEEE 20th International Conference on Computer Supported Cooperative Work in Design (CSCWD).

[49]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.