The pursuit of hubbiness: Analysis of hubs in large multidimensional networks

Abstract Hubs are highly connected nodes within a network. In complex network analysis, hubs have been widely studied, and are at the basis of many tasks, such as web search and epidemic outbreak detection. In reality, networks are often multidimensional, i.e., there can exist multiple connections between any pair of nodes. In this setting, the concept of hub depends on the multiple dimensions of the network, whose interplay becomes crucial for the connectedness of a node. In this paper, we characterize multidimensional hubs. We consider the multidimensional generalization of the degree and introduce a new class of measures, that we call Dimension Relevance, aimed at analyzing the importance of different dimensions for the hubbiness of a node. We assess the meaningfulness of our measures by comparing them on real networks and null models, then we study the interplay among dimensions and their effect on node connectivity. Our findings show that: (i) multidimensional hubs do exist and their characterization yields interesting insights and (ii) it is possible to detect the most influential dimensions that cause the different hub behaviors. We demonstrate the usefulness of multidimensional analysis in three real world domains: detection of ambiguous query terms in a word–word query log network, outlier detection in a social network, and temporal analysis of behaviors in a co-authorship network.

[1]  Anna Monreale,et al.  Foundations of Multidimensional Network Analysis , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[2]  Marko A. Rodriguez,et al.  Exposing multi-relational networks to single-relational network analysis algorithms , 2008, J. Informetrics.

[3]  Jiawei Han,et al.  gSpan: graph-based substructure pattern mining , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[4]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[5]  Aristides Gionis,et al.  Mining Graph Evolution Rules , 2009, ECML/PKDD.

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  Jukka-Pekka Onnela,et al.  Community Structure in Time-Dependent, Multiscale, and Multiplex Networks , 2009, Science.

[8]  M. E. J. Newman,et al.  Power laws, Pareto distributions and Zipf's law , 2005 .

[9]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[10]  Jure Leskovec,et al.  The dynamics of viral marketing , 2005, EC '06.

[11]  Lada A. Adamic,et al.  Looking at the Blogosphere Topology through Different Lenses , 2007, ICWSM.

[12]  Tanya Y. Berger-Wolf,et al.  Online Sampling of High Centrality Individuals in Social Networks , 2010, PAKDD.

[13]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[14]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[15]  Abdur Chowdhury,et al.  A picture of search , 2006, InfoScale '06.

[16]  Lada A. Adamic,et al.  Search in Power-Law Networks , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Huiru Zheng,et al.  eNelator: A simulation system for large-scale vulnerability analysis of species-, disease- and process-specific protein networks , 2010, J. Comput. Sci..

[18]  Philip S. Yu,et al.  Graph OLAP: Towards Online Analytical Processing on Graphs , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[19]  Jiawei Han,et al.  Community Mining from Multi-relational Networks , 2005, PKDD.

[20]  A. Barabasi,et al.  Lethality and centrality in protein networks , 2001, Nature.

[21]  Alessandro Vespignani,et al.  Modeling the spatial spread of infectious diseases: The GLobal Epidemic and Mobility computational model , 2010, J. Comput. Sci..

[22]  Huan Liu,et al.  Scalable learning of collective behavior based on sparse social dimensions , 2009, CIKM.

[23]  Jure Leskovec,et al.  Predicting positive and negative links in online social networks , 2010, WWW '10.

[24]  Jason Noble,et al.  Extremism Propagation in Social Networks with Hubs , 2008, Adapt. Behav..

[25]  Christos Faloutsos,et al.  Epidemic thresholds in real networks , 2008, TSEC.

[26]  Hongyuan Zha,et al.  Co-ranking Authors and Documents in a Heterogeneous Network , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[27]  S. Redner How popular is your paper? An empirical study of the citation distribution , 1998, cond-mat/9804163.

[28]  Hans-Peter Kriegel,et al.  Pattern Mining in Frequent Dynamic Subgraphs , 2006, Sixth International Conference on Data Mining (ICDM'06).