Entropy-based randomisation of rating networks

In recent years, due to the great diffusion of e-commerce, online rating platforms quickly became a common tool for purchase recommendations. However, instruments for their analysis did not evolve at the same speed. Indeed, interesting information about users' habits and tastes can be recovered just considering the bipartite network of users and products, in which links represent products' purchases and have different weights due to the score assigned to the item in users' reviews. With respect to other weighted bipartite networks, in these systems we observe a maximum possible weight per link, that limits the variability of the outcomes. In the present article we propose an entropy-based randomization method for this type of networks (i.e., bipartite rating networks) by extending the configuration model framework: the randomized network satisfies the constraints of the degree per rating, i.e., the number of given ratings received by the specified product or assigned by the single user. We first show that such a null model is able to reproduce several nontrivial features of the real network better than other null models. Then, using our model as benchmark, we project the information contained in the real system on one of the layers: To provide an interpretation of the projection obtained, we run the Louvain community detection on the obtained network and discuss the observed division in clusters. We are able to detect groups of music albums due to the consumers' taste or communities of movies due to their audience. Finally, we show that our method is also able to handle the special case of categorical bipartite networks: we consider the bipartite categorical network of scientific journals recognized for the scientific qualification in economics and statistics. In the end, from the outcome of our method, the probability that each user appreciate every product can be easily recovered. Therefore, this information may be employed in future applications to implement a more detailed recommendation system that also takes into account information regarding the topology of the observed network.

[1]  Daniel B. Larremore,et al.  Efficiently inferring community structure in bipartite networks , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  R Pastor-Satorras,et al.  Dynamical and correlation properties of the internet. , 2001, Physical review letters.

[3]  Alessandro Vespignani,et al.  Large-scale topological and dynamical properties of the Internet. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Giulio Cimini,et al.  Statistically validated network of portfolio overlaps and systemic risk , 2016, Scientific Reports.

[5]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[6]  Sebastiano Vigna,et al.  The Graph Structure in the Web - Analyzed on Different Aggregation Levels , 2015, J. Web Sci..

[7]  Giorgio Fagiolo,et al.  Enhanced reconstruction of weighted networks from strengths and degrees , 2013, 1307.2104.

[8]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[9]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[10]  M. Newman,et al.  Statistical mechanics of networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  Roma,et al.  Fitness model for the Italian interbank money market. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Diego Garlaschelli,et al.  Analytical maximum-likelihood method to detect patterns in real networks , 2011, 1103.0701.

[13]  Luc De Raedt,et al.  Proceedings of the 22nd international conference on Machine learning , 2005 .

[14]  M. Mézard,et al.  Journal of Statistical Mechanics: Theory and Experiment , 2011 .

[15]  F. Chung,et al.  Connected Components in Random Graphs with Given Expected Degree Sequences , 2002 .

[16]  Guido Caldarelli,et al.  The corporate boards networks , 2004 .

[17]  Yoshi Fujiwara,et al.  Systemic Risk and Vulnerabilities of Bank Networks , 2017 .

[18]  Guido Caldarelli,et al.  Scale-Free Networks , 2007 .

[19]  Andrea Gabrielli,et al.  Inferring monopartite projections of bipartite networks: an entropy-based approach , 2016 .

[20]  G. Caldarelli,et al.  Using Networks To Understand Medical Data: The Case of Class III Malocclusions , 2012, PloS one.

[21]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[22]  Yili Hong,et al.  On computing the distribution function for the Poisson binomial distribution , 2013, Comput. Stat. Data Anal..

[23]  Albert-László Barabási,et al.  Scale-Free Networks: A Decade and Beyond , 2009, Science.

[24]  A. Volkova A Refinement of the Central Limit Theorem for Sums of Independent Random Indicators , 1996 .

[25]  Carlo Ratti,et al.  A General Optimization Technique for High Quality Community Detection in Complex Networks , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Yamir Moreno,et al.  The Dynamics of Protest Recruitment through an Online Network , 2011, Scientific reports.

[27]  D. Garlaschelli,et al.  Maximum likelihood: extracting unbiased information from complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Guido Caldarelli,et al.  Grand canonical validation of the bipartite international trade network. , 2017, Physical review. E.

[29]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[30]  M. Tumminello,et al.  Statistically Validated Networks in Bipartite Complex Systems , 2010, PloS one.

[31]  Guido Caldarelli,et al.  Entropy-based approach to missing-links prediction , 2018, Appl. Netw. Sci..

[32]  J. Loscalzo,et al.  Putting the Patient Back Together - Social Medicine, Network Medicine, and the Limits of Reductionism. , 2017, The New England journal of medicine.

[33]  Andrea Gabrielli,et al.  Randomizing bipartite networks: the case of the World Trade Web , 2015, Scientific Reports.

[34]  D. Garlaschelli,et al.  Reconstructing the world trade multiplex: the role of intensive and extensive biases. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  R. Cox,et al.  Journal of the Royal Statistical Society B , 1972 .

[37]  Olaf Sporns,et al.  Communication dynamics in complex brain networks , 2017, Nature Reviews Neuroscience.

[38]  M. Buchanan,et al.  Networks in cell biology , 2010 .

[39]  Guido Caldarelli,et al.  Organization and hierarchy of the human functional brain network lead to a chain-like core , 2017, Scientific Reports.