Statistical analysis of Nomao customer votes for spots of France

We investigate the statistical properties of votes of customers for spots of France collected by the startup company Nomao. The frequencies of votes per spot and per customer are characterized by a power law distribution which remains stable on a time scale of a decade when the number of votes is varied by almost two orders of magnitude. Using the computer science methods we explore the spectrum and the eigenvalues of a matrix containing user ratings to geolocalized items. Eigenvalues nicely map to large towns and regions but show certain level of instability as we modify the interpretation of the underlying matrix. We evaluate imputation strategies that provide improved prediction performance by reaching geographically smooth eigenvectors. We point on possible links between distribution of votes and the phenomenon of self-organized criticality.

[1]  Tomoharu Iwata,et al.  Travel route recommendation using geotags in photo sharing sites , 2010, CIKM.

[2]  Li Chen,et al.  Factorization vs. regularization: fusing heterogeneous social relationships in top-n recommendation , 2011, RecSys '11.

[3]  S. Zamir,et al.  Lower Rank Approximation of Matrices by Least Squares With Any Choice of Weights , 1979 .

[4]  Padhraic Smyth,et al.  KDD Cup and workshop 2007 , 2007, SKDD.

[5]  Michael R. Lyu,et al.  Fused Matrix Factorization with Geographical and Social Influence in Location-Based Social Networks , 2012, AAAI.

[6]  George Karypis,et al.  Item-based top-N recommendation algorithms , 2004, TOIS.

[7]  Jie Bao,et al.  A Survey on Recommendations in Location-based Social Networks , 2013 .

[8]  Tommi S. Jaakkola,et al.  Weighted Low-Rank Approximations , 2003, ICML.

[9]  Domonkos Tikk,et al.  Investigation of Various Matrix Factorization Methods for Large Recommender Systems , 2008, 2008 IEEE International Conference on Data Mining Workshops.

[10]  B. Nemeth,et al.  A unified approach of factor models and neighbor based methods for large recommender systems , 2008, 2008 First International Conference on the Applications of Digital Information and Web Technologies (ICADIWT).

[11]  Miklos Kurucz,et al.  Spectral clustering in telephone call graphs , 2007, WebKDD/SNA-KDD '07.

[12]  Michael H. Pryor,et al.  The Effects of Singular Value Decomposition on Collaborative Filtering , 1998 .

[13]  Fredric C. Gey,et al.  Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval , 1999, SIGIR 1999.

[14]  Abelian sandpile model. , 1993, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[15]  John Riedl,et al.  Application of Dimensionality Reduction in Recommender Systems , 2000 .

[16]  Sergey N. Dorogovtsev,et al.  Lectures on Complex Networks , 2010 .

[17]  Roberto Turrin,et al.  Performance of recommender algorithms on top-n recommendation tasks , 2010, RecSys '10.

[18]  Panagiotis Symeonidis,et al.  Location-Based Social Networks , 2014 .

[19]  Domonkos Tikk,et al.  Investigation of Various Matrix Factorization Methods for Large Recommender Systems , 2008, ICDM Workshops.

[20]  Yehuda Koren,et al.  Matrix Factorization Techniques for Recommender Systems , 2009, Computer.

[21]  Fillia Makedon,et al.  Using singular value decomposition approximation for collaborative filtering , 2005, Seventh IEEE International Conference on E-Commerce Technology (CEC'05).

[22]  John F. Canny,et al.  Collaborative filtering with privacy via factor analysis , 2002, SIGIR '02.

[23]  Kenneth Y. Goldberg,et al.  Jester 2.0 (poster abstract): evaluation of an new linear time collaborative filtering algorithm , 1999, SIGIR '99.

[24]  Gene H. Golub,et al.  Matrix computations , 1983 .

[25]  P. Bak,et al.  Self-organized criticality. , 1988, Physical review. A, General physics.

[26]  Yehuda Koren,et al.  Lessons from the Netflix prize challenge , 2007, SKDD.

[27]  Tang,et al.  Self-Organized Criticality: An Explanation of 1/f Noise , 2011 .

[28]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[29]  Ahmed Eldawy,et al.  LARS: A Location-Aware Recommender System , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[30]  James Bennett,et al.  The Netflix Prize , 2007 .

[31]  Michael J. Pazzani,et al.  Learning Collaborative Information Filters , 1998, ICML.

[32]  Mao Ye,et al.  Exploiting geographical influence for collaborative point-of-interest recommendation , 2011, SIGIR.

[33]  Mao Ye,et al.  Location recommendation for location-based social networks , 2010, GIS '10.

[34]  Paul Resnick,et al.  Recommender systems , 1997, CACM.