Movie Recommendation System Employing Latent Graph Features in Extremely Randomized Trees

E-commerce development has made recommender systems compulsory to prune large amount of data so that users can be directed toward the items that are best suitable to them. Mainly, recommender systems are classified into 1) Content Based approach, 2) Collaborative Filtering approach and 3) Hybrid approaches; which in general relies on various features of users and items requiring a detailed and organized information about them. In many situations, it is next to impossible to gather such a massive data in systematic form. Therefore, our main aim in this paper is to implement an efficient recommendation system which requires only three attributes: User-Id, Item-Id and Rating given by User to Item. We propose a completely different approach, which generates latent graph features from User-Item rating links and uses them for recommendation. We apply it for predicting Movie Ratings, using just User-ids, Movie-ids, rating and not taking in consideration factors such as actors, genres, director, etc. In the experimental section, numerical results are provided for the proposed algorithm. MovieLens dataset is used for the experimental purpose. Results show that it outperforms existing techniques in current literature.

[1]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[2]  Vahab S. Mirrokni,et al.  Local Computation of PageRank Contributions , 2007, Internet Math..

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Huy Nguyen,et al.  A Modified Regularized Non-Negative Matrix Factorization for MovieLens , 2012, 2012 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future.

[5]  Tsvi Kuflik,et al.  Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems (HetRec 2011) : 27th October 2011, Chicago, IL, USA , 2011 .

[6]  Tsvi Kuflik,et al.  Workshop on information heterogeneity and fusion in recommender systems (HetRec 2010) , 2010, RecSys '10.

[7]  Matthieu Latapy,et al.  Basic notions for the analysis of large two-mode networks , 2008, Soc. Networks.

[8]  Byeong Man Kim,et al.  Constructing User Profiles for Collaborative Recommender System , 2004, APWeb.

[9]  Vahab S. Mirrokni,et al.  Local Computation of PageRank Contributions , 2007, WAW.

[10]  David F. Gleich,et al.  Algorithms and Models for the Web Graph , 2014, Lecture Notes in Computer Science.

[11]  Tsvi Kuflik,et al.  Second workshop on information heterogeneity and fusion in recommender systems (HetRec2011) , 2011, RecSys '11.

[12]  Z. Di,et al.  Clustering coefficient and community structure of bipartite networks , 2007, 0710.0117.