Personal Information Prediction Based on Movie Rating Data

Movies are a major form of entertainment in the US. There are a dozens of websites focusing on movie information. On most of the websites, ratings and reviews from the users play an important role. When a user gives a movie a certain score, the user not only reflects his taste toward that movie but also potentially exposes his personal information. In this paper, we investigated several movie genres. In each genre, movies were classified into different clusters by using expectationmaximization (EM) algorithm. The classification criteria were built upon audience movie rating scores and existing user information. As a result, a new or anonymous users personal information could be predicted when he rated movies on movie-related websites. Moreover, newly released movies could be easily classified into corresponding clusters to assistant user information discovery. The revealed personal information was very useful and could be utilized in different ways such as increasing the accuracy for delivering user-related ads.

[1]  Alejandro Bellogín,et al.  Discovering Relevant Preferences in a Personalised Recommender System using Machine Learning Techniques , 2008 .

[2]  Jianyong Wang,et al.  Personalizing Web Page Recommendation via Collaborative Filtering and Topic-Aware Markov Model , 2010, 2010 IEEE International Conference on Data Mining.

[3]  Kyoung-jae Kim,et al.  A recommender system using GA K-means clustering in an online shopping market , 2008, Expert Syst. Appl..

[4]  Yee Whye Teh,et al.  Variational Bayesian Approach to Movie Rating Prediction , 2007, KDD 2007.

[5]  Adam Prügel-Bennett,et al.  An Improved Switching Hybrid Recommender System Using Naive Bayes Classifier and Collaborative Filtering , 2010 .

[6]  Yingshu Li,et al.  Collective Data-Sanitization for Preventing Sensitive Information Inference Attacks in Social Networks , 2018, IEEE Transactions on Dependable and Secure Computing.

[7]  Zhipeng Cai,et al.  FakeMask: A Novel Privacy Preserving Approach for Smartphones , 2016, IEEE Transactions on Network and Service Management.

[8]  Sheetal Vij,et al.  A Hybrid Web Recommendation System based on the Improved Association Rule Mining Algorithm , 2013, ArXiv.

[9]  Aysenur Birturk,et al.  A Tag-Based Hybrid Music Recommendation System Using Semantic Relations and Multi-domain Information , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[10]  Yang Gao,et al.  An incentive mechanism with privacy protection in mobile crowdsourcing systems , 2016, Comput. Networks.

[11]  Yingshu Li,et al.  An energy efficient privacy-preserving content sharing scheme in mobile social networks , 2016, Personal and Ubiquitous Computing.

[12]  Xiongcai Cai,et al.  Collaborative Filtering for People to People Recommendation in Social Networks , 2010, Australasian Conference on Artificial Intelligence.