Application of hybrid metaheuristic with perturbation-based K-nearest neighbors algorithm and densest imputation to collaborative filtering in recommender systems

Abstract Since the rise of E-commerce companies and many other web services, the applications of recommender systems have been adopted more broadly than ever before. Although collaborative filtering is the most well-known approach which utilizes customer’s preference to discover their interest, the problems of data sparsity and similarities selection still exist in it. Thus, this study intends to propose a hybrid metaheuristic with perturbation-based K-nearest neighbors and densest imputation for collaborative filtering (KDI-KNN) algorithm to reduce the effects of data sparsity. A similarities union function is proposed to determine the fittest similarity and enhance the prediction performance. Eventually, the experimental results indicate that hybrid metaheuristics with perturbation-based KDI-KNN algorithms are superior to basic KNN, original KDI-KNN, and most single metaheuristic-based KDI-KNN. In addition, a real-world dataset, fund transaction dataset is adopted in the case study. The analysis reveals that the similarity is seriously affected by the different content of the dataset.

[1]  Russ B. Altman,et al.  Missing value estimation methods for DNA microarrays , 2001, Bioinform..

[2]  Mohd Naz'ri Mahrin,et al.  A survey on data mining techniques in recommender systems , 2019, Soft Comput..

[3]  Christian Blum,et al.  Hybrid metaheuristics in combinatorial optimization: A survey , 2011, Appl. Soft Comput..

[4]  Kiyana Zolfaghar,et al.  Estimating customer lifetime value based on RFM analysis of customer purchase behavior: Case study , 2011, WCIT.

[5]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[6]  Michel Gendreau,et al.  A guide to vehicle routing heuristics , 2002, J. Oper. Res. Soc..

[7]  Taghi M. Khoshgoftaar,et al.  Imputed Neighborhood Based Collaborative Filtering , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[8]  Fernando Ortega,et al.  DeepFair: Deep Learning for Improving Fairness in Recommender Systems , 2020, Int. J. Interact. Multim. Artif. Intell..

[9]  T. Stijnen,et al.  Review: a gentle introduction to imputation of missing values. , 2006, Journal of clinical epidemiology.

[10]  Mária Bieliková,et al.  Personalized hybrid recommendation for group of users: Top-N multimedia recommender , 2016, Inf. Process. Manag..

[11]  Seyedali Mirjalili,et al.  SCA: A Sine Cosine Algorithm for solving optimization problems , 2016, Knowl. Based Syst..

[12]  J. Reed,et al.  Simulation of biological evolution and machine learning. I. Selection of self-reproducing numeric patterns by data processing machines, effects of hereditary control, mutation type and crossing. , 1967, Journal of theoretical biology.

[13]  J. Schafer,et al.  A comparison of inclusive and restrictive strategies in modern missing data procedures. , 2001, Psychological methods.

[14]  Hui Tian,et al.  A new user similarity model to improve the accuracy of collaborative filtering , 2014, Knowl. Based Syst..

[15]  Hyung Jun Ahn,et al.  A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem , 2008, Inf. Sci..

[16]  Martin Ester,et al.  TrustWalker: a random walk model for combining trust-based and item-based recommendation , 2009, KDD.

[17]  Mihalis Yannakakis,et al.  How easy is local search? , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[18]  Leonardo Franco,et al.  Missing data imputation using statistical and machine learning methods in a real breast cancer problem , 2010, Artif. Intell. Medicine.

[19]  John Riedl,et al.  An Algorithmic Framework for Performing Collaborative Filtering , 1999, SIGIR Forum.

[20]  Bam Bahadur Sinha,et al.  Evolution of recommender system over the time , 2019, Soft Computing.

[21]  Jesús Bobadilla,et al.  A new collaborative filtering metric that improves the behavior of recommender systems , 2010, Knowl. Based Syst..

[22]  Fernando Ortega,et al.  Improving collaborative filtering recommender system results and performance using genetic algorithms , 2011, Knowl. Based Syst..

[23]  Hisao Ishibuchi,et al.  A multi-objective genetic local search algorithm and its application to flowshop scheduling , 1998, IEEE Trans. Syst. Man Cybern. Part C.

[24]  Rui Jiang,et al.  Improving accuracy and diversity of personalized recommendation through power law adjustments of user similarities , 2013, Decis. Support Syst..