User Value Identification Based on Improved RFM Model and K -Means++ Algorithm for Complex Data Analysis

In recent years, with the development of machine learning and big data technology, user data has become an important element in the production process of enterprises. For today’s e-commerce platforms, the deep mining of user’s purchase behavior is helpful to understand user’s purchase preferences and accurately recommend products that meet user expectations, which can not only improve user satisfaction but also reduce platform marketing cost. To accurately identify the user value of online purchasing on an e-commerce platform, this paper uses an improved RFM model to extract user features and uses the K -means++ clustering algorithm to realize user classification. The indicators of the traditional RFM model characterize user features from three angles: recent purchase time ( R ), purchase frequency ( F ), and total consumption amount ( M ). The user group and scenarios studied in this paper are different from the previous literature: (1) the user group is relatively fixed, (2) the consumer goods are relatively single, and (3) the characteristics of repeated purchase are obvious. Therefore, based on the existing literature, this paper extracts the user characteristics studied and improves and models the traditional indicators. Based on the real purchasing data from September to December 2018, it calculates the indicators that improved RFM, empowers the weight to indicators, and finally classifies the value of users by using the K -means++ algorithm. The experimental results show that the user classification based on the improved RFM model is more accurate than the user classification based on the traditional RFM model, and the improved RFM model can identify the user value more accurately, which provides a strong support for the e-commerce platform to realize the accurate marketing strategy based on big data.

[1]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[2]  Douglas Steinley,et al.  K-means clustering: a half-century synthesis. , 2006, The British journal of mathematical and statistical psychology.

[3]  Raúl Parada,et al.  Anomaly Detection Using RFID-Based Information Management in an IoT Context , 2018, J. Organ. End User Comput..

[4]  Sang-Bing Tsai,et al.  An Empirical Study on Customer Segmentation by Purchase Behaviors Using a RFM Model and K-Means Algorithm , 2020, Mathematical Problems in Engineering.

[5]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[6]  Mohd Sharifuddin Ahmad,et al.  Factors that Influence the Acceptance of Internet of Things Services by Customers of Telecommunication Companies in Jordan , 2018, J. Organ. End User Comput..

[7]  Farshid Abdi,et al.  Hybrid soft computing approach based on clustering, rule mining, and decision tree analysis for customer segmentation problem: Real case of customer-centric industries , 2018, Appl. Soft Comput..

[8]  Peter Kolarovszki,et al.  The New Model of Customer Segmentation in Postal Enterprises , 2016 .

[9]  Katerina Pramatari,et al.  Retail business analytics: Customer visit segmentation using market basket data , 2018, Expert Syst. Appl..

[10]  Wen-Yu Chiang,et al.  To mine association rules of customer values via a data mining procedure with improved model: An empirical case study , 2011, Expert Syst. Appl..

[11]  Meina Song,et al.  Statistics-based CRM approach via time series segmenting RFM on large scale data , 2017, Knowl. Based Syst..

[12]  Keun Ho Ryu,et al.  Implementation of personalized recommendation system using k-means clustering of item category based on RFM , 2012, 2012 IEEE International Conference on Management of Innovation & Technology (ICMIT).

[13]  Paul Wang,et al.  Strategic database marketing , 1994 .

[14]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[15]  Kun Guo,et al.  Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining , 2012 .

[16]  Fouzi Lezzar,et al.  Smart Agricultural Enterprise System Based on Integration of Internet of Things and Agent Technology , 2018, J. Organ. End User Comput..

[17]  K. G. Srinivasa,et al.  Data Analytics Assisted Internet of Things Towards Building Intelligent Healthcare Monitoring Systems: IoT for Healthcare , 2018, J. Organ. End User Comput..

[18]  Niraj Kumar,et al.  Benefit-based consumer segmentation and performance evaluation of clustering approaches: An evidence of data-driven decision-making , 2018, Expert Syst. Appl..

[19]  M. J. Tarokh,et al.  Customer Profitability Segmentation for SMEs Case Study: Network Equipment Company , 2013 .

[20]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[21]  Fernando Bins Luce,et al.  Predicting customer value per product: From RFM to RFM/P , 2019 .

[22]  K. G. Srinivasa,et al.  Smart Vehicular System based on the Internet of Things , 2018, J. Organ. End User Comput..

[23]  Hui-Chu Chang Developing EL-RFM model for quantification learner's learning behavior in distance learning , 2010, 2010 2nd International Conference on Education Technology and Computer.

[24]  You-Shyang Chen,et al.  Classifying the segmentation of customer value via RFM model and RS theory , 2009, Expert Syst. Appl..

[25]  Saroj K. Biswas,et al.  A Hybrid Case Based Reasoning Model for Classification in Internet of Things (IoT) Environment , 2018, J. Organ. End User Comput..