Customer segmentation by web content mining

Abstract This article introduces a new dimension, Interpurchase Time (T), into the existing RFM (Recency, Frequency, and Monetary) model to form an expanded RFMT model for parsing consumers' online purchase sequences in a long period to implement customer segmentation. The proposed RFMT model can track and discern changes in customer purchasing behaviors during their whole shopping cycle. Firstly, a web content retrieving system was developed to fetch publicly available customer data on a retailer's website, including demographic information (gender, age, location, etc.) and product information (name, price, date, etc.) of each purchase in a period from 2008 to 2019. The RFMT values of a customer were then computed from the retrieved data and subsequently analyzed by the hierarchical clustering to derive seven homogeneous clusters with specific customer profiles. Subsequently, demographic features and product preferences were identified for each cluster with business insights that can help the retailer to improve customer relationships and to implement targeted recommendation strategies.

[1]  S. Fazli,et al.  K-Mean Clustering Method For Analysis Customer Lifetime Value With LRFM Relationship Model In Banking Services , 2012 .

[2]  I-Cheng Yeh,et al.  Knowledge discovery on RFM model using Bernoulli sequence , 2009, Expert Syst. Appl..

[3]  Doaa S. Elzanfaly,et al.  Investigation in Customer Value Segmentation Quality under Different Preprocessing Types of RFM Attributes , 2016, Int. J. Recent Contributions Eng. Sci. IT.

[4]  P. Anitha,et al.  RFM model for customer purchase behavior using K-Means algorithm , 2019, J. King Saud Univ. Comput. Inf. Sci..

[5]  Ruey-Shan Guo A multi-category inter-purchase time model based on hierarchical Bayesian theory , 2009, Expert Syst. Appl..

[6]  J. Miglautsch Thoughts on RFM scoring , 2000 .

[7]  Na Liu,et al.  Recommend products with consideration of multi-category inter-purchase time and price , 2018, Future Gener. Comput. Syst..

[8]  Hsiao-Ping Tsai,et al.  Group RFM analysis as a novel framework to discover better customer consumption behavior , 2011, Expert Syst. Appl..

[9]  Donald G. Morrison Interpurchase Time and Brand Loyalty , 1966 .

[10]  S. Somogyi,et al.  Consumer adoption of sustainable shellfish in China: Effects of psychological factors and segmentation , 2019, Journal of Cleaner Production.

[11]  F. Bass,et al.  The relationship between purchase regularity and propensity to accelerate , 2002 .

[12]  Casimir J. H. Ludwig,et al.  Bayesian and maximum likelihood estimation of hierarchical response time models , 2008, Psychonomic bulletin & review.

[13]  Shyla Afroge,et al.  RFM Based Market Segmentation Approach Using Advanced K-means and Agglomerative Clustering: A Comparative Study , 2019, 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE).

[14]  Bernard J. Jansen,et al.  Customer segmentation using online platforms: isolating behavioral and demographic segments for persona creation via aggregated user data , 2018, Social Network Analysis and Mining.

[15]  Phan Duy Hung,et al.  Customer Segmentation Using Hierarchical Agglomerative Clustering , 2019, Proceedings of the 2019 2nd International Conference on Information Science and Systems.

[16]  Brett Massimino,et al.  Accessing Online Data: Web‐Crawling and Information‐Scraping Techniques to Automate the Assembly of Research Data , 2016 .

[17]  Mohamad Abdul Kadir,et al.  Customer Segmentation on Online Retail using RFM Analysis: Big Data Case of Bukku.id , 2019, Proceedings of the International Conference on Environmental Awareness for Sustainable Development in conjunction with International Conference on Challenge and Opportunities Sustainable Environmental Development, ICEASD & ICCOSED 2019, 1-2 April 2019, Ke.

[18]  Markku Heikkila,et al.  Segmenting Retail Customers with an Enhanced RFM and a Hybrid Regression/Clustering Method , 2018, 2018 International Conference on Machine Learning and Data Engineering (iCMLDE).

[19]  M. Clarke,et al.  Segmenting the growing UK convenience store market for retail location planning , 2016 .

[20]  Cassandra C. Elrod,et al.  Empirical Study Utilizing QFD to Develop an International Marketing Strategy , 2015 .

[21]  P. Praveen,et al.  IMPROVING EFFICIENCY AND EFFECTIVENESS OF HIERARCHICAL CLUSTERING , 2018 .

[22]  Wawan Laksito Yuly Saptomo,et al.  Penerapan Agglomerative Hierarchical Clustering Untuk Segmentasi Pelanggan , 2020 .

[23]  Rudolf Scitovski,et al.  Searching for an Optimal Partition of Incomplete Data with Application in Modeling Energy Efficiency of Public Buildings , 2018, Croatian Operational Research Review.

[24]  Alok Sharma,et al.  Divisive hierarchical maximum likelihood clustering , 2017, BMC Bioinformatics.

[25]  Dwiza Riana,et al.  Clustering and profiling of customers using RFM for customer relationship management recommendations , 2017, 2017 5th International Conference on Cyber and IT Service Management (CITSM).

[26]  Sutrisno,et al.  Customer Segmentation based on RFM model and Clustering Techniques With K-Means Algorithm , 2018, 2018 Third International Conference on Informatics and Computing (ICIC).

[27]  Hui Xiong,et al.  Understanding of Internal Clustering Validation Measures , 2010, 2010 IEEE International Conference on Data Mining.

[28]  Hsin-Hung Wu,et al.  A review of the application of RFM model , 2010 .

[29]  Hsin-Hung Wu,et al.  A case study of applying LRFM model in market segmentation of a children's dental clinic , 2012, Expert Syst. Appl..

[30]  Young Soo Kim Understanding Online Consumer’s Inter-Purchase Time , 2013 .

[31]  A. Umamakeswari,et al.  RFM ranking - An effective approach to customer segmentation , 2018, J. King Saud Univ. Comput. Inf. Sci..

[32]  L. Meyer-Waarden,et al.  The influence of loyalty programme membership on customer purchase behaviour , 2008 .

[33]  Ujjwal Maulik,et al.  Performance Evaluation of Some Clustering Algorithms and Validity Indices , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Arthur Middleton Hughes,et al.  Strategic database marketing , 2005 .

[35]  Chin‐Feng Lin,et al.  Segmenting customer brand preference: demographic or psychographic , 2002 .