Analyzing existing customers' websites to improve the customer acquisition process as well as the profitability prediction in B-to-B marketing

Research highlights? Prediction of the profitability of new customers. ? Used web mining to extract latent semantic concepts from customers' websites. ? Used clustering of latent semantic concepts to identify prevalent terms. ? Used prevalent terms to identify addresses of new profitable customers. We investigate the issue of predicting new customers as profitable based on information about existing customers in a business-to-business environment. In particular, we show how latent semantic concepts from textual information of existing customers' websites can be used to uncover characteristics of websites of companies that will turn into profitable customers. Hence, the use of predictive analytics will help to identify new potential acquisition targets. Additionally, we show that a regression model based on these concepts is successful in the profitability prediction of new customers. In a case study, the acquisition process of a mail-order company is supported by creating a prioritized list of new customers generated by this approach. It is shown that the density of profitable customers in this list outperforms the density of profitable customers in traditional generated address lists (e.g. from list brokers). From a managerial point of view, this approach supports the identification of new business customers and helps to estimate the future profitability of these customers in a company. Consequently, the customer acquisition process can be targeted more effectively and efficiently. This leads to a competitive advantage for B2B companies and improves the acquisition process that is time- and cost-consuming with traditionally low conversion rates.

[1]  Yuh-Min Chen,et al.  Developing a semantic-enable information retrieval mechanism , 2010, Expert Syst. Appl..

[2]  Kristof Coussement,et al.  Integrating the voice of customers through call center emails into a decision support system for churn prediction , 2008, Inf. Manag..

[3]  Dirk Thorleuchter,et al.  Mining Social Behavior Ideas of Przewalski Horses , 2011 .

[4]  Dirk Thorleuchter,et al.  Companies website optimising concerning consumer's searching for new products , 2011, 2011 International Conference on Uncertainty Reasoning and Knowledge Engineering.

[5]  Dirk Thorleuchter,et al.  Extracting Consumers Needs for New Products - A Web Mining Approach , 2010, 2010 Third International Conference on Knowledge Discovery and Data Mining.

[6]  You-Jin Park,et al.  Individual and group behavior-based customer profile model for personalized product recommendation , 2009, Expert Syst. Appl..

[7]  Wagner A. Kamakura,et al.  Defection Detection: Measuring and Understanding the Predictive Accuracy of Customer Churn Models , 2006 .

[8]  Paul D. Allison,et al.  Logistic Regression Using the SAS System : Theory and Application , 1999 .

[9]  Koen W. De Bock,et al.  Predicting Website Audience Demographics forWeb Advertising Targeting Using Multi-Website Clickstream Data , 2010, Fundam. Informaticae.

[10]  Kun-Chang Lee,et al.  Identification of Customer Segmentation Sttrategies by Using Machine Learning-Oriented Web-mining Technique , 2003 .

[11]  Yufei Yuan,et al.  Managing business-to-business relationships throughout the e-commerce procurement life cycle , 2000, Internet Res..

[12]  Dirk Thorleuchter,et al.  Vertrauliche Verarbeitung staatlich eingestufter Information – die Informationstechnologie im Geheimschutz , 2008, Informatik-Spektrum.

[13]  Mu-Chen Chen,et al.  Mining changes in customer behavior in retail marketing , 2005, Expert Syst. Appl..

[14]  Christian Homburg,et al.  Understanding Customer Value in Business-to-Business Relationships , 2005 .

[15]  Yasuharu Ukai,et al.  Research Note: Statistical Analysis on E-mail Magazines used by Japanese Prime Ministers , 2011, Rev. Socionetwork Strateg..

[16]  Dirk Van den Poel,et al.  Data augmentation by predicting spending pleasure using commercially available external data , 2009, Journal of Intelligent Information Systems.

[17]  Dirk Thorleuchter,et al.  High granular multi-level-security model for improved usability , 2011, 2011 International Conference on System science, Engineering design and Manufacturing informatization.

[18]  P. Verhoef,et al.  CRM in Data-Rich Multichannel Retailing Environments: A Review and Future Research Directions , 2010 .

[19]  Andrew Kusiak,et al.  Mining Authoritativeness of Collaborative Innovation Partners , 2010, Int. J. Comput. Commun. Control.

[20]  R. Brodie,et al.  Understanding contemporary marketing: Development of a classification scheme , 1997 .

[21]  Kristof Coussement,et al.  Improving customer attrition prediction by integrating emotions from client/company interaction emails and evaluating multiple classifiers , 2009, Expert Syst. Appl..

[22]  Dirk Thorleuchter Finding New Technological Ideas and Inventions with Text Mining and Technique Philosophy , 2007, GfKl.

[23]  Dirk Van den Poel,et al.  Improving Purchasing Behavior Predictions by Data Augmentation with Situational Variables , 2010, Int. J. Inf. Technol. Decis. Mak..

[24]  Warren R. Greiff,et al.  A theory of term weighting based on exploratory data analysis , 1998, SIGIR '98.

[25]  Xue Li,et al.  Unified collaborative filtering model based on combination of latent features , 2010, Expert Syst. Appl..

[26]  RadhaKanta Mahapatra,et al.  Business data mining - a machine learning perspective , 2001, Inf. Manag..

[27]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[28]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[29]  Dirk Thorleuchter,et al.  Mining ideas from textual information , 2010, Expert Syst. Appl..

[30]  Dirk Thorleuchter,et al.  Mining Innovative Ideas to Support New Product Research and Development , 2010 .

[31]  E. DeLong,et al.  Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.

[32]  Dirk Thorleuchter,et al.  Semantic technology classification — A defence and security case study , 2011, 2011 International Conference on Uncertainty Reasoning and Knowledge Engineering.

[33]  Sunil Gupta,et al.  Brand Choice, Purchase Incidence, and Segmentation: An Integrated Modeling Approach , 1992 .

[34]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[35]  Dirk Van den Poel,et al.  Predicting online-purchasing behaviour , 2005, Eur. J. Oper. Res..

[36]  Dirk Thorleuchter,et al.  A compared R&D-based and patent-based cross impact analysis for identifying relationships between technologies , 2010 .

[37]  Kohei Ichikawa,et al.  Social Network Rebuilder: A Tool to Estimate a Social Network of Financial Crisis Propagation , 2011, Rev. Socionetwork Strateg..

[38]  Yan Diqun,et al.  Quantization Step Parity-based Steganography for MP3 Audio , 2009 .

[39]  Li-Hsien Lin,et al.  Rubik's cube watermark technology for grayscale images , 2010, Expert Syst. Appl..

[40]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[41]  James Allan,et al.  Automatic structuring and retrieval of large text files , 1994, CACM.

[42]  Shan Ling Pan,et al.  Using e-CRM for a unified view of the customer , 2003, CACM.

[43]  W. Reinartz,et al.  The Impact of Customer Relationship Characteristics on Profitable Lifetime Duration , 2003 .

[44]  YongSeog Kim,et al.  Toward a successful CRM: variable selection, sampling, and ensemble , 2006, Decis. Support Syst..

[45]  Tomás Bayón,et al.  The chain from customer satisfaction via word-of-mouth referrals to new customer acquisition , 2007 .

[46]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.