Scalable bundling via dense product embeddings

Bundling, the practice of jointly selling two or more products at a discount, is a widely used strategy in industry and a well examined concept in academia. Historically, the focus has been on theoretical studies in the context of monopolistic firms and assumed product relationships, e.g., complementarity in usage. We develop a new machine-learning-driven methodology for designing bundles in a large-scale, cross-category retail setting. We leverage historical purchases and consideration sets created from clickstream data to generate dense continuous representations of products called embeddings. We then put minimal structure on these embeddings and develop heuristics for complementarity and substitutability among products. Subsequently, we use the heuristics to create multiple bundles for each product and test their performance using a field experiment with a large retailer. We combine the results from the experiment with product embeddings using a hierarchical model that maps bundle features to their purchase likelihood, as measured by the add-to-cart rate. We find that our embeddings-based heuristics are strong predictors of bundle success, robust across product categories, and generalize well to the retailer's entire assortment.

[1]  David M. Blei,et al.  SHOPPER: A Probabilistic Model of Consumer Choice with Substitutes and Complements , 2017, The Annals of Applied Statistics.

[2]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[3]  V. Rao,et al.  A General Choice Model for Bundles with Multiple-Category Products: Application to Market Segmentation and Optimal Pricing for Bundles , 2003 .

[4]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[5]  David M. Blei,et al.  Structured Embedding Models for Grouped Data , 2017, NIPS.

[6]  Hemant K. Bhargava,et al.  Retailer-Driven Product Bundling in a Distribution Channel , 2012, Mark. Sci..

[7]  Richard Schmalensee,et al.  Commodity Bundling by Single-Product Monopolies , 1982, The Journal of Law and Economics.

[8]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[9]  Tzyy-Ching Yang,et al.  Comparison of product bundling strategies on different online shopping behaviors , 2006, Electron. Commer. Res. Appl..

[10]  Vineet Kumar,et al.  The Dynamic Effects of Bundling as a Product Strategy , 2012, Mark. Sci..

[11]  Andrew Y. Ng,et al.  Transfer learning for text classification , 2005, NIPS.

[12]  John R. Hauser,et al.  Identifying Customer Needs from User-Generated Content , 2019, Mark. Sci..

[13]  Yuan-Chun Jiang,et al.  Optimizing E-tailer Profits and Customer Savings: Pricing Multistage Customized Online Bundles , 2011, Mark. Sci..

[14]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[15]  David M. Blei,et al.  Exponential Family Embeddings , 2016, NIPS.

[16]  Janet L. Yellen,et al.  Commodity Bundling and the Burden of Monopoly , 1976 .

[17]  Richard Schmalensee,et al.  Gaussian Demand and Commodity Bundling , 1984 .

[18]  G. Tellis,et al.  Strategic Bundling of Products and Prices: A New Synthesis for Marketing , 2002 .

[19]  M. Salinger A Graphical Analysis of Bundling , 1995 .

[20]  R. Venkatesh,et al.  A Probabilistic Approach to Pricing a Bundle of Products or Services , 1993 .

[21]  Georgiana Dinu,et al.  Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors , 2014, ACL.

[22]  W. Kamakura,et al.  Optimal Bundling and Pricing Under a Monopoly: Contrasting Complements and Substitutes from Independently Valued Products , 2003 .

[23]  Arthur Lewbel,et al.  Bundling of substitutes or complements , 1985 .