Probabilistic simulation of spatial demand for intelligent product allocation

Connecting consumers with relevant products is a very important problem in both online and offline commerce. In many offline retail settings, product distributors extend bids to place and manage product displays within a retail outlet. The distributor aims to choose a spatial allocation strategy that maximizes revenue given a preset budget constraint. Prior work shows that carefully selecting product locations within a store can minimize search costs and and induce consumers to make "impulse" purchases. Such impulse purchases are influenced by the spatial configuration of the store. However, learning important spatial patterns in offline retail is challenging due to the scarcity of data and the high cost of exploration and experimentation in the physical world. To address these challenges, we propose a stochastic model of spatial demand in physical retail, which we call the Probabilistic Spatial Demand Simulator (PSD-sim). PSD-sim is an effective mirror of the real environment because it exploits the structure of common retail datasets through a hierarchical parameter sharing structure, and is able to incorporate spatial and economic knowledge through informative priors. We show that PSD-sim can both recover ground truth test data better than baselines, and generate new data for unseen states. The simulator can naturally be used to train policy estimators that discover intelligent, spatial allocation strategies. Finally, we perform a preliminary study into different optimization techniques and find that Deep Q-Learning can learn an effective spatial allocation policy.

[1]  ειδικούς στόχους,et al.  (2016) , 2018 .

[2]  Geert Wets,et al.  A data mining framework for optimal product selection in retail supermarket data: the generalized PROFSET model , 2000, KDD '00.

[3]  Andrew Gelman,et al.  The No-U-turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo , 2011, J. Mach. Learn. Res..

[4]  D. Binu,et al.  An approach to products placement in supermarkets using PrefixSpan algorithm , 2013, J. King Saud Univ. Comput. Inf. Sci..

[5]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[6]  Zhenhui Li,et al.  IntelliLight: A Reinforcement Learning Approach for Intelligent Traffic Light Control , 2018, KDD.

[7]  Anton van den Hengel,et al.  Image-Based Recommendations on Styles and Substitutes , 2015, SIGIR.

[8]  Dustin Tran,et al.  Automatic Differentiation Variational Inference , 2016, J. Mach. Learn. Res..

[9]  Abhijit Gosavi,et al.  Joint Optimization of Product Price, Display Orientation and Shelf-Space Allocation in Retail Category Management , 2010 .

[10]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[11]  Jun Wang,et al.  Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning , 2019, WWW.

[12]  Jochen Wirtz,et al.  The role of store environmental stimulation and social factors on impulse purchasing , 2008 .

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Fred S. Zufryden,et al.  A Dynamic Programming Approach for Product Selection and Supermarket Shelf-Space Allocation , 1986 .

[15]  David B. Dunson,et al.  Bayesian data analysis, third edition , 2013 .

[16]  Anshul Verma,et al.  Does urge to buy impulsively differ from impulsive buying behaviour? Assessing the impact of situational factors , 2015 .

[17]  Sergey Levine,et al.  Model-Based Reinforcement Learning for Atari , 2019, ICLR.

[18]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[19]  Nan Xu,et al.  CoLight: Learning Network-level Cooperation for Traffic Signal Control , 2019, CIKM.

[20]  Yuval Tassa,et al.  Continuous control with deep reinforcement learning , 2015, ICLR.

[21]  مسعود رسول آبادی,et al.  2011 , 2012, The Winning Cars of the Indianapolis 500.

[22]  Dorota Kurowicka,et al.  Generating random correlation matrices based on vines and extended onion method , 2009, J. Multivar. Anal..

[23]  Pascal Frossard,et al.  Learning Laplacian Matrix in Smooth Graph Signal Representations , 2014, IEEE Transactions on Signal Processing.

[24]  P. Farris,et al.  A Model for Determining Retail Product Category Assortment and Shelf Space Allocation , 1994 .

[25]  R. Quentin Grafton,et al.  truncated normal distribution , 2012 .