论文信息 - Nonparametric Pricing Analytics with Customer Covariates

Nonparametric Pricing Analytics with Customer Covariates

Personalized pricing analytics is becoming an essential tool in retailing. Upon observing the personalized information of each arriving customer, the firm needs to set a price accordingly based on the covariates such as income, education background, past purchasing history to extract more revenue. For new entrants of the business, the lack of historical data may severely limit the power and profitability of personalized pricing. We propose a nonparametric pricing policy to simultaneously learn the preference of customers based on the covariates and maximize the expected revenue over a finite horizon. The policy does not depend on any prior assumptions on how the personalized information affects consumers' preferences (such as linear models). It is adaptively splits the covariate space into smaller bins (hyper-rectangles) and clusters customers based on their covariates and preferences, offering similar prices for customers who belong to the same cluster trading off granularity and accuracy. We show that the algorithm achieves a regret of order $O(\log(T)^2 T^{(2+d)/(4+d)})$, where $T$ is the length of the horizon and $d$ is the dimension of the covariate. It improves the current regret in the literature \citep{slivkins2014contextual}, under mild technical conditions in the pricing context (smoothness and local concavity). We also prove that no policy can achieve a regret less than $O(T^{(2+d)/(4+d)})$ for a particular instance and thus demonstrate the near optimality of the proposed policy.

Guillermo Gallego | Ningyuan Chen

[1] Benjamin Van Roy,et al. Dynamic Pricing with a Prior on Market Response , 2010, Oper. Res..

[2] A. Zeevi,et al. Woodroofe's One-Armed Bandit Problem Revisited , 2009, 0909.0119.

[3] Eli Upfal,et al. Multi-Armed Bandits in Metric Spaces ∗ , 2008 .

[4] Mohsen Bayati,et al. Online Decision-Making with High-Dimensional Covariates , 2015 .

[5] Csaba Szepesvári,et al. –armed Bandits , 2022 .

[6] Wang Chi Cheung,et al. Dynamic Pricing and Demand Learning with Limited Price Experimentation , 2017 .

[7] Peter Auer,et al. Improved Rates for the Stochastic Continuum-Armed Bandit Problem , 2007, COLT.

[8] Bert Zwart,et al. Simultaneously Learning and Optimizing Using Controlled Variance Pricing , 2014, Manag. Sci..

[9] David Simchi-Levi,et al. Dynamic Learning and Price Optimization with Endogeneity Effect , 2016 .

[10] Eric R. Ziegel,et al. The Elements of Statistical Learning , 2003, Technometrics.

[11] Near-Optimal Bisection Search for Nonparametric Dynamic Pricing with Inventory Constraint , 2014 .