Robust Factorization Machines for User Response Prediction

Factorization machines (FMs) are a state-of-the-art model class for user response prediction in the computational advertising domain. Rapid growth of internet and mobile device usage has given rise to multiple customer touchpoints. This coupled with factors like high cookie churn rate results in a fragmented view of user activity at the advertiser»s end. Current literature assumes procured user signals as the absolute truth, which is contested by the absence of deterministic identity linkage across a user's multiple avatars. In this work, we characterize the data uncertainty using Robust Optimization (RO) paradigm to design approaches that are immune against perturbations. We propose two novel algorithms: robust factorization machine (RFM) and its field-aware variant (RFFM), under interval uncertainty. These formulations are generic and can find applicability in any classification setting under noise. We provide a distributed and scalable Spark implementation using parallel stochastic gradient descent. In the experiments conducted on three real-world datasets, the robust counterparts outperform the baselines significantly under perturbed settings. Our experimental findings reveal interesting connections between choice of uncertainty set and the noise-proofness of resulting models.

[1]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[2]  M. de Rijke,et al.  A Neural Click Model for Web Search , 2016, WWW.

[3]  Vyacheslav Tuzlukov,et al.  Signal Processing Noise , 2002 .

[4]  Naeini Mahdi Pakdaman,et al.  Binary Classifier Calibration Using an Ensemble of Near Isotonic Regression Models , 2016 .

[5]  Reid A. Johnson,et al.  Calibrating Probability with Undersampling for Unbalanced Classification , 2015, 2015 IEEE Symposium Series on Computational Intelligence.

[6]  Olivier Chapelle,et al.  Field-aware Factorization Machines in a Real-world Online Advertising System , 2017, WWW.

[7]  Sergei Vassilvitskii,et al.  Handling forecast errors while bidding for display advertising , 2012, WWW.

[8]  Jun Wang,et al.  Display Advertising with Real-Time Bidding (RTB) and Behavioural Targeting , 2016, Found. Trends Inf. Retr..

[9]  Mahdi Pakdaman Naeini,et al.  Binary Classifier Calibration Using an Ensemble of Near Isotonic Regression Models , 2015, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[10]  Le Thi Hoai An,et al.  Robust Feature Selection for SVMs under Uncertain Data , 2013, ICDM.

[11]  Constantine Caramanis,et al.  Theory and Applications of Robust Optimization , 2010, SIAM Rev..

[12]  Martin Wattenberg,et al.  Ad click prediction: a view from the trenches , 2013, KDD.

[13]  Lise Getoor,et al.  Probabilistic Visitor Stitching on Cross-Device Web Logs , 2017, WWW.

[14]  Reynold Xin,et al.  Apache Spark , 2016 .

[15]  Jun Wang,et al.  Product-Based Neural Networks for User Response Prediction , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[16]  Gideon S. Mann,et al.  Distributed Training Strategies for the Structured Perceptron , 2010, NAACL.

[17]  Rómer Rosales,et al.  Simple and Scalable Response Prediction for Display Advertising , 2014, ACM Trans. Intell. Syst. Technol..

[18]  Qian Zhao,et al.  GB-CENT: Gradient Boosted Categorical Embedding and Numerical Trees , 2017, WWW.

[19]  Silvio Lattanzi,et al.  Linking Users Across Domains with Location Data: Theory and Validation , 2016, WWW.

[20]  Enhong Chen,et al.  Sparse Factorization Machines for Click-through Rate Prediction , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[21]  L. Ghaoui,et al.  Robust Classification with Interval Data , 2003 .

[22]  Joaquin Quiñonero Candela,et al.  Practical Lessons from Predicting Clicks on Ads at Facebook , 2014, ADKDD'14.

[23]  Alexander J. Smola,et al.  Parallelized Stochastic Gradient Descent , 2010, NIPS.

[24]  Chih-Jen Lin,et al.  Field-aware Factorization Machines for CTR Prediction , 2016, RecSys.

[25]  Dick den Hertog,et al.  A practical guide to robust optimization , 2015, 1501.02634.