RLNF: Reinforcement Learning based Noise Filtering for Click-Through Rate Prediction

Click-through rate (CTR) prediction aims to recall the advertisements that users are interested in and to lead users to click, which is of critical importance for a variety of online advertising systems. In practice, CTR prediction is generally formulated as a conventional binary classification problem, where the clicked advertisements are positive samples and the others are negative samples. However, directly treating unclicked advertisements as negative samples would suffer from the severe label noise issue, since there exist many reasons why users are interested in a few advertisements but do not click. To address such serious issue, we propose a reinforcement learning based noise filtering approach, dubbed RLNF, which employs a noise filter to select effective negative samples. In RLNF, such selected, effective negative samples can be used to enhance the CTR prediction model, and meanwhile the effectiveness of the noise filter can be enhanced through reinforcement learning using the performance of CTR prediction model as reward. Actually, by alternating the enhancements of the noise filter and the CTR prediction model, the performance of both the noise filter and the CTR prediction model is improved. In our experiments, we equip 7 state-of-the-art CTR prediction models with RLNF. Extensive experiments on a public dataset and an industrial dataset present that RLNF significantly improves the performance of all these 7 CTR prediction models, which indicates both the effectiveness and the generality of RLNF.

[1]  Paul Covington,et al.  Deep Neural Networks for YouTube Recommendations , 2016, RecSys.

[2]  Dongbai Guo,et al.  Deep Interest with Hierarchical Attention Network for Click-Through Rate Prediction , 2020, SIGIR.

[3]  Liang Wang,et al.  Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction , 2019, CIKM.

[4]  Jaana Kekäläinen,et al.  IR evaluation methods for retrieving highly relevant documents , 2000, SIGIR '00.

[5]  Shengyu Zhang,et al.  Field-aware probabilistic embedding neural network for CTR prediction , 2018, RecSys.

[6]  Patrick P. K. Chan,et al.  Convolutional Neural Networks based Click-Through Rate Prediction with Multiple Feature Sequences , 2018, IJCAI.

[7]  Jieming Zhu,et al.  Ensembled CTR Prediction via Knowledge Distillation , 2020, CIKM.

[8]  Gang Fu,et al.  Deep & Cross Network for Ad Click Predictions , 2017, ADKDD@KDD.

[9]  Guorui Zhou,et al.  Deep Interest Network for Click-Through Rate Prediction , 2017, KDD.

[10]  Chengfu Huo,et al.  Deep Match to Rank Model for Personalized Click-Through Rate Prediction , 2020, AAAI.

[11]  Shaowei Cai,et al.  PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector , 2021, AAAI.

[12]  Yang Chen,et al.  Interpretable Click-Through Rate Prediction through Hierarchical Attention , 2020, WSDM.

[13]  Weinan Zhang,et al.  User Behavior Retrieval for Click-Through Rate Prediction , 2020, SIGIR.

[14]  Keping Yang,et al.  Deep Session Interest Network for Click-Through Rate Prediction , 2019, IJCAI.

[15]  Chang Zhou,et al.  Deep Interest Evolution Network for Click-Through Rate Prediction , 2018, AAAI.

[16]  Bo Yuan,et al.  A Dual Input-aware Factorization Machine for CTR Prediction , 2020, IJCAI.

[17]  Hongxia Jin,et al.  Disguise Adversarial Networks for Click-through Rate Prediction , 2017, IJCAI.

[18]  Lucas Theis,et al.  Addressing delayed feedback for continuous training with neural networks in CTR prediction , 2019, RecSys.

[19]  Wei Guo,et al.  Order-aware Embedding Neural Network for CTR Prediction , 2019, SIGIR.

[20]  Stefanos Zafeiriou,et al.  AutoGroup: Automatic Feature Grouping for Modelling Explicit High-Order Feature Interactions in CTR Prediction , 2020, SIGIR.

[21]  Jian Yang,et al.  Attention Convolutional Neural Network for Advertiser-level Click-through Rate Forecasting , 2018, WWW.

[22]  Roi Blanco,et al.  Click Through Rate Prediction for Local Search Results , 2017, WSDM.

[23]  Yuandong Tian,et al.  Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction , 2020, KDD.

[24]  Steffen Rendle,et al.  Factorization Machines , 2010, 2010 IEEE International Conference on Data Mining.

[25]  Heng-Tze Cheng,et al.  Wide & Deep Learning for Recommender Systems , 2016, DLRS@RecSys.

[26]  Matthew Richardson,et al.  Predicting clicks: estimating the click-through rate for new ads , 2007, WWW '07.

[27]  Jianke Zhu,et al.  Deep Time-Stream Framework for Click-Through Rate Prediction by Tracking Interest Evolution , 2020, AAAI.

[28]  Tat-Seng Chua,et al.  HoAFM: A High-order Attentive Factorization Machine for CTR Prediction , 2020, Inf. Process. Manag..

[29]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[30]  Amin Mantrach,et al.  Deep Character-Level Click-Through Rate Prediction for Sponsored Search , 2017, SIGIR.

[31]  Ying Cui,et al.  Dynamic ad layout revenue optimization for display advertising , 2012, ADKDD '12.

[32]  Bin Liu,et al.  AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction , 2020, KDD.

[33]  Ji-Rong Wen,et al.  CTRec: A Long-Short Demands Evolution Model for Continuous-Time Recommendation , 2019, SIGIR.

[34]  Fuchun Sun,et al.  Regularized Adversarial Sampling and Deep Time-aware Attention for Click-Through Rate Prediction , 2019, CIKM.

[35]  Yunming Ye,et al.  DeepFM: A Factorization-Machine based Neural Network for CTR Prediction , 2017, IJCAI.

[36]  Yi Ren,et al.  Graph Intention Network for Click-through Rate Prediction in Sponsored Search , 2019, SIGIR.

[37]  Xiuwu Zhang,et al.  Representation Learning-Assisted Click-Through Rate Prediction , 2019, IJCAI.

[38]  Enhong Chen,et al.  An Ad CTR Prediction Method Based on Feature Learning of Deep and Shallow Layers , 2017, CIKM.

[39]  Tian Zhou,et al.  DeepLight: Deep Lightweight Feature Interactions for Accelerating CTR Predictions in Ad Serving , 2020, WSDM.

[40]  Qiang Liu,et al.  TFNet: Multi-Semantic Feature Interaction for CTR Prediction , 2020, SIGIR.

[41]  Bin Liu,et al.  AutoHash: Learning Higher-Order Feature Interactions for Deep CTR Prediction , 2022, IEEE Transactions on Knowledge and Data Engineering.