Impression Allocation for Combating Fraud in E-commerce Via Deep Reinforcement Learning with Action Norm Penalty
Bo An | Zhao Li | Yifan Yang | Haifeng Lu | Mengchen Zhao | Chen Chu
[1] Yuval Tassa, et al. Continuous control with deep reinforcement learning, 2015, ICLR.
[2] R. Bucklin, et al. Modeling Purchase Behavior at an E-Commerce Web Site: A Task-Completion Approach, 2004.
[3] Hyun Ah Song, et al. FRAUDAR: Bounding Graph Fraud in the Face of Camouflage, 2016, KDD.
[4] Bo An, et al. Data Poisoning Attacks on Multi-Task Relationship Learning, 2018, AAAI.
[5] Dirk Van den Poel, et al. Predicting online-purchasing behaviour, 2005, Eur. J. Oper. Res.
[6] Ee-Peng Lim, et al. Detecting product review spammers using rating behaviors, 2010, CIKM.
[7] Jie Zhang, et al. Towards Collusive Fraud Detection in Online Reviews, 2015, 2015 IEEE International Conference on Data Mining.
[8] Sebastian Scherer, et al. Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution, 2017, ICML.
[9] Demis Hassabis, et al. Mastering the game of Go with deep neural networks and tree search, 2016, Nature.
[10] Chih-Jen Lin, et al. Field-aware Factorization Machines for CTR Prediction, 2016, RecSys.
[11] Yan Hong, et al. Reinforcement Mechanism Design, with Applications to Dynamic Pricing in Sponsored Search Auctions, 2017, ArXiv.
[12] Demis Hassabis, et al. Mastering the game of Go without human knowledge, 2017, Nature.
[13] Zhao Li, et al. Fraud Transaction Recognition: A Money Flow Network Approach, 2015, CIKM.
[14] Pingzhong Tang, et al. Reinforcement mechanism design, 2017, IJCAI.
[15] Tom Schaul, et al. Dueling Network Architectures for Deep Reinforcement Learning, 2015, ICML.
[16] Yiwei Zhang, et al. Reinforcement Mechanism Design for Fraudulent Behaviour in e-Commerce, 2018, AAAI.
[17] Yiwei Zhang, et al. Reinforcement Mechanism Design for e-commerce, 2017, WWW.
[18] Guy Lever, et al. Deterministic Policy Gradient Algorithms, 2014, ICML.
[19] David Silver, et al. Deep Reinforcement Learning with Double Q-Learning, 2015, AAAI.
[20] Bo An, et al. Efficient Label Contamination Attacks Against Black-Box Learning Models, 2017, IJCAI.
[21] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[22] Gaël Varoquaux, et al. Scikit-learn: Machine Learning in Python, 2011, J. Mach. Learn. Res.