暂无分享,去创建一个
Jianfeng Gao | Xiujun Li | Li Deng | Lihong Li | Xiaodong He | Jianshu Chen | Ji He | Lihong Li | Xiujun Li | Jianfeng Gao | Xiaodong He | Jianshu Chen | L. Deng | Ji He
[1] G. G. Stokes. "J." , 1890, The New Yale Book of Quotations.
[2] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[3] Gerald Tesauro,et al. Temporal Difference Learning and TD-Gammon , 1995, J. Int. Comput. Games Assoc..
[4] Gerald Tesauro,et al. Temporal difference learning and TD-Gammon , 1995, CACM.
[5] Andrew McCallum,et al. Reinforcement learning with selective perception and hidden state , 1996 .
[6] F. Dwyer. Customer lifetime valuation to support marketing decision making , 1997 .
[7] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.
[8] Leslie Pack Kaelbling,et al. Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..
[9] Bram Bakker,et al. Reinforcement Learning with Long Short-Term Memory , 2001, NIPS.
[10] Richard S. Sutton,et al. Predictive Representations of State , 2001, NIPS.
[11] Naoki Abe,et al. Sequential cost-sensitive decision making with reinforcement learning , 2002, KDD.
[12] A. ADoefaa,et al. ? ? ? ? f ? ? ? ? ? , 2003 .
[13] Joelle Pineau,et al. Point-based value iteration: An anytime algorithm for POMDPs , 2003, IJCAI.
[14] Michail G. Lagoudakis,et al. Least-Squares Policy Iteration , 2003, J. Mach. Learn. Res..
[15] Michael J. A. Berry,et al. Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management , 2004 .
[16] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[17] Steve J. Young,et al. Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..
[18] Oded Netzer,et al. A Hidden Markov Model of Customer Relationship Dynamics , 2008, Mark. Sci..
[19] Byron Boots,et al. Closing the learning-planning loop with predictive state representations , 2011, Int. J. Robotics Res..
[20] David Silver,et al. Concurrent Reinforcement Learning from Customer Interactions , 2013, ICML.
[21] Geoffrey Zweig,et al. Recent advances in deep learning for speech research at Microsoft , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.
[22] Dong Yu,et al. Deep Learning: Methods and Applications , 2014, Found. Trends Signal Process..
[23] Philip S. Thomas,et al. Personalized Ad Recommendation Systems for Life-Time Value Optimization with Guarantees , 2015, IJCAI.
[24] Vukosi Marivate,et al. Improved empirical methods in reinforcement-learning evaluation , 2015 .
[25] Yegor Tkachenko,et al. Autonomous CRM Control via CLV Approximation with Deep Reinforcement Learning in Discrete and Continuous Action Space , 2015, ArXiv.
[26] Regina Barzilay,et al. Language Understanding for Text-based Games using Deep Reinforcement Learning , 2015, EMNLP.
[27] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[28] Honglak Lee,et al. Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.
[29] Peter Stone,et al. Deep Recurrent Q-Learning for Partially Observable MDPs , 2015, AAAI Fall Symposia.
[30] Moshe Dor,et al. אבן, and: Stone , 2017 .
[31] Xinyun Chen. Under Review as a Conference Paper at Iclr 2017 Delving into Transferable Adversarial Ex- Amples and Black-box Attacks , 2016 .
[32] Julia Eichmann. Customer Relationship Management Concept Strategy And Tools , 2016 .