Learning to Selectively Transfer: Reinforced Transfer Learning for Deep Text Matching
暂无分享,去创建一个
Feng Ji | Jun Huang | W. Bruce Croft | Haiqing Chen | Liu Yang | Minghui Qiu | Chen Qu | Zhiyu Min | Minghui Qiu | Liu Yang | Chen Qu | Jun Huang | Haiqing Chen | Feng Ji | Zhiyu Min
[1] Christopher Potts,et al. A large annotated corpus for learning natural language inference , 2015, EMNLP.
[2] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[3] Yuan Li,et al. Learning how to Active Learn: A Deep Reinforcement Learning Approach , 2017, EMNLP.
[4] Qiang Yang,et al. A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.
[5] Yash Patel,et al. Learning Sampling Policies for Domain Adaptation , 2018, ArXiv.
[6] Jian Shen,et al. Wasserstein Distance Guided Representation Learning for Domain Adaptation , 2017, AAAI.
[7] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[8] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[9] Jun Huang,et al. Response Ranking with Deep Matching Networks and External Knowledge in Information-seeking Conversation Systems , 2018, SIGIR.
[10] Zhen-Hua Ling,et al. Enhanced LSTM for Natural Language Inference , 2016, ACL.
[11] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[12] Jeffrey Pennington,et al. GloVe: Global Vectors for Word Representation , 2014, EMNLP.
[13] Wei Zhang,et al. R3: Reinforced Ranker-Reader for Open-Domain Question Answering , 2018, AAAI.
[14] Hal Daumé,et al. Frustratingly Easy Domain Adaptation , 2007, ACL.
[15] Mahesan Niranjan,et al. On-line Q-learning using connectionist systems , 1994 .
[16] Bernhard Schölkopf,et al. Correcting Sample Selection Bias by Unlabeled Data , 2006, NIPS.
[17] Marc Peter Deisenroth,et al. Deep Reinforcement Learning: A Brief Survey , 2017, IEEE Signal Processing Magazine.
[18] Xuanjing Huang,et al. Adversarial Multi-task Learning for Text Classification , 2017, ACL.
[19] Tao Qin,et al. Learning What Data to Learn , 2017, ArXiv.
[20] Joelle Pineau,et al. An Actor-Critic Algorithm for Sequence Prediction , 2016, ICLR.
[21] Bowen Zhou,et al. ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs , 2015, TACL.
[22] Lei Li,et al. Reinforced Co-Training , 2018, NAACL.
[23] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .
[24] Ruslan Salakhutdinov,et al. Transfer Learning for Sequence Tagging with Hierarchical Recurrent Networks , 2016, ICLR.
[25] Jakob Uszkoreit,et al. A Decomposable Attention Model for Natural Language Inference , 2016, EMNLP.
[26] Peter Clark,et al. SciTaiL: A Textual Entailment Dataset from Science Question Answering , 2018, AAAI.
[27] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[28] John N. Tsitsiklis,et al. Actor-Critic Algorithms , 1999, NIPS.
[29] Barbara Plank,et al. Learning to select data for transfer learning with Bayesian Optimization , 2017, EMNLP.
[30] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.
[31] R. J. Williams,et al. Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.
[32] Wei Chu,et al. Modelling Domain Relationships for Transfer Learning on Retrieval-based Question Answering Systems in E-commerce , 2017, WSDM.
[33] Wei Chu,et al. AliMe Assist: An Intelligent Assistant for Creating an Innovative E-commerce Experience , 2017, CIKM.
[34] W. Bruce Croft,et al. aNMM: Ranking Short Answer Texts with Attention-Based Neural Matching Model , 2016, CIKM.
[35] W. Bruce Croft,et al. A Deep Relevance Matching Model for Ad-hoc Retrieval , 2016, CIKM.
[36] John Blitzer,et al. Co-Training for Domain Adaptation , 2011, NIPS.
[37] Li Zhao,et al. Reinforcement Learning for Relation Classification From Noisy Data , 2018, AAAI.
[38] Jianfeng Gao,et al. Deep Reinforcement Learning for Dialogue Generation , 2016, EMNLP.
[39] Samuel R. Bowman,et al. A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference , 2017, NAACL.
[40] Dan Roth,et al. End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions , 2018, ACL.
[41] Rui Yan,et al. Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System , 2016, SIGIR.
[42] Jeffrey Pennington,et al. Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection , 2011, NIPS.
[43] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 2005, IEEE Transactions on Neural Networks.
[44] Rui Yan,et al. How Transferable are Neural Networks in NLP Applications? , 2016, EMNLP.