Paper Information - Learning to Ask Medical Questions using Reinforcement Learning

Learning to Ask Medical Questions using Reinforcement Learning

We propose a novel reinforcement learning-based approach for adaptive and iterative feature selection. Given a masked vector of input features, a reinforcement learning agent iteratively selects features to unmask, and uses them to predict an outcome once it is sufficiently confident. The algorithm makes use of a novel environment setting, corresponding to a non-stationary Markov Decision Process. A key component of our approach is a guesser network, trained to predict the outcome from the selected features and parametrizing the reward function. Applying our method to a national survey dataset, we show that it not only outperforms strong baselines when the prediction must be made from a small number of input features, but is also considerably more interpretable. Our code is publicly available.
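
The loop described in the abstract (reveal one masked feature at a time, then guess, with the reward computed from a guesser) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the authors' code: `toy_guesser` is a fixed logistic model standing in for the trained guesser network, `AskingEnv` is a hypothetical environment class, the reward (the guesser's probability for the true label) is one plausible choice, and the confidence-threshold rule in `run_episode` replaces the learned agent. Because the guesser is fixed here, the sketch does not capture the non-stationarity mentioned in the abstract.

```python
import math
import random
from typing import Callable, List, Optional, Sequence, Tuple

Observation = Tuple[List[float], List[int]]  # (masked values, mask bits)


def toy_guesser(values: Sequence[float], mask: Sequence[int]) -> float:
    """Hypothetical stand-in for a trained guesser network.

    Returns P(y = 1) computed from the revealed features only,
    using fixed, made-up weights.
    """
    weights = [2.0, -1.0, 0.5, 0.0]
    z = sum(w * v * m for w, v, m in zip(weights, values, mask))
    return 1.0 / (1.0 + math.exp(-z))


class AskingEnv:
    """One episode over one sample: unmask features, then guess."""

    def __init__(self, features: Sequence[float], label: int,
                 guesser: Callable[[Sequence[float], Sequence[int]], float],
                 max_questions: int) -> None:
        self.features = list(features)
        self.label = label
        self.guesser = guesser
        self.max_questions = max_questions
        self.mask = [0] * len(self.features)
        self.steps = 0  # number of feature-reveal actions taken so far

    @property
    def guess_action(self) -> int:
        # Actions 0..n-1 reveal a feature; action n means "guess now".
        return len(self.features)

    def observe(self) -> Observation:
        masked = [x * m for x, m in zip(self.features, self.mask)]
        return masked, list(self.mask)

    def step(self, action: int) -> Tuple[Observation, float, bool]:
        """Return (observation, reward, done)."""
        if action == self.guess_action or self.steps >= self.max_questions:
            # Assumed reward: guesser probability assigned to the true label.
            p1 = self.guesser(*self.observe())
            reward = p1 if self.label == 1 else 1.0 - p1
            return self.observe(), reward, True
        self.mask[action] = 1
        self.steps += 1
        return self.observe(), 0.0, False


def run_episode(env: AskingEnv, confidence: float = 0.9,
                rng: Optional[random.Random] = None) -> Tuple[float, int]:
    """Placeholder policy: ask random unasked features until confident."""
    rng = rng or random.Random(0)
    total, done = 0.0, False
    while not done:
        values, mask = env.observe()
        p1 = env.guesser(values, mask)
        unasked = [i for i, m in enumerate(mask) if m == 0]
        if max(p1, 1.0 - p1) >= confidence or not unasked:
            action = env.guess_action
        else:
            action = rng.choice(unasked)
        _, reward, done = env.step(action)
        total += reward
    return total, sum(env.mask)
```

A call such as `run_episode(AskingEnv([1.0, 0.0, 1.0, 1.0], 1, toy_guesser, max_questions=3))` returns the episode reward and the number of features revealed. In a trained system the random choice would be replaced by the agent's policy, and the guesser would be updated alongside it.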
