Rethinking Cooperative Rationalization: Introspective Extraction and Complement Control

Selective rationalization has become a common mechanism for ensuring that predictive models reveal how they use any available features. The selection may be soft or hard, and identifies a subset of input features relevant for prediction. The setup can be viewed as a cooperative game between the selector (a.k.a. the rationale generator) and the predictor, which makes use of only the selected features. The cooperative setting may, however, be compromised for two reasons. First, the generator typically has no direct access to the outcome it aims to justify, resulting in poor performance. Second, there is typically no control exerted on the information left outside the selection. We revise the overall cooperative framework to address these challenges. We introduce an introspective model which explicitly predicts and incorporates the outcome into the selection process. Moreover, we explicitly control the rationale complement via an adversary so as not to leave any useful information out of the selection. We show that the two complementary mechanisms both maintain high predictive accuracy and lead to comprehensive rationales.
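
To make the described setup concrete, the sketch below outlines the three players in a toy PyTorch-style form: an introspective generator that first guesses the label and conditions its token selection on that guess, a predictor that sees only the selected tokens, and an adversarial complement predictor that sees only the unselected tokens. This is a minimal illustration, not the authors' implementation; the module names, layer sizes, straight-through hard selection, and loss weights are all assumptions made for the example.

```python
# Minimal sketch (illustrative only) of the three-player rationalization setup:
# introspective generator, masked predictor, and adversarial complement predictor.
import torch
import torch.nn as nn
import torch.nn.functional as F


class IntrospectiveGenerator(nn.Module):
    def __init__(self, vocab, emb=64, hid=64, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab, emb)
        self.enc = nn.GRU(emb, hid, batch_first=True, bidirectional=True)
        self.label_head = nn.Linear(2 * hid, n_classes)       # introspective label guess
        self.select_head = nn.Linear(2 * hid + n_classes, 1)  # per-token selection logit

    def forward(self, x):
        h, _ = self.enc(self.embed(x))                         # (B, T, 2*hid)
        y_hat = self.label_head(h.mean(dim=1))                 # predicted outcome
        y_feat = y_hat.softmax(-1).unsqueeze(1).expand(-1, x.size(1), -1)
        logits = self.select_head(torch.cat([h, y_feat], -1)).squeeze(-1)
        probs = torch.sigmoid(logits)
        hard = (probs > 0.5).float()
        z = hard + probs - probs.detach()                      # straight-through hard mask
        return z, y_hat


class MaskedPredictor(nn.Module):
    def __init__(self, vocab, emb=64, hid=64, n_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab, emb)
        self.enc = nn.GRU(emb, hid, batch_first=True)
        self.out = nn.Linear(hid, n_classes)

    def forward(self, x, mask):
        e = self.embed(x) * mask.unsqueeze(-1)                 # zero out unselected tokens
        h, _ = self.enc(e)
        return self.out(h.mean(dim=1))


def generator_objective(gen, pred, compl, x, y, lam_sparse=0.01, lam_adv=1.0):
    """One step of the generator/predictor objective: the rationale must suffice
    for prediction, the introspective guess must be accurate, the selection must
    stay sparse, and the generator is rewarded when the complement player fails."""
    z, y_hat = gen(x)
    loss_pred = F.cross_entropy(pred(x, z), y)                 # predict from rationale
    loss_intro = F.cross_entropy(y_hat, y)                     # introspective prediction
    loss_compl = F.cross_entropy(compl(x, 1.0 - z), y)         # adversary on complement
    sparsity = z.mean()                                        # keep rationales short
    return loss_pred + loss_intro + lam_sparse * sparsity - lam_adv * loss_compl
```

In a full training loop under these assumptions, the complement predictor would be updated with its own optimizer to minimize `loss_compl`, while the generator and predictor minimize the returned objective; the generator thus gains only when even the best complement predictor cannot recover the label from the leftover text, which is what keeps useful information from being left out of the selection.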
