相关论文

Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

Abstract:One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent. To measure progress towards that goal, we argue for the usefulness of a set of proxy tasks that evaluate reading comprehension via question answering. Our tasks measure understanding in several ways: whether a system is able to answer questions via chaining facts, simple induction, deduction and many more. The tasks are designed to be prerequisites for any system that aims to be capable of conversing with a human. We believe many existing learning systems can currently not solve them, and hence our aim is to classify these tasks into skill sets, so that researchers can identify (and then rectify) the failings of their systems. We also extend and improve the recently introduced Memory Networks model, and show it is able to solve some, but not all, of the tasks.

参考文献

[1]  Marvin Minsky,et al.  Perceptrons: An Introduction to Computational Geometry , 1969 .

[2]  Terry Winograd,et al.  Understanding natural language , 1974 .

[3]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[4]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[5]  Gunnar Rätsch,et al.  Predicting Time Series with Support Vector Machines , 1997, ICANN.

[6]  Hwee Tou Ng,et al.  A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[7]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[8]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[9]  Nick Montfort,et al.  Twisty Little Passages: An Approach to Interactive Fiction , 2003 .

[10]  David Baxter,et al.  On the Effective Use of Cyc in a Question Answering System , 2005 .

[11]  Dan Klein,et al.  Simple Coreference Resolution with Rich Syntactic and Semantic Features , 2009, EMNLP.

[12]  Peter Norvig,et al.  The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.

[13]  Jason Weston,et al.  Towards Understanding Situated Natural Language , 2010, AISTATS.

[14]  Heeyoung Lee,et al.  A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[15]  Hector J. Levesque,et al.  The Winograd Schema Challenge , 2011, AAAI Spring Symposium: Logical Formalizations of Commonsense Reasoning.

[16]  Raymond J. Mooney,et al.  Learning to Interpret Natural Language Navigation Instructions from Observations , 2011, Proceedings of the AAAI Conference on Artificial Intelligence.

[17]  Dan Klein,et al.  Learning Dependency-Based Compositional Semantics , 2011, CL.

[18]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[19]  James Pustejovsky,et al.  TempEval-3: Evaluating Events, Time Expressions, and Temporal Relations , 2012, ArXiv.

[20]  Matthew Richardson,et al.  MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text , 2013, EMNLP.

[21]  Percy Liang,et al.  Lambda Dependency-Based Compositional Semantics , 2013, ArXiv.

[22]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[23]  Oren Etzioni,et al.  Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.

[24]  Peter Clark,et al.  Modeling Biological Processes for Reading Comprehension , 2014, EMNLP.

[25]  Mo Yu Factor-based Compositional Embedding Models , 2014 .

[26]  Xuchen Yao,et al.  Freebase QA: Information Extraction or Semantic Parsing? , 2014, ACL 2014.

[27]  Alex Graves,et al.  Neural Turing Machines , 2014, ArXiv.

[28]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[29]  Oren Etzioni,et al.  Open question answering over curated and extracted knowledge bases , 2014, KDD.

[30]  Kam-Fai Wong,et al.  Towards Neural Network-based Reasoning , 2015, ArXiv.

[31]  Jason Weston,et al.  End-To-End Memory Networks , 2015, NIPS.

[32]  Jason Weston,et al.  Large-scale Simple Question Answering with Memory Networks , 2015, ArXiv.

[33]  Jason Weston,et al.  Memory Networks , 2014, ICLR.

[34]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[35]  Jason Weston,et al.  The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations , 2015, ICLR.

[36]  Richard Socher,et al.  Ask Me Anything: Dynamic Memory Networks for Natural Language Processing , 2015, ICML.

[37]  Xinyun Chen Under Review as a Conference Paper at Iclr 2017 Delving into Transferable Adversarial Ex- Amples and Black-box Attacks , 2016 .

引用
Abstractions for Narrative Comprehension Tasks
INT/WICED@AIIDE
2018
A zoom tracking algorithm based on deep learning
Multimedia Tools and Applications
2021
Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking
ACL
2020
Enhancing Semantic Word Representations by Embedding Deep Word Relationships
ICCAE 2019
2019
Diagnosing BERT with Retrieval Heuristics
ECIR
2020
Mapping distributional to model-theoretic semantic spaces: a baseline
ArXiv
2016
Learning Word Embeddings with Domain Awareness
ArXiv
2019
Few-Shot Object Recognition from Machine-Labeled Web Images
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
2016
Streaming MANN: A Streaming-Based Inference for Energy-Efficient Memory-Augmented Neural Networks
ArXiv
2018
WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval
SIGIR
2018
A^3: Accelerating Attention Mechanisms in Neural Networks with Approximation
2020 IEEE International Symposium on High Performance Computer Architecture (HPCA)
2020
ALICE: A Natural Language Question Answering System Using Dynamic Attention and Memory
2018
Textual Inference for Machine Comprehension
2016
CommAI: Evaluating the first steps towards a useful general AI
ICLR
2017
Question Answering Using Deep Learning
2016
Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering
ArXiv
2016
RuleBERT: Teaching Soft Rules to Pre-Trained Language Models
EMNLP
2021
Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking
STARSEM
2021
Generating Fine-Grained Open Vocabulary Entity Type Descriptions
ACL
2018
Using Natural Language Relations between Answer Choices for Machine Comprehension
NAACL
2019