论文信息 - Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

Jason Weston

Antoine Bordes

Tomas Mikolov

Sumit Chopra

Antoine Bordes

J. Weston

S. Chopra

Tomas Mikolov

Abstract:One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent. To measure progress towards that goal, we argue for the usefulness of a set of proxy tasks that evaluate reading comprehension via question answering. Our tasks measure understanding in several ways: whether a system is able to answer questions via chaining facts, simple induction, deduction and many more. The tasks are designed to be prerequisites for any system that aims to be capable of conversing with a human. We believe many existing learning systems can currently not solve them, and hence our aim is to classify these tasks into skill sets, so that researchers can identify (and then rectify) the failings of their systems. We also extend and improve the recently introduced Memory Networks model, and show it is able to solve some, but not all, of the tasks.

参考文献

[1] Marvin Minsky,et al. Perceptrons: An Introduction to Computational Geometry , 1969 .

[2] Terry Winograd,et al. Understanding natural language , 1974 .

[3] Geoffrey E. Hinton,et al. Learning internal representations by error propagation , 1986 .

[4] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[5] Gunnar Rätsch,et al. Predicting Time Series with Support Vector Machines , 1997, ICANN.

[6] Hwee Tou Ng,et al. A Machine Learning Approach to Coreference Resolution of Noun Phrases , 2001, CL.

[7] Michael I. Jordan,et al. On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[8] Zoubin Ghahramani,et al. Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[9] Nick Montfort,et al. Twisty Little Passages: An Approach to Interactive Fiction , 2003 .

[10] David Baxter,et al. On the Effective Use of Cyc in a Question Answering System , 2005 .

[11] Dan Klein,et al. Simple Coreference Resolution with Rich Syntactic and Semantic Features , 2009, EMNLP.

[12] Peter Norvig,et al. The Unreasonable Effectiveness of Data , 2009, IEEE Intelligent Systems.

[13] Jason Weston,et al. Towards Understanding Situated Natural Language , 2010, AISTATS.

[14] Heeyoung Lee,et al. A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[15] Hector J. Levesque,et al. The Winograd Schema Challenge , 2011, AAAI Spring Symposium: Logical Formalizations of Commonsense Reasoning.

[16] Raymond J. Mooney,et al. Learning to Interpret Natural Language Navigation Instructions from Observations , 2011, Proceedings of the AAAI Conference on Artificial Intelligence.

[17] Dan Klein,et al. Learning Dependency-Based Compositional Semantics , 2011, CL.

[18] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[19] James Pustejovsky,et al. TempEval-3: Evaluating Events, Time Expressions, and Temporal Relations , 2012, ArXiv.

[20] Matthew Richardson,et al. MCTest: A Challenge Dataset for the Open-Domain Machine Comprehension of Text , 2013, EMNLP.

[21] Percy Liang,et al. Lambda Dependency-Based Compositional Semantics , 2013, ArXiv.

[22] Andrew Chou,et al. Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[23] Oren Etzioni,et al. Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.

[24] Peter Clark,et al. Modeling Biological Processes for Reading Comprehension , 2014, EMNLP.

[25] Mo Yu. Factor-based Compositional Embedding Models , 2014 .

[26] Xuchen Yao,et al. Freebase QA: Information Extraction or Semantic Parsing? , 2014, ACL 2014.

[27] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.

[28] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[29] Oren Etzioni,et al. Open question answering over curated and extracted knowledge bases , 2014, KDD.

[30] Kam-Fai Wong,et al. Towards Neural Network-based Reasoning , 2015, ArXiv.

[31] Jason Weston,et al. End-To-End Memory Networks , 2015, NIPS.

[32] Jason Weston,et al. Large-scale Simple Question Answering with Memory Networks , 2015, ArXiv.

[33] Jason Weston,et al. Memory Networks , 2014, ICLR.

[34] Phil Blunsom,et al. Teaching Machines to Read and Comprehend , 2015, NIPS.

[35] Jason Weston,et al. The Goldilocks Principle: Reading Children's Books with Explicit Memory Representations , 2015, ICLR.

[36] Richard Socher,et al. Ask Me Anything: Dynamic Memory Networks for Natural Language Processing , 2015, ICML.

[37] Xinyun Chen. Under Review as a Conference Paper at Iclr 2017 Delving into Transferable Adversarial Ex- Amples and Black-box Attacks , 2016 .

引用

Abstractions for Narrative Comprehension Tasks

INT/WICED@AIIDE

2018

A zoom tracking algorithm based on deep learning

Multimedia Tools and Applications

2021

Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking

ACL

2020

Streaming MANN: A Streaming-Based Inference for Energy-Efficient Memory-Augmented Neural Networks

ArXiv

2018

WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval

SIGIR

2018

A^3: Accelerating Attention Mechanisms in Neural Networks with Approximation

2020 IEEE International Symposium on High Performance Computer Architecture (HPCA)

2020

ALICE: A Natural Language Question Answering System Using Dynamic Attention and Memory

2018

Textual Inference for Machine Comprehension

2016

CommAI: Evaluating the first steps towards a useful general AI

ICLR

2017

Question Answering Using Deep Learning

2016

Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering

ArXiv

2016

RuleBERT: Teaching Soft Rules to Pre-Trained Language Models

EMNLP

2021

Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking

STARSEM

2021

Generating Fine-Grained Open Vocabulary Entity Type Descriptions

ACL

2018

Using Natural Language Relations between Answer Choices for Machine Comprehension

NAACL

2019

Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

Abstractions for Narrative Comprehension Tasks

A zoom tracking algorithm based on deep learning

Zero-Shot Transfer Learning with Synthesized Data for Multi-Domain Dialogue State Tracking

Enhancing Semantic Word Representations by Embedding Deep Word Relationships

Diagnosing BERT with Retrieval Heuristics

Mapping distributional to model-theoretic semantic spaces: a baseline

Learning Word Embeddings with Domain Awareness

Few-Shot Object Recognition from Machine-Labeled Web Images

Streaming MANN: A Streaming-Based Inference for Energy-Efficient Memory-Augmented Neural Networks

WikiPassageQA: A Benchmark Collection for Research on Non-factoid Answer Passage Retrieval

A^3: Accelerating Attention Mechanisms in Neural Networks with Approximation

ALICE: A Natural Language Question Answering System Using Dynamic Attention and Memory

Textual Inference for Machine Comprehension

CommAI: Evaluating the first steps towards a useful general AI

Question Answering Using Deep Learning

Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question Answering

RuleBERT: Teaching Soft Rules to Pre-Trained Language Models

Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking

Generating Fine-Grained Open Vocabulary Entity Type Descriptions

Using Natural Language Relations between Answer Choices for Machine Comprehension