论文信息 - Toward an Architecture for Never-Ending Language Learning - 字舞流文

Toward an Architecture for Never-Ending Language Learning

We consider here the problem of building a never-ending language learner; that is, an intelligent computer agent that runs forever and that each day must (1) extract, or read, information from the web to populate a growing structured knowledge base, and (2) learn to perform this task better than on the previous day. In particular, we propose an approach and a set of design principles for such an agent, describe a partial implementation of such a system that has already learned to extract a knowledge base containing over 242,000 beliefs with an estimated precision of 74% after running for 67 days, and discuss lessons learned from this preliminary attempt to build a never-ending learning agent.

Estevam R. Hruschka | Tom M. Mitchell | Burr Settles | Bryan Kisiel | Justin Betteridge | Andrew Carlson | Tom Michael Mitchell | Andrew Carlson | B. Settles | J. Betteridge | B. Kisiel | Burr Settles | Estevam Hruschka

[1] Victor R. Lesser,et al. The Hearsay-II Speech-Understanding System: Integrating Knowledge to Resolve Uncertainty , 1980, CSUR.

[2] Douglas B. Lenat,et al. EURISKO: A Program That Learns New Heuristics and Domain Concepts , 1983, Artif. Intell..

[3] Allen Newell,et al. SOAR: An Architecture for General Intelligence , 1987, Artif. Intell..

[4] Oren Etzioni,et al. PRODIGY: an integrated architecture for planning and learning , 1991, SGAR.

[5] Pat Langley,et al. A design for the ICARUS architecture , 1991, SGAR.

[6] Marti A. Hearst. Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[7] R. Mike Cameron-Jones,et al. FOIL: A Midterm Report , 1993, ECML.

[8] Sebastian Thrun,et al. Lifelong robot learning , 1993, Robotics Auton. Syst..

[9] David Yarowsky,et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[10] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[11] Rich Caruana,et al. Multitask Learning , 1997, Machine-mediated learning.

[12] Yoram Singer,et al. Unsupervised Models for Named Entity Classification , 1999, EMNLP.

[13] Ellen Riloff,et al. Learning Dictionaries for Information Extraction by Multi-Level Bootstrapping , 1999, AAAI/IAAI.

[14] Raymond J. Mooney,et al. A Mutually Beneficial Integration of Data Mining and Information Extraction , 2000, AAAI/IAAI.

[15] Roman Yangarber,et al. Counter-Training in Discovery of Semantic Patterns , 2003, ACL.

[16] Doug Downey,et al. Methods for Domain-Independent Information Extraction from the Web: An Experimental Comparison , 2004, AAAI.

[17] John R Anderson,et al. An integrated theory of the mind. , 2004, Psychological review.

[18] Doug Downey,et al. A Probabilistic Model of Redundancy in Information Extraction , 2005, IJCAI.

[19] Jeffrey P. Bigham,et al. Names and Similarities on the Web: Fact Extraction in the Fast Lane , 2006, ACL.

[20] J. Curran,et al. Minimising semantic drift with Mutual Exclusion Bootstrapping , 2007 .

[21] Oren Etzioni,et al. Strategies for lifelong knowledge extraction from the web , 2007, K-CAP '07.

[22] Ming-Wei Chang,et al. Guiding Semi-Supervision with Constraint-Driven Learning , 2007, ACL.

[23] Patrick Pantel,et al. Entity Extraction via Ensemble Semantics , 2009, EMNLP.

[24] Eric P. Xing,et al. Heterogeneous multitask learning with joint sparsity constraints , 2009, NIPS.

[25] Andrew McCallum,et al. Active Learning by Labeling Features , 2009, EMNLP.

[26] Burr Settles,et al. Active Learning Literature Survey , 2009 .

[27] William W. Cohen,et al. Character-level Analysis of Semi-Structured Documents for Set Expansion , 2009, EMNLP.

[28] Estevam R. Hruschka,et al. Coupled semi-supervised learning for information extraction , 2010, WSDM '10.