论文信息 - A Parallel System for Textual Inference

A Parallel System for Textual Inference

| This paper presents a possible solution for the text inference problem-extracting information unstated in a text, but implied. Text inference is central to natural language applications such as information extraction and dissemination , text understanding, summarization, and translation. Our solution takes advantage of a semantic English dictionary available in electronic form that provides the basis for the development of a large linguistic knowledge base. The inference algorithm consists of a set of highly parallel search methods that when applied to the knowledge base nd contexts in which sentences are interpreted. These contexts reveal information relevant to the text. Implementation, results and parallelism analysis are discussed. T HIS paper addresses the issue of parallelism in a class of problems that is largely unexplored, yet of growing importance. Text inference refers to the problem of extracting information that is not stated directly in a text, but is implied. This may be achieved by reasoning about a text by making logical judgments on the basis of circum-stantial evidence from a large knowledge base that contains knowledge about the world. A related, but much simpler problem is information retrieval where the goal is the recognition of facts, events and properties that are explicitly stated in the text. While current information retrieval systems that process millions of sentences per minute with an accuracy close to that of humans have been built 25], the process of large scale inference has not been automated yet. The major obstacles that need to be resolved are: (1) building knowledge bases large enough to capture world knowledge, (2) nd-ing a knowledge representation scheme good for common sense reasoning, and (3) developing inference methods and control mechanisms able to provide relevant inferences at speeds comparable to humans. In this paper we present a parallel inference system that operates on a very large linguistic knowledge base. The system is scalable both in size and accuracy and is highly parallel. The novelty of this work derives from our use of an extended linguistic knowledge base for English language called WordNet, and an inference algorithm that consists S. Harabagiu is with the and reference IEEECS Log Number D96261. of a set of parallel search procedures over the linguistic semantic network (i.e. the knowledge base). WordNet is being developed at Princeton by a group led by Miller 17]. Text inference is of great importance especially today when there are many newspapers, books and other …

Sanda M. Harabagiu | Dan I. Moldovan | S. Harabagiu | D. Moldovan