TextNet - A text-based intelligent system

A large collection of texts may be reached through the Internet and this provides a powerful platform from which common-sense knowledge may be gathered. This paper presents a system that contains a core knowledge base structured around WordNet, a lexical database, capable of extracting contextual information from a given input text. Such context information is then used to retrieve other texts from the Internet that relate to that context. When processed by the system, these new texts bring more information that represents an enhanced domain context for the initial text. This is an incremental method for text processing that acquires domain knowledge from other texts. The paper describes the system architecture, its core knowledge base and inference engine, and the acquisition of new knowledge from corpora.

[1]  Sanda M. Harabagiu An Application of WordNet to Prepositional Attachment , 1996, ACL.

[2]  Richard Fikes,et al.  Information Brokers: Gathering Information from Heterogeneous Information Sources , 1998 .

[3]  Uri Zernik,et al.  Lexical acquisition: Exploiting on-line resources to build a lexicon. , 1991 .

[4]  Sanda M. Harabagiu,et al.  A parallel algorithm for text inference , 1996, Proceedings of International Conference on Parallel Processing.

[5]  M HarabagiuSanda,et al.  TextNet A text-based intelligent system , 1997 .

[6]  S.M. Harabagiu,et al.  PARIS: a parallel inference system , 1996, Proceedings Eighth IEEE International Conference on Tools with Artificial Intelligence.

[7]  Sung-Hyon Myaeng,et al.  DR-LINK: A System Update for TREC-2 , 1993, TREC.

[8]  Terry Winograd,et al.  Understanding natural language , 1974 .

[9]  Hwee Tou Ng,et al.  On the Role of Coherence in Abductive Explanation , 1990, AAAI.

[10]  Oren Etzioni,et al.  Multi-Service Search and Comparison Using the MetaCrawler , 1995 .

[11]  Dan I. Moldovan,et al.  SNAP: A Market-Propagation Architecture for Knowledge Processing , 1992, IEEE Trans. Parallel Distributed Syst..

[12]  Divesh Srivastava,et al.  The Information Manifold , 1995 .

[13]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[14]  Peter Norvig,et al.  Marker Passing as a Weak Method for Text Inferencing , 1989, Cogn. Sci..

[15]  Sanda M. Harabagiu,et al.  Knowledge processing on an extended wordnet , 1998 .

[16]  John F. Sowa Toward the Expressive Power of Natural Language , 1991, Principles of Semantic Networks.

[17]  Oren Etzioni,et al.  Moving Up the Information Food Chain: Deploying Softbots on the World Wide Web , 1996, AI Mag..

[18]  Sanda M. Harabagiu,et al.  A Marker Propagation Algorithm for Text Coherence , 1995 .

[20]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[21]  Oren Etzioni,et al.  A softbot-based interface to the Internet , 1994, CACM.

[22]  Chung Hee Hwang,et al.  An Episodic Knowledge Representation for Narrative Texts , 1989, KR.

[23]  Eric Horvitz,et al.  Challenge problems for artificial intelligence , 1996, AAAI 1996.

[24]  Lucja Iwanska A General Semantic Model of Negation in Natural Language: Representation and Inference , 1992, KR.

[25]  Oren Etzioni,et al.  Multi-Engine Search and Comparison Using the MetaCrawler , 1995, World Wide Web J..

[26]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[27]  Craig A. Knoblock,et al.  Cooperating Agents for Information Retrieval , 1994, CoopIS.

[28]  Sanda Harabagiu Testing Gricean Constraints on a WordNet-based Coherence Evaluation System , 1996 .

[29]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[30]  Oren Etzioni,et al.  Category Translation: Learning to Understand Information on the Internet , 1995, IJCAI.