A knowledge extaction system (KEYS) based on UNL knowledge infrastructure

With the exponential growth of information available on the internet pages, humans need to extract specific information has also witnessed an ever growing increase. This paper presents KEYS (Knowledge Extraction sYStem). It searches for information inside documents represented in Universal Networking Language (UNL), i.e., in semantic hyper-graphs. This allows for retrieval and extraction practices that are language-independent and semantically-oriented. It is expected to provide high-quality knowledge extraction through a shallow analysis of the source text into the UNL using a specific ontological relations then generate the resulting UNL document into several different target languages in a fully-automatic manner. This is expected to present a novel approach to the topic of identifying named entities; extracting names with all its types from a natural language texts. The Precision measurement of the system is 0.86 while recall measurement is 0.82.

[1]  James R. Cowie,et al.  Automatic Analysis of Descriptive Texts , 1983, ANLP.

[2]  Patrick J. Altomari,et al.  FOCUS OF TIPSTER PHASES I and II , 1996, TIPSTER.

[3]  Oren Etzioni,et al.  Strategies for lifelong knowledge extraction from the web , 2007, K-CAP '07.

[4]  Naomi Sager,et al.  Natural Language Information Processing: A Computer Grammar of English and Its Applications , 1980 .

[5]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[6]  H. Uchida,et al.  The Universal Networking Language beyond Machine Translation , 2001 .

[7]  Richard Edward Cullingford,et al.  Script application: computer understanding of newspaper stories. , 1977 .

[8]  Wendy G. Lehnert,et al.  Information extraction , 1996, CACM.

[9]  Roberta H. Merchant TIPSTER Program Overview , 1993, TIPSTER.

[10]  Siddharth Patwardhan,et al.  Widening the Field of View of Information Extraction Through Sentential Event Recognition , 2010 .

[11]  Gian Piero Zarri,et al.  Automatic Representation of the Semantic Relationships Corresponding to a French Surface Expression , 1983, ANLP.

[12]  Sameh Alansary MUHIT: A Multilingual Harmonized Dictionary , 2014, LREC.

[13]  F. Ruth Gee The TIPSTER Text Program Overview , 1998, TIPSTER.

[14]  Ralph Grishman,et al.  Message Understanding Conference- 6: A Brief History , 1996, COLING.

[15]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[16]  Gerald DeJong Prediction and substantiation: A new approach to natural language processing , 1979 .