In this paper we describe the design and implementation of the Prolog interface to the Unstructured Information Management Architecture (UIMA) and some of its applications in natural language processing. The UIMA Prolog interface translates unstructured data and the UIMA Common Analysis Structure (CAS) into a Prolog knowledge base, over which, the developers write rules and use resolution theorem proving to search and generate new annotations over the unstructured data. These rules can explore all the previous UIMA annotations (such as, the syntactic structure, parsing statistics) and external Prolog knowledge bases (such as, Prolog WordNet and Extended WordNet) to implement a variety of tasks for the natural language analysis. We also describe applications of this logic programming interface in question analysis (such as, focus detection, answer-type and other constraints detection), shallow parsing (such as, relations in the syntactic structure), and answer selection.
[1]
David A. Ferrucci,et al.
Building an example application with the Unstructured Information Management Architecture
,
2004,
IBM Syst. J..
[2]
Christiane Fellbaum,et al.
Book Reviews: WordNet: An Electronic Lexical Database
,
1999,
CL.
[3]
David A. Ferrucci,et al.
UIMA: an architectural approach to unstructured information processing in the corporate research environment
,
2004,
Natural Language Engineering.
[4]
Miguel Calejo.
InterProlog: Towards a Declarative Embedding of Logic Programming in Java
,
2004,
JELIA.
[5]
Thilo Götz,et al.
Design and implementation of the UIMA Common Analysis System
,
2004,
IBM Syst. J..