In this paper we present a question answering system supported by semantic graphs. Aside from providing answers to natural language questions, the system offers explanations for these answers via a visual representation of documents, their associated list of facts described by subject – verb – object triplets, and their summaries. The triplets, automatically extracted from the Penn Treebank parse tree obtained for each sentence in the document collection, can be searched, and we have implemented a question answering system to serve as a natural language interface to this search. The vocabulary of questions is general because it is not limited to a specific domain, however the questions's grammatical structure is restricted to a predetermined template because our system can understand only a limited number of question types. The answers are retrieved from the set of facts, and they are supported by sentences and their corresponding document. The document overview, comprising the semantic representation of the document generated in the form of a semantic graph, the list of facts it contains and its automatically derived summary, offers an explanation to each answer. The extracted triplets are further refined by assigning the corresponding co referenced named entity, by resolving pronominal anaphors, as well as attaching the associated WordNet synset. The semantic graph belonging to the document is developed based on the enhanced triplets while the document summary is automatically generated from the semantic description of the document and the extracted facts.
[1]
Christiane Fellbaum,et al.
Book Reviews: WordNet: An Electronic Lexical Database
,
1999,
CL.
[2]
Marko Grobelnik,et al.
Learning Sub-structures of Document Semantic Graphs for Document Summarization
,
2004
.
[3]
Enrico Motta,et al.
AquaLog: An ontology-driven question answering system for organizational semantic intranets
,
2007,
J. Web Semant..
[4]
Dunja Mladenic,et al.
Semantic Graphs Derived From Triplets with Application in Document Summarization
,
2009,
Informatica.
[5]
William B. Dolan,et al.
Less is more: Eliminating index terms from subordinate clauses
,
1999,
ACL.
[6]
Kalina Bontcheva,et al.
A Text-based Query Interface to OWL Ontologies
,
2008,
LREC.
[7]
Beatrice Santorini,et al.
Building a Large Annotated Corpus of English: The Penn Treebank
,
1993,
CL.
[8]
Kalina Bontcheva,et al.
A Natural Language Query Interface to Structured Information
,
2008,
ESWC.
[9]
Oren Etzioni,et al.
The Tradeoffs Between Open and Traditional Relation Extraction
,
2008,
ACL.
[10]
Yiming Yang,et al.
RCV1: A New Benchmark Collection for Text Categorization Research
,
2004,
J. Mach. Learn. Res..