This paper presents a work in developing a semantic-based question answering system (QAS) for Indonesian Translation of Quran (ITQ). This research is motivated by the lacks of previous built QAS that caused by a keyword-based retrieval. Instead of keeping the retrieval method, we shifted to a semantic approach where the retrieval process is done by using a semantic similarity measurement. In doing so, we built an ontology of ITQ to get the concepts as well as verses where they appear in. We applied three factoid question types on the QAS that including Who, Where, and When. Furthermore, a weighted vector for each concept that belongs to respective expected answering type (also called as named entity group) i.e. Person, Location, and Time is generated in order to feed semantic interpreter on user question. From 222 concepts defined from the ontology, we clustered them into 77, 24, and 6 concepts for Person, Location, and Time respectively. Since we found there are some characteristics of texts in ITQ, we developed our own modules to deal with including generate the inverted index and named entity recognition. Answer extraction is conducted by applying some features extraction in order to score the answer candidates. Evaluation of the system is designed by providing two data set of question and answer where the first one is purposed to measure the effectiveness of semantic approach comparing with keyword-based retrieval and the last one aims to know system performance in regard the appearance of concepts in ITQ.
[1]
Eric Atwell,et al.
Computational ontologies for semantic tagging of the Quran:A survey of past approaches
,
2014
.
[2]
Tiejun Zhao,et al.
Knowledge-Based Question Answering as Machine Translation
,
2014,
ACL.
[3]
Ayu Purwarianti,et al.
A Machine Learning Approach for an Indonesian-English Cross Language Question Answering System
,
2007,
IEICE Trans. Inf. Syst..
[4]
Dewi Wisnu Wardani,et al.
Finding Structured and Unstructured Features to Improve the Search Result of Complex Question
,
2012
.
[5]
Evgeniy Gabrilovich,et al.
Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis
,
2007,
IJCAI.
[6]
Ayu Purwarianti,et al.
A machine learning approach for indonesian question answering system
,
2007,
Artificial Intelligence and Applications.
[7]
Asep Fajar Firmansyah,et al.
A rule-based question answering system on relevant documents of Indonesian Quran Translation
,
2014,
2014 International Conference on Cyber and IT Service Management (CITSM).
[8]
Meynar Dwi Anggraeny.
Implementasi Question Answering System Dengan Metode Rule-Based Pada Terjemahan Al Qur’an Surat Al Baqarah
,
2007
.
[9]
Nagwa M. El-Makky,et al.
Al-Bayan: An Arabic Question Answering System for the Holy Quran
,
2014,
ANLP@EMNLP.
[10]
Mohd Syazwan Abdullah,et al.
Al-Quran themes classification using ontology
,
2012
.