In this paper, we present an approach to perform reasoning for scalable OWL ontologies in a Hadoop-based distributed computing cluster. Rule-based reasoning is typically used for a scalable OWL-Horst reasoning; typically, the system repeatedly performs many operations involving semantic axioms for big ontology triples until no further inferred data exists. Thus, the reasoning systems suffer from performance limitations when ontology reasoning is performed via disk-based MapReduce approaches. To overcome this drawback, we propose an approach that loads triples to memory in computer nodes that are connected by SPARK - a memory-based cluster computing platform - and executes ontology reasoning. To implement an OWL Horst ontology reasoning system, we first define a set of algorithms such that they divide large triples into Resilient Distributed Datasets (RDDs), taking into account the patterns and interdependencies of the reasoning rules. We then load each RDD into the memory of computers composing a distributed computing cluster and subsequently perform distributed reasoning by rule execution orders. To evaluate the proposed methods, we compare it to WebPIE using the LUBM set, which is formal dataset for evaluating ontology inferences and search speeds. The proposed approach shows throughput is improved by 200% (98k/sec) as compared to WebPIE (33k/sec) using the LUBM6000 (860 million triples, 109 gigabyte).
[1]
Boris Motik,et al.
Reasoning in Description Logics with a Concrete Domain in the Framework of Resolution
,
2004,
ECAI.
[2]
Boris Motik,et al.
Hypertableau Reasoning for Description Logics
,
2009,
J. Artif. Intell. Res..
[3]
Thomas R. Gruber,et al.
A translation approach to portable ontology specifications
,
1993,
Knowl. Acquis..
[4]
Herman J. ter Horst,et al.
Completeness, decidability and complexity of entailment for RDF Schema and a semantic extension involving the OWL vocabulary
,
2005,
J. Web Semant..
[5]
Taha Osman,et al.
A Pragmatic Approach to Semantic Repositories Benchmarking
,
2010,
ESWC.
[6]
Maurizio Lenzerini,et al.
TBox and ABox Reasoning in Expressive Description Logics
,
1996,
KR.
[7]
Michael J. Franklin,et al.
Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing
,
2012,
NSDI.
[8]
Yarden Katz,et al.
Pellet: A practical OWL-DL reasoner
,
2007,
J. Web Semant..
[9]
Ian Horrocks,et al.
FaCT++ Description Logic Reasoner: System Description
,
2006,
IJCAR.