ROCP: A Rapid Ontology Construction Platform from Unstructured Data

Domain ontologies play a significant role in knowledge-based systems, yet constructing them still requires manual work by domain experts. The main motivation of this paper is to provide a semi-automatic platform that can construct a fairly comprehensive domain ontology from unstructured data. A simple QA system is also proposed to simplify interaction with the domain experts. A novel algorithm, MPVW, which extends the classical TF-IDF algorithm, is proposed to extract terminologies from domain documents. MPVW balances more parameters and factors than TF-IDF when evaluating the features of terminologies. A 3-layer taxonomy and the terminology hyponymy height give domain experts sufficient guidance and prompts for constructing an ontology from the extracted terminologies. Based on this approach we have developed ROCP, a rapid ontology construction platform, which has been applied in the space debris mitigation domain. The experimental results indicate that ROCP extracts terminologies with sufficient accuracy and effectively reduces the labor required of domain experts to construct a domain ontology.
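
The abstract does not spell out the additional parameters that MPVW weighs, so they are not reproduced here. As a point of reference only, the classical TF-IDF baseline that MPVW is said to extend can be sketched as below. The function name `tf_idf`, the toy corpus, and the choice of an unsmoothed logarithmic IDF are illustrative assumptions, not details taken from the paper.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: score} dict per tokenized document.

    Baseline TF-IDF only; this is NOT the paper's MPVW algorithm.
    """
    n_docs = len(documents)

    # Document frequency: number of documents that contain each term.
    doc_freq = Counter()
    for tokens in documents:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in documents:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            # Relative term frequency times unsmoothed inverse document frequency.
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical toy corpus, loosely themed on space debris mitigation.
corpus = [
    ["debris", "mitigation", "orbit", "debris"],
    ["orbit", "satellite", "launch"],
    ["debris", "orbit", "collision"],
]
scores = tf_idf(corpus)
```

In this sketch a term that occurs in every document, such as "orbit" in the toy corpus, receives a score of zero, while terms concentrated in few documents score higher. That behavior is what makes TF-IDF a natural starting point for ranking candidate terminologies.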
