A multi-phase correlation search framework for mining non-taxonomic relations from unstructured text

Abstract Over the last decade, ontology engineering has increasingly relied on “learning” the ontology from domain-specific electronic documents. Most research has focused on the extraction of concepts and taxonomic relations, while the extraction of non-taxonomic relations has often been neglected and remains under-researched. In this paper, we present a multi-phase correlation search framework for extracting non-taxonomic relations from unstructured text. Our framework addresses the two main problems in non-taxonomic relation extraction: (a) the discovery of non-taxonomic relations and (b) the labelling of non-taxonomic relations. First, the framework extracts correlated concepts beyond the ordinary search window of a single sentence. Interesting correlations are then filtered using association rule mining with the lift interestingness measure. Next, the framework distinguishes non-taxonomic concept pairs from taxonomic concept pairs based on an existing domain ontology. Finally, the framework uses domain-related verbs as labels for the non-taxonomic relations. The proposed framework has been tested on the marine biology domain. Domain experts validated the results, which were reliable and showed a significant improvement over the traditional association rule approach in finding non-taxonomic relations in unstructured text.
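
As a rough illustration of the lift-based filtering step mentioned above, the sketch below computes lift for concept pairs that co-occur in the same text window, where lift(A, B) = P(A and B) / (P(A) P(B)). This is only a minimal sketch under stated assumptions: the function name `lift_scores`, the toy windows, and the cut-off of 1.0 are hypothetical choices for illustration and are not taken from the paper.

```python
from collections import Counter
from itertools import combinations


def lift_scores(windows):
    """Return the lift of every concept pair that co-occurs in at least one window.

    `windows` is a list of sets of concept names; each set holds the concepts
    found in one search window (hypothetical input format).
    """
    n = len(windows)
    single = Counter()  # number of windows containing each concept
    pair = Counter()    # number of windows containing each concept pair
    for window in windows:
        concepts = sorted(set(window))  # sort so each pair has one canonical order
        single.update(concepts)
        pair.update(combinations(concepts, 2))
    return {
        (a, b): (count / n) / ((single[a] / n) * (single[b] / n))
        for (a, b), count in pair.items()
    }


# Toy data, invented for illustration only.
windows = [
    {"coral", "algae"},
    {"coral", "algae"},
    {"coral", "reef"},
    {"shark", "reef"},
]

scores = lift_scores(windows)

# Assumed cut-off: lift above 1.0 means the pair co-occurs more often than
# it would if the two concepts were independent.
interesting = {p: s for p, s in scores.items() if s > 1.0}
```

A lift above 1 indicates positive correlation between the two concepts, a lift near 1 indicates independence, and a lift below 1 indicates that the concepts tend not to appear together, which is why a threshold on lift can serve as a filter for interesting pairs.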
