Employing Ontology Enrichment Algorithm in Classifying Biomedical Text Abstracts

The application of text classification systems on biomedical literature aims to select articles relevant to a specific issue from large corpora. As the amount of online biomedical literature grows, the task of finding relevant information becomes very complicated, due to the difficulties in browsing and searching the relevant information through the web. Ontology is useful for organizing and navigating the Web sites and also for improving the accuracy of Web searches. It provides a shared understanding of domain, to overcome differences in terminology such as synonym, term variants and terms ambiguity. However, one of the problems raised in ontology is the maintenance of these bases of concepts. Therefore, we investigate and propose an ontology enrichment algorithm as one of the methods to modify an existing ontology. In this research, we present a new ontology enrichment algorithm for assigning or associating each concept in the training ontology with the relevant and informative features from biomedical information sources. Experiments are conducted to extract and select the meaningful features from different information sources such as the OHSUMED dataset, Medical Subject Heading (MeSH) terms and heart disease glossaries. Then, we expand these features into the training ontology. Finally, we evaluate the performance of our proposed ontology enrichment algorithm in classifying biomedical text

[1]  Martin Volk,et al.  Enriching an ontology with WordNet based on similarity measures , 2005 .

[2]  Chris Buckley,et al.  OHSUMED: an interactive retrieval evaluation and new large test collection for research , 1994, SIGIR '94.

[3]  Masaki Aono,et al.  An efficient and scalable algorithm for segmented alignment of ontologies of arbitrary size , 2009, J. Web Semant..

[4]  M. Aono,et al.  Ontology based Approach for Classifying Biomedical Text Abstracts , 2011 .

[5]  Frank van Harmelen,et al.  A semantic web primer , 2004 .

[6]  Ee-Peng Lim,et al.  Hierarchical text classification and evaluation , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[7]  BHF Fundraising,et al.  The British Heart Foundation , 1963, Nature.

[8]  Susan Gauch,et al.  Training a hierarchical classifier using inter document relationships , 2009, J. Assoc. Inf. Sci. Technol..

[9]  Earl L. Grinols,et al.  Glossary and Definitions , 2009 .

[10]  Susan Gauch,et al.  Hierarchical Text Classification , .

[11]  Qiang Yang,et al.  Deep classification in large-scale text hierarchies , 2008, SIGIR '08.

[12]  Hiep Phuc Luong,et al.  Enriching concept descriptions in an amphibian ontology with vocabulary extracted from wordnet , 2009, 2009 22nd IEEE International Symposium on Computer-Based Medical Systems.

[13]  Masaki Aono,et al.  Classifying Biomedical Text Abstracts based on Hierarchical 'Concept' Structure , 2011 .

[14]  Erhard Rahm,et al.  Semantic Enrichment of Ontology Mappings: A Linguistic-Based Approach , 2013, ADBIS.

[15]  Susan Gauch,et al.  Using Text Mining to Enrich the Vocabulary of Domain Ontologies , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[16]  Marie-Francine Moens,et al.  Efficient Hierarchical Entity Classifier Using Conditional Random Fields , 2006, OntologyLearning@COLING/ACL.

[17]  Kabita Thaoroijam A Study on Document Classification using Machine Learning Techniques , 2014 .