Semantic-Based Search Engine System for Graph Images in Academic Literature

It is well known that information retrieval is an essential aspect of search engine systems because there is a very large amount of data published on the internet that cannot be manually searched. However, search engine systems should not only present relevant results but also obtain new knowledge from the user’s searches. For example, new knowledge in academic research areas may be present in images that include graphs. In this study, we utilize methods to extract graphical and textual information from graph images and store this new knowledge in an ontology. We also propose a search engine system that is applicable to an ontology that contains this extractable information, which is extracted from images with graphs. The developed ontology is useful because users can acquire considerable amount of knowledge that is discovered from the semantic relations in the ontology. To evaluate the search engine system, we conducted four simulations to address four main issues. The results indicate that the proposed system provides accurate and relevant results; moreover, as indicated by the high F-measure values, the performance of the system is highly acceptable. However, we also found some limitations, which will be mitigated in a future study.

[1]  Masaomi Kimura,et al.  Extraction of Graph Information Based on Image Contents and the Use of Ontology. , 2016 .

[2]  Xia Li-min Ontology-based image retrieval , 2007 .

[3]  Masaomi Kimura,et al.  A Proposal for a Method of Graph Ontology by Automatically Extracting Relationships between Captions and X- and Y-axis Titles , 2015, KEOD.

[4]  Preslav Nakov,et al.  BioText Search Engine: beyond abstract search , 2007, Bioinform..

[5]  Mingxia Gao,et al.  An ontology search engine based on semantic analysis , 2005, Third International Conference on Information Technology and Applications (ICITA'05).

[6]  Alexander Pretschner,et al.  Ontology-based personalized search and browsing , 2003, Web Intell. Agent Syst..

[7]  Masaomi Kimura,et al.  Extraction and Identification of Bar Graph Components by Automatic Epsilon Estimation , 2017 .

[8]  Hans-Michael Müller,et al.  Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological Literature , 2004, PLoS biology.

[9]  David Novak,et al.  Building a web-scale image similarity search system , 2010, Multimedia Tools and Applications.

[10]  Björn Buchhold,et al.  Semantic Search on Text and Knowledge Bases , 2016, Found. Trends Inf. Retr..

[11]  Chris Mungall,et al.  AmiGO: online access to ontology and annotation data , 2008, Bioinform..

[12]  Guoliang Li,et al.  Efficient fuzzy full-text type-ahead search , 2011, The VLDB Journal.

[13]  Sophia Ananiadou,et al.  FACTA: a text search engine for finding associated biomedical concepts , 2008, Bioinform..

[14]  Laura Farinetti,et al.  Ontology Driven Semantic Search , 2004 .

[15]  David Bawden,et al.  Is Google enough? Comparison of an internet search engine with academic library resources , 2005, Aslib Proc..

[16]  Thomas Martin Deserno,et al.  Ontology of Gaps in Content-Based Image Retrieval , 2009, Journal of Digital Imaging.

[17]  Axel-Cyrille Ngonga Ngomo,et al.  A service-oriented search framework for full text, geospatial and semantic search , 2014, SEM '14.

[18]  Ji-quan Ma,et al.  Content-Based Image Retrieval with HSV Color Space and Texture Features , 2009, 2009 International Conference on Web Information Systems and Mining.

[19]  William I. Grosky,et al.  Narrowing the semantic gap - improved text-based web document retrieval using visual features , 2002, IEEE Trans. Multim..

[20]  Fabian M. Suchanek,et al.  ESTER: efficient search on text, entities, and relations , 2007, SIGIR.

[21]  Eero Hyvönen,et al.  Ontogator: Combining View- and Ontology-Based Search with Semantic Browsing , 2003 .

[22]  Michael G. Strintzis,et al.  An ontology approach to object-based image retrieval , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[23]  Masaomi Kimura,et al.  Novel Ontologies-based Optical Character Recognition-error Correction Cooperating with Graph Component Extraction , 2017 .