Applying Semantic Reasoning in Image Retrieval

Abstract—With the growth of open sensor networks, multiple applications in different domains make use of a large amount of sensor data, resulting in an emerging need to search semantically over heterogeneous datasets. In semantic search, an important challenge consists of bridging the semantic gap between the high-level natural language query posed by the users and the low-level sensor data. In this paper, we show that state-of-the-art techniques in Semantic Modelling, Computer Vision and Human Media Interaction can be combined to apply semantic reasoning in the field of image retrieval. We propose a system, GOOSE, which is a general-purpose search engine that allows users to pose natural language queries to retrieve corresponding images. User queries are interpreted using the Stanford Parser, semantic rules and the Linked Open Data source ConceptNet. Interpreted queries are presented to the user as an intuitive and insightful graph in order to collect feedback that is used for further reasoning and system learning. A smart results ranking and retrieval algorithm allows for fast and effective retrieval of images.

[1]  Sanja Fidler,et al.  Visual Semantic Search: Retrieving Videos via Complex Textual Queries , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Xinlei Chen,et al.  NEIL: Extracting Visual Knowledge from Web Data , 2013, 2013 IEEE International Conference on Computer Vision.

[3]  Klamer Schutte,et al.  Interactive detection of incrementally learned concepts in images with ranking and semantic query interpretation , 2015, 2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI).

[4]  Wessel Kraaij,et al.  GOOSE: semantic search on internet connected sensors , 2013, Defense, Security, and Sensing.

[5]  Gijs Koot,et al.  Fast Re-ranking of Visual Search Results by Example Selection , 2015, CAIP.

[6]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[7]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[8]  Songyang Lao,et al.  Video Semantic Content Analysis based on Ontology , 2007, International Machine Vision and Image Processing Conference (IMVIP 2007).

[9]  Klamer Schutte,et al.  Incremental concept learning with few training examples and hierarchical classification , 2015, SPIE Security + Defence.

[10]  Alberto Del Bimbo,et al.  Semantic annotation and retrieval of video events using multimedia ontologies , 2007, International Conference on Semantic Computing (ICSC 2007).

[11]  Nihan Kesim Cicekli,et al.  Natural language querying for video databases , 2008, Inf. Sci..

[12]  Catherine Havasi,et al.  Representing General Relational Knowledge in ConceptNet 5 , 2012, LREC.

[13]  Huang Chu-Ren,et al.  Wiktionary and NLP: improving synonymy networks , 2009, ACL 2009.

[14]  Alessandro Oltramari,et al.  Using Ontologies in a Cognitive-Grounded System: Automatic Action Recognition in Video-Surveillance , 2012, STIDS.

[15]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[16]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.