Concept-based indexing of annotated images using semantic DNA

One of the challenges in image retrieval is dealing with concepts which have no visual appearance in the images or are not used as keywords in their annotations. To address this problem, this paper proposes an unsupervised concept-based image indexing technique which uses a lexical ontology to extract semantic signatures called 'semantic chromosomes' from image annotations. A semantic chromosome is an information structure, which carries the semantic information of an image; it is the semantic signature of an image in a collection expressed through a set of semantic DNA (SDNA), each of them representing a concept. Central to the concept-based indexing technique discussed is the concept disambiguation algorithm developed, which identifies the most relevant 'semantic DNA' (SDNA) by measuring the semantic importance of each word/phrase in the annotation. The concept disambiguation algorithm is evaluated using crowdsourcing. The experiments show that the algorithm has better accuracy (79.4%) than the accuracy demonstrated by other unsupervised algorithms (73%) in the 2007 Semeval competition. It is also comparable with the accuracy achieved in the same competition by the supervised algorithms (82-83%) which contrary to the approach proposed in this paper have to be trained with large corpora. The approach is currently applied to the automated generation of mood boards used as an inspirational tool in concept design.

[1]  Nuno Vasconcelos,et al.  On the efficient evaluation of probabilistic similarity functions for image retrieval , 2004, IEEE Transactions on Information Theory.

[2]  Nikolaos D. Doulamis,et al.  Generalized nonlinear relevance feedback for interactive content-based retrieval and organization , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Wei-Ying Ma,et al.  A probabilistic semantic model for image annotation and multi-modal image retrieval , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  Carole Bouchard,et al.  In Search of Design Inspiration: A Semantic-Based Approach , 2010, J. Comput. Inf. Sci. Eng..

[6]  Susan Gauch,et al.  KeyConcept : Un motor de búsqueda conceptual , 2006 .

[7]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[8]  Jonathon S. Hare,et al.  Facing the reality of semantic image retrieval , 2007, J. Documentation.

[9]  Joon Ho Lee,et al.  Combining multiple evidence from different properties of weighting schemes , 1995, SIGIR '95.

[10]  George W. Davidson,et al.  Roget's Thesaurus of English Words and Phrases , 1982 .

[11]  Rossitza Setchi,et al.  Semantic-based information retrieval in support of concept design , 2011, Adv. Eng. Informatics.

[12]  Eero Hyvönen,et al.  Ontology-Based Image Retrieval , 2003, WWW.

[13]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[14]  Philip Edmonds SENSEVAL: The evaluation of word sense disambiguation systems , 2002 .

[15]  Shih-Fu Chang,et al.  Semantic visual templates: linking visual features to semantics , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[16]  Francesco M. Donini,et al.  Structured Knowledge Representation for Image Retrieval , 2011, J. Artif. Intell. Res..

[17]  Rada Mihalcea,et al.  Amazon Mechanical Turk for Subjectivity Word Sense Disambiguation , 2010, Mturk@HLT-NAACL.

[18]  Michael G. Strintzis,et al.  An ontology approach to object-based image retrieval , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[19]  Steffen Staab,et al.  Semantic Multimedia , 2008, Reasoning Web.

[20]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[21]  Qiang Yang,et al.  A unified framework for semantics and feature based relevance feedback in image retrieval systems , 2000, ACM Multimedia.

[22]  Ying Liu,et al.  A survey of content-based image retrieval with high-level semantics , 2007, Pattern Recognit..

[23]  Henrik Bulskov Styltsvig Ontology-based Information Retrieval , 2006 .

[24]  Marin Ferecatu,et al.  Semantic interactive image retrieval combining visual and conceptual content description , 2007, Multimedia Systems.

[25]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[26]  Felix Naumann,et al.  Data fusion , 2009, CSUR.

[27]  Ramanathan V. Guha,et al.  A case for automated large-scale semantic annotation , 2003, J. Web Semant..

[28]  Rossitza Setchi,et al.  Ontology-based indexing of annotated images using semantic DNA and vector space model , 2011, 2011 International Conference on Semantic Technology and Information Retrieval.

[29]  Chih-Fong Tsai,et al.  A review of image retrieval methods for digital cultural heritage resources , 2007, Online Inf. Rev..

[30]  James Allan,et al.  A Comparative Study of Utilizing Topic Models for Information Retrieval , 2009, ECIR.

[31]  Chris Callison-Burch,et al.  Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk , 2009, EMNLP.

[32]  Chris Buckley,et al.  A probabilistic learning approach for document indexing , 1991, TOIS.

[33]  Yixin Chen,et al.  An unsupervised learning approach to content-based image retrieval , 2003, Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings..

[34]  Donna K. Harman,et al.  Overview of the Eighth Text REtrieval Conference (TREC-8) , 1999, TREC.

[35]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[36]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[37]  Rossitza Setchi,et al.  Concept indexing using ontology and supervised machine learning , 2007 .

[38]  William C. Regli,et al.  Putting the crowd to work in a knowledge-based factory , 2010, Adv. Eng. Informatics.

[39]  Rossitza Setchi,et al.  Semantic Approach to Image Retrieval Using Statistical Models Based on a Lexical Ontology , 2010, KES.

[40]  Tat-Seng Chua,et al.  A bootstrapping framework for annotating and retrieving WWW images , 2004, MULTIMEDIA '04.