An interactive framework for image annotation through gaming

Image indexing is one of the most difficult challenges facing the computer vision community. Addressing this issue, we designed an innovative approach to obtain an accurate label for images by taking into account the social aspects of human-based computation. The proposed approach is highly discriminative in comparison to an ordinary content-based image retrieval (CBIR) paradigm. It aims at what millions of individual gamers are enthusiastic to do, to enjoy themselves within a social competitive environment. It is achieved by setting the focus of the system on the social aspects of the gaming environment, which involves a widely distributed network of human players. Furthermore, this framework integrates a number of different algorithms that are commonly found in image processing and game theoretic approaches to obtain an accurate label. As a result, the framework is able to assign (or derive) accurate tags for images by eliminating annotations made by a less-rational (cheater) player. The performance analysis of this framework has been evaluated with a group of 10 game players. The result shows that the proposed approach is capable of obtaining a good annotation through a small number of game players.

[1]  Yannis Avrithis,et al.  Fusing MPEG-7 Visual Descriptors for Image Classification , 2005, ICANN.

[2]  Manuel Blum,et al.  Improving Image Search with PHETCH , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[3]  Nuno Correia,et al.  Playing games as a way to improve automatic image annotation , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[4]  Elliot Mendelson Introducing Game Theory and its Applications , 2004 .

[5]  Stephen E. Robertson,et al.  Rethinking the ESP game , 2009, CHI Extended Abstracts.

[6]  Thomas M. Breuel,et al.  Automatic Image Tagging Using Community-Driven Online Image Databases , 2008, Adaptive Multimedia Retrieval.

[7]  Jürgen Scheible,et al.  Combining Web, Mobile Phones and Public Displays in Large-Scale: Manhattan Story Mashup , 2007, Pervasive.

[8]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[9]  K. J. Ray Liu,et al.  Incentive Cooperation Strategies for Peer-to-Peer Live Multimedia Streaming Social Networks , 2009, IEEE Transactions on Multimedia.

[10]  Manuel Blum,et al.  Peekaboom: a game for locating objects in images , 2006, CHI.

[11]  Frédéric Béchet,et al.  Tagging with Hidden Markov Models Using Ambiguous Tags , 2004, COLING.

[12]  Manuel Blum,et al.  Verbosity: a game for collecting common-sense facts , 2006, CHI.

[13]  C. Won,et al.  Efficient Use of MPEG‐7 Edge Histogram Descriptor , 2002 .

[14]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[15]  Jane Yung-jen Hsu,et al.  KissKissBan: a competitive human computation game for image annotation , 2010, HCOMP '09.

[16]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[17]  R. Brereton,et al.  Support vector machines for classification and regression. , 2010, The Analyst.