Topical Video Search: Analysing Video Concept Annotation through Crowdsourcing Games

Games with a purpose (GWAPs) are increasingly used in audio-visual collections as a mechanism for annotating videos through tagging. One such GWAP is Waisda? , a video labeling game where players tag streaming video and win points by reaching consensus on tags with other players. The open-ended and unconstrained manner of tagging in the fast-paced setting of the game has fundamental impact on the resulting tags. We find that Waisda? tags predominately describe visual objects and rarely refer to the topics of the videos. In this study we evaluate to what extent the tags entered by players can be regarded as topical descriptors of the video material.  Moreover, we characterize the quality of the user tags as topical descriptors with the aim to detect and filter out the bad ones. Our results show that after filtering,  game tags perform equally well compared to the manually crafted metadata when it comes to accessing the videos based on topic. An important consequence of this finding is that tagging games can provide a cost-effective alternative in situations when manual annotation by professionals is too costly.

[1]  Wolfgang Nejdl,et al.  Can all tags be used for search? , 2008, CIKM '08.

[2]  Falk Schreiber,et al.  Analysis of Biological Networks , 2008 .

[3]  James Caverlee,et al.  PageRank for ranking authors in co-citation networks , 2009, J. Assoc. Inf. Sci. Technol..

[4]  Florian Störkle Combino - A GWAP for Generating Combined Tags , 2012 .

[5]  Manuel Blum,et al.  Verbosity: a game for collecting common-sense facts , 2006, CHI.

[6]  Michiel Hildebrand,et al.  Waisda?: video labeling game , 2013, MM '13.

[7]  Detlef Schoder,et al.  Imitation and Quality of Tags in Social Bookmarking Systems - Collective Intelligence Leading to Folksonomies , 2010 .

[8]  Laura A. Dabbish,et al.  Designing games with a purpose , 2008, CACM.

[9]  Shanshan Li,et al.  Which Tags Are Related to Visual Content? , 2010, MMM.

[10]  Elena Paslaru Bontas Simperl,et al.  SpotTheLink: A Game for Ontology Alignment , 2011, Wissensmanagement.

[11]  Latifur Khan,et al.  Knowledge Based Image Annotation Refinement , 2009, J. Signal Process. Syst..

[12]  Martin Halvey,et al.  Analysis of online video search and sharing , 2007, HT '07.

[13]  Gert R. G. Lanckriet,et al.  A Game-Based Approach for Collecting Semantic Annotations of Music , 2007, ISMIR.

[14]  Abebe Rorissa,et al.  A comparative study of Flickr tags and index terms in a general image collection , 2010, J. Assoc. Inf. Sci. Technol..

[15]  Ji-Lung Hsieh,et al.  Network analysis of tagging structure , 2011, ASIST.

[16]  Wesley De Neve,et al.  Towards data-driven estimation of image tag relevance using visually similar and dissimilar folksonomy images , 2012, SAM '12.

[17]  Stephen E. Robertson,et al.  Rethinking the ESP game , 2009, CHI Extended Abstracts.

[18]  David C. Parkes,et al.  A game-theoretic analysis of the ESP game , 2013, TEAC.

[19]  Henry Lieberman,et al.  Common sense and intelligent user interfaces , 2007, IUI '07.

[20]  Manuel Blum,et al.  Peekaboom: a game for locating objects in images , 2006, CHI.

[21]  P. Jason Morrison,et al.  Tagging and searching: Search retrieval effectiveness of folksonomies on the World Wide Web , 2008, Inf. Process. Manag..

[22]  Laura A. Dabbish,et al.  Labeling images with a computer game , 2004, AAAI Spring Symposium: Knowledge Collection from Volunteer Contributors.

[23]  Kilian Q. Weinberger,et al.  Reliable tags using image similarity: mining specificity and expertise from large-scale multimedia databases , 2009, WSMC '09.

[24]  Edith Law,et al.  Input-agreement: a new mechanism for collecting data using human computation games , 2009, CHI.

[25]  Chao Wu,et al.  Analysis of Tags as a Social Network , 2008, 2008 International Conference on Computer Science and Software Engineering.

[26]  Allan H. Gilbert,et al.  Studies In Iconology: Humanistic Themes In The Art Of The Renaissance , 1939 .

[27]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[28]  Martin Hepp,et al.  Games with a Purpose for the Semantic Web , 2008, IEEE Intelligent Systems.

[29]  Gary Geisler,et al.  Tagging video: conventions and strategies of the YouTube community , 2007, JCDL '07.

[30]  Michiel Hildebrand,et al.  Linking User Generated Video Annotations to the Web of Data , 2012, MMM.

[31]  Mark Steyvers,et al.  Identifying Emotions, Intentions, and Attitudes in Text Using a Game with a Purpose , 2010, HLT-NAACL 2010.

[32]  R. Hanneman Introduction to Social Network Methods , 2001 .

[33]  Changhu Wang,et al.  Image annotation refinement using random walk with restarts , 2006, MM '06.

[34]  Lide Wu,et al.  Folksonomy as a Complex Network , 2005, ArXiv.

[35]  Wolfgang Nejdl,et al.  Bridging the gap between tagging and querying vocabularies: Analyses and applications for enhancing multimedia IR , 2010, J. Web Semant..

[36]  Sara Shatford,et al.  Analyzing the Subject of a Picture: A Theoretical Approach , 1986 .

[37]  Catherine C. Marshall,et al.  No bull, no spin: a comparison of tags with other forms of user metadata , 2009, JCDL '09.

[38]  E. B. Wilson Probable Inference, the Law of Succession, and Statistical Inference , 1927 .

[39]  Karen Spärck Jones A statistical interpretation of term specificity and its application in retrieval , 2021, J. Documentation.

[40]  Jane Yung-jen Hsu,et al.  Human Computation Game for Commonsense Data Verification , 2010, AAAI Fall Symposium: Commonsense Knowledge.

[41]  Rossano Schifanella,et al.  Design of social games for collecting reliable semantic annotations , 2011, 2011 16th International Conference on Computer Games (CGAMES).

[42]  François Bry,et al.  Karido: A GWAP for telling artworks apart , 2011, 2011 16th International Conference on Computer Games (CGAMES).

[43]  Andreas Hotho,et al.  Information Retrieval in Folksonomies: Search and Ranking , 2006, ESWC.

[44]  Elena Paslaru Bontas Simperl,et al.  An Experiment in Comparing Human-Computation Techniques , 2012, IEEE Internet Computing.

[45]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[46]  Satoshi Nakamura,et al.  Can social bookmarking enhance search in the web? , 2007, JCDL '07.

[47]  Roelof van Zwol,et al.  Flickr tag recommendation based on collective knowledge , 2008, WWW.

[48]  François Bry,et al.  ARTigo: Building an Artwork Search Engine With Games and Higher-Order Latent Semantic Analysis , 2013, AAAI 2013.

[49]  Yong Wang,et al.  Refining image annotation using contextual relations between words , 2007, CIVR '07.

[50]  Matthew Chalmers,et al.  EyeSpy: supporting navigation through play , 2009, CHI.

[51]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[52]  Yong Yu,et al.  Optimizing web search using social annotations , 2007, WWW '07.

[53]  Haojie Li,et al.  Tag ranking by propagating relevance over tag and image graphs , 2012, ICIMCS '12.

[54]  Marcel Worring,et al.  Learning tag relevance by neighbor voting for social image retrieval , 2008, MIR '08.

[55]  Roger B. Dannenberg,et al.  TagATune: A Game for Music and Sound Annotation , 2007, ISMIR.

[56]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[57]  Fabian Kneißl,et al.  Crowdsourcing for linguistic field research and e-learning , 2014 .

[58]  Marcel Worring,et al.  Classification of user image descriptions , 2004, Int. J. Hum. Comput. Stud..

[59]  Bernardo A. Huberman,et al.  Usage patterns of collaborative tagging systems , 2006, J. Inf. Sci..

[60]  Michiel Hildebrand,et al.  Waisda?: making videos findable through crowdsourced annotations , 2014 .

[61]  Luis von Ahn,et al.  Word sense disambiguation via human computation , 2010, HCOMP '10.

[62]  Sourav S. Bhowmick,et al.  Content is still king: the effect of neighbor voting schemes on tag relevance for social image retrieval , 2012, ICMR.

[63]  Wesley De Neve,et al.  Tag refinement in an image folksonomy using visual similarity and tag co-occurrence statistics , 2010, Signal Process. Image Commun..

[64]  Zoran Popovic,et al.  PhotoCity: training experts at large-scale image acquisition through a competitive game , 2011, CHI.

[65]  A. Agresti,et al.  Approximate is Better than “Exact” for Interval Estimation of Binomial Proportions , 1998 .

[66]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[67]  Sourav S. Bhowmick,et al.  Quantifying tag representativeness of visual content of social images , 2010, ACM Multimedia.

[68]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[69]  Zoran Popovic,et al.  Reconstructing the world in 3D: bringing games with a purpose outdoors , 2010, FDG.

[70]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[71]  Ellen M. Voorhees,et al.  The Philosophy of Information Retrieval Evaluation , 2001, CLEF.

[72]  Jennifer Preece,et al.  Odd Leaf Out: Improving Visual Recognition with Games , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[73]  Lora Aroyo,et al.  An Evaluation of Labelling-Game Data for Video Retrieval , 2013, ECIR.

[74]  F. Schreiber,et al.  Centrality Analysis Methods for Biological Networks and Their Application to Gene Regulatory Networks , 2008, Gene regulation and systems biology.

[75]  James Allan,et al.  A comparison of statistical significance tests for information retrieval evaluation , 2007, CIKM '07.

[76]  François Bry,et al.  A Gaming Ecosystem Crowdsourcing Deep Semantic Annotations , 2015 .

[77]  Luis von Ahn,et al.  Human Computation for Attribute and Attribute Value Acquisition , 2011 .