A crowdsourcing framework for the production and use of film and television data

This paper outlines a framework that would enable the detailed indexing of film and television media through crowdsourcing. By making it easier to generate detailed data about these media on a large scale, fans and scholars can more efficiently produce a wide range of artifacts that reflect their interests in this content. Our development of a test collection included detailed indexing of 12 feature films and 8 television programs. We describe the conditions that make crowdsourcing an ideal approach for accomplishing this work on a larger scale; present a three-level development framework; and discuss how automated indexing, crowdsourcing quality, and copyright concerns might influence continued development of the project. Our framework highlights the potential of both multimedia indexing and crowdsourcing and can serve as a model for others embarking on projects that involve indexing, annotating, or labeling large multimedia collections.

[1]  Yochai Benkler,et al.  The wealth of networks: how social production transforms markets and freedom , 2006 .

[2]  Chris Beer,et al.  Visualizing television archives , 2009 .

[3]  Brendan T. O'Connor,et al.  Cheap and Fast – But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks , 2008, EMNLP.

[4]  Omar Alonso,et al.  Crowdsourcing for relevance evaluation , 2008, SIGF.

[5]  David Trottier The Screenwriter's Bible: A Complete Guide to Writing, Formatting, and Selling Your Script , 1994 .

[6]  Bertrand Augst,et al.  No Longer a Shot in the Dark: Engineering a Robust Environment for Film Study , 1999, Comput. Humanit..

[7]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Michael G. Christel,et al.  Addressing the challenge of visual information access from digital image and video libraries , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[9]  Fang Wu,et al.  Crowdsourcing, attention and productivity , 2008, J. Inf. Sci..

[10]  Vikas Sindhwani,et al.  Data Quality from Crowdsourcing: A Study of Annotation Selection Criteria , 2009, HLT-NAACL 2009.

[11]  Bernardo A. Huberman,et al.  Cooperation and quality in wikipedia , 2007, WikiSym '07.

[12]  Leah Hoffmann Content control , 2009, CACM.

[13]  Duncan J. Watts,et al.  Financial incentives and the "performance of crowds" , 2009, HCOMP '09.

[14]  Ohad Shamir,et al.  Vox Populi: Collecting High-Quality Labels from a Crowd , 2009, COLT.

[15]  Rick Kazman,et al.  The metropolis model a new logic for development of crowdsourced systems , 2009, CACM.

[16]  Paul Over,et al.  TRECVID: evaluating the effectiveness of information retrieval tasks on digital video , 2004, MULTIMEDIA '04.

[17]  Clay Shirky Here Comes Everybody: The Power of Organizing Without Organizations , 2008 .

[18]  Gary Geisler Building Blocks For Rapid Development of Information Seeking Support Systems , 2007 .

[19]  Rémi Ronfard Reading movies: an integrated DVD player for browsing movies and their scripts , 2004, MULTIMEDIA '04.

[20]  Huanbo Luan,et al.  Content-based video retrieval: Three example systems from TRECVid , 2008 .

[21]  Andrew B. Whinston,et al.  Research Issues in Social computing , 2007, J. Assoc. Inf. Syst..

[22]  Alan F. Smeaton,et al.  Are Visual Informatics Actually Useful in Practice: A Study in a Film Studies Context , 2009, IVIC.

[23]  James E. Lynch,et al.  Television Program Production , 1956 .

[24]  C. Lintott,et al.  Galaxy Zoo: morphologies derived from visual inspection of galaxies from the Sloan Digital Sky Survey , 2008, 0804.4483.

[25]  Eric S. Raymond,et al.  The Cathedral & the Bazaar , 1999 .

[26]  Chris Callison-Burch,et al.  Fast, Cheap, and Creative: Evaluating Translation Quality Using Amazon’s Mechanical Turk , 2009, EMNLP.

[27]  Henry Jenkins Fans, Bloggers, and Gamers: Exploring Participatory Culture , 2006 .

[28]  Leah Hoffmann,et al.  Crowd control , 2009, CACM.

[29]  Jono Bacon The Art of Community - Building the New Age of Participation, 2nd Edition , 2012, Theory in practice.

[30]  Alan F. Smeaton,et al.  Indexing of Fictional Video Content for Event Detection and Summarisation , 2007, EURASIP J. Image Video Process..

[31]  Alan F. Smeaton,et al.  Developing a MovieBrowser for supporting analysis andbrowsing of movie content , 2008 .

[32]  Gerardo Hermosillo,et al.  Learning From Crowds , 2010, J. Mach. Learn. Res..

[33]  Mubarak Shah,et al.  Semantic classification of movie scenes using finite state machines , 2005 .

[34]  Gerardo Hermosillo,et al.  Supervised learning from multiple experts: whom to trust when everyone lies a bit , 2009, ICML '09.

[35]  Mubarak Shah,et al.  Detection and representation of scenes in videos , 2005, IEEE Transactions on Multimedia.