The thematic modelling of subtext

Narratives form a key component of multimedia knowledge representation on the Web. However, many existing multimedia narrative systems either ignore the narrative qualities of any media, or focus on the literal depicted content ignoring any subtext. Ignoring narrative subtext can lead to erroneous search results, or automatically remixed content that lacks cohesion. We suggest that subtext can be computationally modeled in terms of Tomashevsky’s hierarchy of themes and motifs. These elements can then be used in a semiotic term expansion algorithm, incorporating knowledge of subtext into search and subsequent narrative generation. We present two experimental applications of this technique. In the first, we use our thematic model in the automatic construction of photo montages from Flickr, comparing it to more traditional term expansion based on co-occurrence, and showing that this improves the perceived relevance of images within the montage. In the second, we use the thematic model in order to automatically identify Flickr images to illustrate short stories, where it dampened the perception of unwanted themes (an effect we describe as reducing thematic noise). Our work is among the first in this space, and shows that thematic subtext can be tackled computationally.

[1]  M. Weal,et al.  Measuring Narrative Cohesion: A Five Variables Approach , 2011 .

[2]  Kim Gee The ergonomics of hypertext narative: usability testing as a tool for evaluation and redesign , 2001, AJCD.

[3]  David E. Millard,et al.  Capturing the semiotic relationship between terms , 2010, New Rev. Hypermedia Multim..

[4]  J. Bruner The Narrative Construction of Reality , 1991, Critical Inquiry.

[5]  Mark Gahegan,et al.  Frankenplace: Interactive Thematic Mapping for Ad Hoc Exploratory Search , 2015, WWW.

[6]  Lyle H. Ungar,et al.  Beyond Binary Labels: Political Ideology Prediction of Twitter Users , 2017, ACL.

[7]  David E. Millard,et al.  Using a thematic model to enrich photo montages , 2009, HT '09.

[8]  David E. Millard,et al.  Ontologies as facilitators for repurposing web documents , 2007, Int. J. Hum. Comput. Stud..

[9]  Gerald Friedland,et al.  Narrative theme navigation for sitcoms supported by fan-generated scripts , 2010, AIEMPro '10.

[10]  W. Bruce Croft,et al.  Quary Expansion Using Local and Global Document Analysis , 1996, SIGIR Forum.

[11]  Filippo Menczer,et al.  Clustering memes in social media streams , 2014, Social Network Analysis and Mining.

[12]  Ernesto Diaz-Aviles,et al.  Mining Affective Context in Short Films for Emotion-Aware Recommendation , 2015, HT.

[13]  Jai E. Jung,et al.  A computational model of transmedia ecosystem for story-based contents , 2017, Multimedia Tools and Applications.

[14]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[15]  Meng Wang,et al.  Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Hugh C. Davis,et al.  Folksonomies versus Automatic Keyword Extraction: An Empirical Study , 2006 .

[17]  Wolfgang Nejdl,et al.  How do you feel about "dancing queen"?: deriving mood & theme annotations from user tags , 2009, JCDL '09.

[18]  M. Bal,et al.  Narratology: Introduction to the Theory of Narrative , 1988 .

[19]  Diana Maynard,et al.  Who cares about Sarcastic Tweets? Investigating the Impact of Sarcasm on Sentiment Analysis. , 2014, LREC.

[20]  Arthur C. Graesser,et al.  Coh-Metrix: Analysis of text on cohesion and language , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[21]  Gerhard Weikum,et al.  As Time Goes By: Comprehensive Tagging of Textual Phrases with Temporal Scopes , 2016, WWW.

[22]  Ferdinand de Saussure Course in General Linguistics , 1916 .

[23]  Paolo Rosso,et al.  A WordNet-based Query Expansion Method for Geographical Information Retrieval , 2005, CLEF.

[24]  Vincenzo Lombardo,et al.  Semantic annotation of narrative media objects , 2011, Multimedia Tools and Applications.

[25]  Robert Michael Young,et al.  From linear story generation to branching story graphs , 2005, IEEE Computer Graphics and Applications.

[26]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[27]  Sreenivas Gollapudi,et al.  Diversifying search results , 2009, WSDM '09.

[28]  R. Barthes,et al.  An Introduction to the Structural Analysis of Narrative , 1975 .

[29]  Paul Mulholland,et al.  Storyscope: using theme and setting to guide story enrichment from external data sources , 2013, HT '13.

[30]  Nigel Shadbolt,et al.  OntoMedia: An Ontology for the Representation of Heterogeneous Media , 2005 .

[31]  David S. Rosenblum,et al.  From action to activity: Sensor-based activity recognition , 2016, Neurocomputing.

[32]  Jinhui Tang,et al.  Weakly Supervised Deep Matrix Factorization for Social Image Understanding , 2017, IEEE Transactions on Image Processing.

[33]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[34]  Herwig Unger,et al.  On N-term Co-occurrences , 2014, IC2IT.

[35]  Luming Zhang,et al.  Fortune Teller: Predicting Your Career Path , 2016, AAAI.

[36]  Alia I. Abdelmoty,et al.  Ontology-Based Spatial Query Expansion in Information Retrieval , 2005, OTM Conferences.

[37]  Erkki Sutinen,et al.  Serious storytelling – a first definition and review , 2017, Multimedia Tools and Applications.

[38]  Thomas S. Huang,et al.  Unifying Keywords and Visual Contents in Image Retrieval , 2002, IEEE Multim..

[39]  Jing Liu,et al.  Robust Structured Subspace Learning for Data Representation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  David E. Millard,et al.  A semiotic approach for the generation of themed photo narratives , 2010, HT '10.

[41]  Jinhui Tang,et al.  Weakly-Shared Deep Transfer Networks for Heterogeneous-Domain Knowledge Propagation , 2015, ACM Multimedia.

[42]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[43]  Pushpak Bhattacharyya,et al.  Automatic Sarcasm Detection , 2016, ACM Comput. Surv..

[44]  D. Harrell Shades of Computational Evocation and Meaning: The GRIOT System and Improvisational Poetry Generation , 2005 .

[45]  Maxine S. Cohen Review of “The ergonomics of hypertext narrative: usability testing as a tool for evaluation and redesign” by Kim Gee , 2002 .

[46]  Mitsuru Ishizuka,et al.  Keyword extraction from a single document using word co-occurrence statistical information , 2004, Int. J. Artif. Intell. Tools.

[47]  Takenobu Tokunaga,et al.  Combining multiple evidence from different types of thesaurus for query expansion , 1999, SIGIR '99.

[48]  D. McAdams,et al.  The Problem of Narrative Coherence , 2006 .

[49]  Michael Hausenblas,et al.  Metadata-driven interactive web video assembly , 2009, Multimedia Tools and Applications.

[50]  John Sturrock,et al.  Structuralism and since : from Lévi-Strauss to Derrida , 1983 .

[51]  Alberto Del Bimbo,et al.  Socializing the Semantic Gap , 2015, ACM Comput. Surv..