Story embedding: Learning distributed representations of stories based on character networks

Abstract This study aims to learn representations of stories in narrative works (i.e., creative works that contain stories) using fixed-length vectors. Vector representations of stories enable us to compare narrative works regardless of their media or formats. To computationally represent stories, we focus on social networks among characters (character networks). We assume that the structural features of the character networks reflect the characteristics of stories. By extending substructure-based graph embedding models, we propose models to learn distributed representations of character networks in stories. The proposed models consist of three parts: (i) discovering substructures of character networks, (ii) embedding each substructure (Char2Vec), and (iii) learning vector representations of each character network (Story2Vec). We find substructures around each character in multiple scales based on proximity between characters. We suppose that a character's substructures signify its ‘social roles’. Subsequently, a Char2Vec model is designed to embed a social role based on co-occurred social roles. Since character networks are dynamic social networks that temporally evolve, we use temporal changes and adjacency of social roles to determine their co-occurrence. Finally, Story2Vec models predict occurrences of social roles in each story for embedding the story. To predict the occurrences, we apply two approaches: (i) considering temporal changes in social roles as with the Char2Vec model and (ii) focusing on the final social roles of each character. We call the embedding model with the first approach ‘flow-oriented Story2Vec.’ This approach can reflect the context and flow of stories if the dynamics of character networks is well understood. Second, based on the final states of social roles, we can emphasize the denouement of stories, which is an overview of the static structure of the character networks. We name this model as ‘denouement-oriented Story2Vec.’ In addition, we suggest ‘unified Story2Vec’ as a combination of these two models. We evaluated the quality of vector representations generated by the proposed embedding models using movies in the real world.

[1]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[2]  Zvi Lotker The tale of two clocks , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[3]  GeunSik Jo,et al.  Social network analysis in a movie using character-net , 2012, Multimedia Tools and Applications.

[4]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[5]  Micha Elsner,et al.  Character-based kernels for novelistic plot structure , 2012, EACL.

[6]  Franco Moretti Network theory, plot analysis , 2011 .

[7]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[8]  Lanfen Lin,et al.  SeBPR: Semantics Enhanced Bayesian Personalized Ranking with Comparable Item Pairs , 2016, 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW).

[9]  Jason J. Jung,et al.  A Novel Method for Extracting Dynamic Character Network from Movie , 2016, BDTA.

[10]  Derek Greene,et al.  The Sense and Sensibility of Different Sliding Windows in Constructing Co-occurrence Networks from Literature , 2016, International Workshop on Computational History and Data-Driven Humanities.

[11]  Jai E. Jung,et al.  A computational model of transmedia ecosystem for story-based contents , 2017, Multimedia Tools and Applications.

[12]  Jason J. Jung,et al.  Integrating character networks for extracting narratives from multimodal data , 2019, Inf. Process. Manag..

[13]  Graham Alexander Sack,et al.  Simulating Plot: Towards a Generative Model of Narrative Structure , 2011, AAAI Fall Symposium: Complex Adaptive Systems.

[14]  Mark O. Riedl,et al.  Reading Between the Lines: Using Plot Graphs to Draw Inferences from Stories , 2016, ICIDS.

[15]  Svetha Venkatesh,et al.  A Probabilistic Framework for Extracting Narrative Act Boundaries and Semantics in Motion Pictures , 2005, Multimedia Tools and Applications.

[16]  Iryna Gurevych,et al.  Personality Profiling of Fictional Characters using Sense-Level Links between Lexical Resources , 2015, EMNLP.

[17]  Pablo Gervás,et al.  Composing narrative discourse for stories of many characters: A case study over a chess game , 2014, Lit. Linguistic Comput..

[18]  Tim Miller,et al.  Explanation in Artificial Intelligence: Insights from the Social Sciences , 2017, Artif. Intell..

[19]  Bliznuk Danil,et al.  Similarity Measures and Models for Movie Series Recommender System , 2018 .

[20]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[21]  Yang Liu,et al.  subgraph2vec: Learning Distributed Representations of Rooted Sub-graphs from Large Graphs , 2016, ArXiv.

[22]  Robert McKee,et al.  Story: Substance, Structure, Style, and the Principles of Screenwriting , 1997 .

[23]  Svetha Venkatesh,et al.  Formulating Film Tempo , 2002 .

[24]  Hao Wu,et al.  Hierarchical Neural Language Models for Joint Representation of Streaming Documents and their Content , 2015, WWW.

[25]  Markus H. Gross,et al.  Computer-assisted authoring of interactive narratives , 2015, I3D.

[26]  Jai E. Jung,et al.  Exploiting character networks for movie summarization , 2017, Multimedia Tools and Applications.

[27]  Joshua Reiss,et al.  Constructing narrative using a generative model and continuous action policies , 2017, CC-NLG@INLG.

[28]  Palash Goyal,et al.  Embedding Networks with Edge Attributes , 2018, HT.

[29]  Kathleen McKeown,et al.  Extracting Social Networks from Literary Fiction , 2010, ACL.

[30]  Jason J. Jung,et al.  MBTI-Based Collaborative Recommendation System: A Case Study of Webtoon Contents , 2015, ICCASA.

[31]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Schemas and their Participants , 2009, ACL.

[32]  Jason J. Jung,et al.  Explainable Movie Recommendation Systems by using Story-based Similarity , 2018, IUI Workshops.

[33]  Kenji Sagae,et al.  Combining Distributed Vector Representations for Words , 2015, VS@HLT-NAACL.

[34]  Mark O. Riedl,et al.  Event Representations for Automated Story Generation with Deep Neural Nets , 2017, AAAI.

[35]  Markus H. Gross,et al.  Computational narrative , 2017, SIGGRAPH Courses.

[36]  Graham Alexander Sack,et al.  Character Networks for Narrative Generation: Structural Balance Theory and the Emergence of Proto-Narratives , 2013, CMN.

[37]  Lina Yao,et al.  Adversarially Regularized Graph Autoencoder , 2018, IJCAI.

[38]  Kurt Mehlhorn,et al.  Weisfeiler-Lehman Graph Kernels , 2011, J. Mach. Learn. Res..

[39]  Boyang Li,et al.  Story Generation with Crowdsourced Plot Graphs , 2013, AAAI.

[40]  Baoxin Li,et al.  Video2vec: Learning semantic spatio-temporal embeddings for video representation , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[41]  Jai E. Jung,et al.  Sequence Clustering-based Automated Rule Generation for Adaptive Complex Event Processing , 2017, Future Gener. Comput. Syst..

[42]  Georges Linarès,et al.  Extraction and Analysis of Dynamic Conversational Networks from TV Series , 2018, Social Network Based Big Data Analysis and Applications.

[43]  Micha Elsner,et al.  Linguistic Issues in Language Technology – LiLT , 2015 .

[44]  Vasudeva Varma,et al.  Doc2Sent2Vec: A Novel Two-Phase Approach for Learning Document Representation , 2016, SIGIR.

[45]  Jason J. Jung,et al.  Measuring Character-based Story Similarity by Analyzing Movie Scripts , 2018, Text2Story@ECIR.

[46]  Jason J. Jung,et al.  Modeling affective character network for story analytics , 2018, Future Gener. Comput. Syst..

[47]  Hoang Long Nguyen,et al.  Multi-scaled Spatial Analytics on Discovering Latent Social Events for Smart Urban Services , 2018, J. Univers. Comput. Sci..

[48]  Yang Liu,et al.  graph2vec: Learning Distributed Representations of Graphs , 2017, ArXiv.

[49]  Jure Leskovec,et al.  node2vec: Scalable Feature Learning for Networks , 2016, KDD.

[50]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[51]  Snigdha Chaturvedi,et al.  Unsupervised Learning of Evolving Relationships Between Literary Characters , 2017, AAAI.

[52]  Pablo Gervás Computational Drafting of Plot Structures for Russian Folk Tales , 2015, Cognitive Computation.

[53]  James C. Lester,et al.  Narrative prose generation , 2001, Artif. Intell..

[54]  Vladimir Propp,et al.  Morphology of the folktale , 1959 .

[55]  Jai E. Jung,et al.  CoCharNet: Extracting Social Networks using Character Co-occurrence in Movies , 2015, J. Univers. Comput. Sci..

[56]  Steven Skiena,et al.  DeepWalk: online learning of social representations , 2014, KDD.

[57]  Pinar Yanardag,et al.  Deep Graph Kernels , 2015, KDD.

[58]  Ana Paiva,et al.  Metrics for Character Believability in Interactive Narrative , 2013, ICIDS.

[59]  Mark A. Finlayson,et al.  Computational Models of Narrative: Using Artificial Intelligence to Operationalize Russian Formalist and French Structuralist Theories , 2014, DH.

[60]  Derek Greene,et al.  Novel2Vec: Characterising 19th Century Fiction via Word Embeddings , 2016, AICS.

[61]  Georges Linarès,et al.  Narrative smoothing: Dynamic conversational network for the analysis of TV series plots , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[62]  Palash Goyal,et al.  Graph Embedding Techniques, Applications, and Performance: A Survey , 2017, Knowl. Based Syst..

[63]  Inderjeet Mani,et al.  Animation Motion in NarrativeML , 2016, CMN.

[64]  Wei-Ta Chu,et al.  RoleNet: Movie Analysis from the Perspective of Social Networks , 2009, IEEE Transactions on Multimedia.

[65]  Xavier Bost A storytelling machine? : Automatic video summarization: the case of TV series. (Une machine à raconter des histoires ? / Une machine à raconter des histoires ? : Résumé automatique de vidéos : le cas des séries TV) , 2016 .

[66]  Srini Narayanan,et al.  Toward a Computational Model of Narrative , 2010, AAAI Fall Symposium: Computational Models of Narrative.

[67]  Derek Greene,et al.  Discovering structure in social networks of 19th century fiction , 2016, WebSci.

[68]  Jason J. Jung,et al.  Character Network Embedding-based Plot Structure Discovery in Narrative Multimedia , 2019, WIMS2019.

[69]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Event Chains , 2008, ACL.

[70]  Mingzhe Wang,et al.  LINE: Large-scale Information Network Embedding , 2015, WWW.