An Annotated Corpus of Film Dialogue for Learning and Characterizing Character Style

Interactive story systems often involve dialogue with virtual dramatic characters, yet to date most character dialogue is written by hand. One way to ease the authoring process is to generate dialogue (semi-)automatically, based on film characters. We extract features from the dialogue of film characters in leading roles and use these character-based features to drive our language generator to produce interesting utterances. This paper describes a corpus of film dialogue that we collected from the IMSDb archive and annotated for linguistic structures and character archetypes. We extract different sets of features using external resources such as LIWC and SentiWordNet as well as our own scripts. Automating feature extraction also makes it easier to add further film scripts. We briefly show how film characters can be represented by models learned from the corpus, how the models can be distinguished along categories such as gender and film genre, and how they can be applied to a language generator to produce utterances that can be perceived as similar to the intended character model.
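As a rough illustration of the per-character feature extraction described above, the following minimal sketch groups utterances by speaker and computes a few surface features. It is not the authors' pipeline: the function name `character_features`, the three features, and the toy dialogue lines are all assumptions made for illustration, and lexicon-based features from LIWC or SentiWordNet are deliberately left out.

```python
from collections import defaultdict


def character_features(lines):
    """Compute simple per-character features.

    `lines` is an iterable of (character, utterance) pairs. The features
    here are illustrative assumptions, not the feature set used in the paper.
    """
    # Group every utterance under the character who speaks it.
    by_character = defaultdict(list)
    for character, utterance in lines:
        by_character[character].append(utterance)

    features = {}
    for character, utterances in by_character.items():
        # Whitespace tokenization keeps the sketch dependency-free.
        tokens = [tok for u in utterances for tok in u.split()]
        questions = sum(u.strip().endswith("?") for u in utterances)
        features[character] = {
            "num_utterances": len(utterances),
            "avg_tokens_per_utterance": len(tokens) / len(utterances),
            "question_ratio": questions / len(utterances),
        }
    return features


# Invented toy dialogue, not taken from any film script in the corpus.
sample = [
    ("HERO", "Where are we going?"),
    ("SIDEKICK", "I do not know."),
    ("HERO", "Then follow me."),
]

if __name__ == "__main__":
    for name, feats in character_features(sample).items():
        print(name, feats)
```

Feature dictionaries of this shape could then be compared across groups of characters (for example by gender or genre) or passed as parameters to a generator, which is the role the learned character models play in the abstract.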
