Learning narrative structure from annotated folktales

Narrative structure is an ubiquitous and intriguing phenomenon. By virtue of structure we recognize the presence of Villainy or Revenge in a story, even if that word is not actually present in the text. Narrative structure is an anvil for forging new artificial intelligence and machine learning techniques, and is a window into abstraction and conceptual learning as well as into culture and its influence on cognition. I advance our understanding of narrative structure by describing Analogical Story-Merging (ASM), a new machine learning algorithm that can extract culturally-relevant plot patterns from sets of folktales. I demonstrate that ASM can learn a substantive portion of Vladimir Propp's influential theory of the structure of folktale plots. The challenge was to take descriptions at one semantic level, namely, an event timeline as described in folktales, and abstract to the next higher level: structures such as Villainy, Struggle-Victory, and Reward. ASM is based on Bayesian Model Merging, a technique for learning regular grammars. I demonstrate that, despite ASM's large search space, a carefully-tuned prior allows the algorithm to converge, and furthermore it reproduces Propp's categories with a chance-adjusted Rand index of 0.511 to 0.714. Three important categories are identified with F-measures above 0.8. The data are 15 Russian folktales, comprising 18,862 words. a. subset of Propp's original tales. This subset was annotated for 18 aspects of meaning by 12 annotators using the Story Workbench, a general text-annotation tool I developed for this work. Each aspect was doubly-annotated and adjudicated at inter-annotator F-measures that cluster around 0.7 to 0.8. It is the largest, most deeply-annotated narrative corpus assembled to date. The work has significance far beyond folktales. First, it points the way toward important applications in many domains, including information retrieval, persuasion and negotiation, natural language understanding and generation, and computational creativity. Second, abstraction from natural language semantics is a skill that underlies many cognitive tasks, and so this work provides insight into those processes. Finally, the work opens the door to a computational understanding of cultural influences on cognition and understanding cultural differences as captured in stories.

[1]  Patrick Henry Winston,et al.  Learning and Reasoning by Analogy: The Details , 1979 .

[2]  김종식 동아시아 문화와 경영, Geert Hofstede, Culture's Consequences: International Differences in Work-Related Values (Beverly Hills, CA:Sage Publications, 1980) ; and Cultures and Organizations : Software of the Mind (New York:McGraw-Hill, 1997) , 1999 .

[3]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[4]  D. Berend,et al.  IMPROVED BOUNDS ON BELL NUMBERS AND ON MOMENTS OF SUMS OF RANDOM VARIABLES , 2000 .

[5]  Manfred Pinkal,et al.  Learning Script Knowledge with Web Experiments , 2010, ACL.

[6]  Patrick Henry Winston,et al.  Learning and reasoning by analogy , 1980, CACM.

[7]  John B. Black,et al.  STORY UNDERSTANDING AS PROBLEM-SOLVING * , 1980 .

[8]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Schemas and their Participants , 2009, ACL.

[9]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[10]  C. M. Sperberg-McQueen,et al.  Guidelines for electronic text encoding and interchange , 1994 .

[11]  Kent Beck,et al.  Contributing to Eclipse - principles, patterns, and plug-ins , 2003, The Eclipse series.

[12]  Johanna D. Moore,et al.  Automatic Analysis of Plot for Story Rewriting , 2004, EMNLP.

[13]  Roger C. Schank,et al.  SCRIPTS, PLANS, GOALS, AND UNDERSTANDING , 1988 .

[14]  F. C. Bartlett,et al.  Some Experiments on the Reproduction of Folk-Stories , 1920 .

[15]  C. Lévi-Strauss The Structural Study of Myth , 1955 .

[16]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Event Chains , 2008, ACL.

[17]  Andreas Stolcke,et al.  Inducing Probabilistic Grammars by Bayesian Model Merging , 1994, ICGI.

[18]  Mark Liberman,et al.  A formal framework for linguistic annotation , 1999, Speech Commun..

[19]  Mark A. Finlayson,et al.  The Prevalence of Descriptive Referring Expressions in News and Narrative , 2010, ACL.

[20]  G. Lakoff Structural Complexity in Fairy Tales , 1972 .

[21]  Alan Dundes,et al.  From Etic to Emic Units in the Structural Study of Folktales , 1962 .

[22]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[23]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[24]  Wendy G. Lehnert,et al.  Plot Units and Narrative Summarization , 1981, Cogn. Sci..

[25]  A. Dundes The Morphology of North American Indian Folktales , 1980 .

[26]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[27]  C. M. Sperberg-McQueen,et al.  Guidelines for electronic text encoding and interchange : TEI P4 , 2002 .

[28]  Christiane Fellbaum,et al.  The Manually Annotated Sub-Corpus: A Community Resource for and by the People , 2010, ACL.

[29]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[30]  N. S. Johnson,et al.  Remembrance of things parsed: Story structure and recall , 1977, Cognitive Psychology.

[31]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[32]  Mark A. Finlayson,et al.  jMWE: A Java Toolkit for Detecting Multi-Word Expressions , 2011, MWE@ACL.

[33]  R. Ratcliff,et al.  Memory connections between thematically similar episodes. , 1986, Journal of experimental psychology. Learning, memory, and cognition.

[34]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[35]  James Gosling,et al.  The Java Language Specification, 3rd Edition , 2005 .

[36]  Mark A. Finlayson,et al.  Analogical Retrieval via Intermediate Features: The Goldilocks Hypothesis , 2006 .

[37]  Guy L. Steele,et al.  Java(TM) Language Specification, The (3rd Edition) (Java (Addison-Wesley)) , 2005 .

[38]  James Pustejovsky,et al.  TimeML: Robust Specification of Event and Temporal Expressions in Text , 2003, New Directions in Question Answering.

[39]  B. Colby A Partial Grammar of Eskimo Folktales1 , 1973 .

[40]  Georges Polti The Thirty-Six Dramatic Situations , 1916 .

[41]  Brian Falkenhainer,et al.  The Structure-Mapping Engine: Algorithm and Examples , 1989, Artif. Intell..

[42]  C. Vogler The writer's journey : mythic structure for storytellers and screenwriters , 1999 .

[43]  Christopher R. Johnson,et al.  Background to Framenet , 2003 .

[44]  C. Schunn,et al.  Priming, analogy, and awareness in complex reasoning , 1996, Memory & cognition.

[45]  M. González Rodríguez,et al.  Proceedings of the third International Conference on Language Resources and Evaluation , 2002 .

[46]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[47]  Lee Haring,et al.  The Types of International Folktales: A Classification and Bibliography. Based on the System of Antti Aarne and Stith Thompson (review) , 2006 .

[48]  H. J. E. PEAKE,et al.  The Study of Man , 1895, Nature.

[49]  Robert A. Segal Joseph Campbell's Theory of Myth , 2013 .

[50]  Joseph Campbell,et al.  The Hero with a Thousand Faces , 1949 .

[51]  E. Hammel,et al.  LANGUAGE, THOUGHT, AND CULTURE: Advances in the Study of Cognition , 1985 .

[52]  G. Rota The Number of Partitions of a Set , 1964 .

[53]  Eneko Agirre,et al.  Word Sense Disambiguation: Algorithms and Applications , 2007 .

[54]  Colin de la Higuera,et al.  Grammatical Inference: Learning Automata and Grammars , 2010 .

[55]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[56]  Hamish Cunningham,et al.  GATE-a General Architecture for Text Engineering , 1996, COLING.

[57]  G. Hofstede,et al.  Culture′s Consequences: International Differences in Work-Related Values , 1980 .

[58]  Pablo Gervás,et al.  A Case Based Reasoning Approach to Story Plot Generation , 2004, ECCBR.

[59]  Rada Mihalcea,et al.  Word Sense Disambiguation , 2015, Encyclopedia of Machine Learning.

[60]  Barbara Nasto,et al.  The power of myth , 1996, Nature Medicine.

[61]  F. Bartlett,et al.  Remembering: A Study in Experimental and Social Psychology , 1932 .

[62]  Matthew Young-Lai Grammar Inference , 2009, Encyclopedia of Database Systems.

[63]  Vladimir Propp,et al.  Morphology of the folktale , 1959 .

[64]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[65]  Stephen M. Omohundro,et al.  Best-First Model Merging for Dynamic Learning and Recognition , 1991, NIPS.

[66]  H. Spencer-Oatey Culturally Speaking: Managing Rapport Through Talk Across Cultures , 2004 .

[67]  Hans-Jörg Uther,et al.  The types of international folktales : a classification andbibliography. , 2004 .

[68]  Daniel Jurafsky,et al.  Support Vector Learning for Semantic Argument Classification , 2005, Machine Learning.

[69]  Mark Steedman,et al.  Example Selection for Bootstrapping Statistical Parsers , 2003, NAACL.

[70]  J. Pitman Combinatorial Stochastic Processes , 2006 .

[71]  Georgios Paliouras,et al.  Ellogon: A New Text Engineering Platform , 2002, LREC.

[72]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[73]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[74]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[75]  J. L. Fischer The Sociopsychological Analysis of Folktales , 1963, Current Anthropology.

[76]  Patrick Henry Winston,et al.  The Strong Story Hypothesis and the Directed Perception Hypothesis , 2011, AAAI Fall Symposium: Advances in Cognitive Systems.

[77]  Thierry Declerck,et al.  Integration of Linguistic Markup into Semantic Models of Folk Narratives: The Fairy Tale Use Case , 2010, LREC.