Parsing Screenplays for Extracting Social Networks from Movies

In this paper, we present a formalization of the task of parsing movie screenplays. While researchers have previously motivated the need for parsing movie screenplays, to the best of our knowledge, there is no work that has presented an evaluation for the task. Moreover, all the approaches in the literature thus far have been regular expression based. In this paper, we present an NLP and ML based approach to the task, and show that this approach outperforms the regular expression based approach by a large and statistically significant margin. One of the main challenges we faced early on was the absence of training and test data. We propose a methodology for using well structured screenplays to create training data for anticipated anomalies in the structure

[1]  Noah A. Smith,et al.  SEMAFOR: Frame Argument Resolution with Log-Linear Models , 2010, SemEval@ACL.

[2]  Wei-Ta Chu,et al.  RoleNet: treat a movie as a small society , 2007, MIR '07.

[3]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[4]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994, Structural analysis in the social sciences.

[5]  Denilson Barbosa,et al.  Identification of Speakers in Novels , 2013, ACL.

[6]  Owen Rambow,et al.  Automatic Extraction of Social Networks from Literary Text: A Case Study on Alice in Wonderland , 2013, IJCNLP.

[7]  Wei-Ta Chu,et al.  Movie Analysis Based on Roles' Social Network , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[8]  Wei-Ta Chu,et al.  RoleNet: Movie Analysis from the Perspective of Social Networks , 2009, IEEE Transactions on Multimedia.

[9]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[10]  Nevenka Dimitrova,et al.  Screenplay alignment for closed-system speaker identification and analysis of feature films , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[11]  Kathleen McKeown,et al.  Extracting Social Networks from Literary Fiction , 2010, ACL.

[12]  Owen Rambow,et al.  Automatic Detection and Classification of Social Events , 2010, EMNLP.

[13]  Owen Rambow,et al.  SINNET: Social Interaction Network Extractor from Text , 2013, IJCNLP.

[14]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[15]  Caroline Suen,et al.  Extraction and Analysis of Character Interaction Networks From Plays and Movies , 2013, DH.