Understanding memories of the Holocaust - A new approach to neural networks in the digital humanities

This article addresses an important challenge in artificial intelligence research in the humanities, which has impeded progress with supervised methods. It introduces a novel method to creating test collections from smaller subsets. This method is based on what we will introduce as distant supervision’ and will allow us to improve computational modelling in the digital humanities by including new methods of supervised learning. Using recurrent neural networks, we generated a training corpus and were able to train a highly accurate model that qualitatively and quantitatively improved a baseline model. To demonstrate our new approach experimentally, we employ a real-life research question based on existing humanities collections. We use neural network based sentiment analysis to decode Holocaust memories and present a methodology to combine supervised and unsupervised sentiment analysis to analyse the oral history interviews of the United States Holocaust Memorial Museum. Finally, we employed three advanced methods of computational semantics. These helped us decipher the decisions by the neural network and understand, for instance, the complex sentiments around family memories in the testimonies.

[1]  Petko Bogdanov,et al.  Introduction—Topic models: What they are and why they matter , 2013 .

[2]  Miodrag Lovric,et al.  International Encyclopedia of Statistical Science , 2011 .

[3]  M. Jahoda,et al.  Predicting the past , 1982, Nature.

[4]  Vadlamani Ravi,et al.  A survey on opinion mining and sentiment analysis: Tasks, approaches and applications , 2015, Knowl. Based Syst..

[5]  David M. Berry,et al.  Understanding digital humanities , 2012 .

[6]  Justin Grimmer,et al.  Text as Data: The Promise and Pitfalls of Automatic Content Analysis Methods for Political Texts , 2013, Political Analysis.

[7]  Sebastian Raschka,et al.  Python Machine Learning , 2015 .

[8]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[9]  Tobias Blanke,et al.  Integrating Holocaust Research , 2013 .

[10]  Deepika Bahri,et al.  Predicting the Past , 2004 .

[11]  P. Thompson,et al.  The Voice of the Past: Oral History , 1978 .

[12]  RaviVadlamani,et al.  A survey on opinion mining and sentiment analysis , 2015 .

[13]  Greta Franzini,et al.  On Close and Distant Reading in Digital Humanities: A Survey and Future Challenges , 2015, EuroVis.

[14]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[15]  Hannah Arendt,et al.  Eichmann in Jerusalem , 1963 .

[16]  Christopher Zorn,et al.  Corpus-based dictionaries for sentiment analysis of specialized vocabularies , 2019, Political Science Research and Methods.

[17]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .