Automated Compositional Change Detection in Saxo Grammaticus' Gesta Danorum

Saxo Grammaticus’ medieval source Gesta Danorum (“Deeds of the Danes”) represents the beginning of the modern historical research in Denmark. The bipartite composition of Gesta Danorum has however been subject to much academic debate. In particular the nature and location of a transition between early Pre-Christian and late Christian content have given rise to two competing accounts. In this paper, we argue that the debate can be represented as a problem of intratextual dynamics and we combine models from Information Retrieval and Natural Language Processing with techniques for time series analysis in order to reevaluate the debate. Results indicate that the transition is gradual, starting in book eight and ending in book ten, but that a point-like interpretation is possible in book nine. We argue that the approach exemplifies scalable“automated close reading”, which has multiple applications in text-based historical research. Keywords— Change Detection, Cultural Heritage, Medieval Literature, Text Analysis, Topic Modeling

[1]  J. Hodges,et al.  The effects of very early Alzheimer's disease on the characteristics of writing by a renowned author. , 2004, Brain : a journal of neurology.

[2]  Xiaojin Zhu,et al.  How Creative is Your Writing? , 2009 .

[3]  Dorte Haltrup Hansen,et al.  Encompassing a spectrum of LT users in the CLARIN-DK Infrastructure , 2014, LREC.

[4]  Graeme Hirst,et al.  Vocabulary Changes in Agatha Christie's Mysteries as an Indication of Dementia: A Case Study , 2009 .

[5]  Steven Skiena,et al.  Statistically Significant Detection of Linguistic Change , 2014, WWW.

[6]  D R Wekstein,et al.  Linguistic ability in early life and cognitive function and Alzheimer's disease in late life. Findings from the Nun Study. , 1996, JAMA.

[7]  Simon DeDeo,et al.  Exploration and exploitation of Victorian science in Darwin’s reading notebooks , 2015, Cognition.

[8]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[9]  David M. Blei,et al.  Dynamic Bernoulli Embeddings for Language Evolution , 2017, ArXiv.

[10]  W. A. Taylor Change-Point Analysis : A Powerful New Tool For Detecting Changes , 2000 .

[11]  Bei Yu,et al.  Exploring The Relationship Between Mood and Creativity in Rock Lyrics , 2011, ISMIR.

[12]  Yanhui Zhang Entropic evolution of lexical richness of homogeneous texts over time: A dynamic complexity perspective , 2015, J. Lang. Model..

[13]  Visar Berisha,et al.  Tracking discourse complexity preceding Alzheimer's disease diagnosis: a case study comparing the press conferences of Presidents Ronald Reagan and George Herbert Walker Bush. , 2015, Journal of Alzheimer's disease : JAD.

[14]  E. M. Wright,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[15]  Christopher M. Danforth,et al.  The emotional arcs of stories are dominated by six basic shapes , 2016, EPJ Data Science.

[16]  Jianbo Gao,et al.  A multiscale theory for the dynamical evolution of sentiment in novels , 2016, 2016 International Conference on Behavioral, Economic and Socio-cultural Computing (BESC).

[17]  John D. Lafferty,et al.  Dynamic topic models , 2006, ICML.

[18]  Simon DeDeo,et al.  Individuals, institutions, and innovation in the debates of the French Revolution , 2017, Proceedings of the National Academy of Sciences.

[19]  H. S. Heaps,et al.  Information retrieval, computational and theoretical aspects , 1978 .

[20]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[21]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .