A Diachronic Computational Lexical Resource for 800 Years of Swedish

The goal of the work presented in this chapter is to create a set of computational lexical resources that are interlinked at the level of lexical senses, using the persistent sense identifiers designed for SALDO, a lexical resource for Present-Day Swedish. In this way, the diverse linguistic information held in our individual lexical resources, modern and historical, becomes available to every resource for which the interlinking has been completed. Using this mechanism, we have devised a semantic search application for 19th-century fiction, which combines a morphological component for the language variety in question, Late Modern Swedish, with a lexical-semantic resource for Present-Day Swedish by way of the lexical sense identifiers. At present, we are extending and cross-linking the modern lexical resources and steadily integrating the 19th-century resource into this emerging diachronic lexical resource. We are also working on the intricate and challenging problem of incorporating the most deviant historical language variety, Old Swedish, into the resource, which we must do before we can truthfully call it a diachronic computational lexical resource for 800 years of Swedish.
