Some Frequency-Based Differences between Spoken and Written Swedish

This is a report on the differences in word frequency found between two Swedish corpora: a transcribed spoken language corpus of 276,391 words and a written language corpus of 271,216 words. The spoken language corpus contains material from 14 activity types, while the written language corpus contains material from novels (40%) and newspapers (60%). The report expands and continues earlier work on differences between spoken and written language, e.g. Jörgensen (1976) or Biber (1988). The word frequencies of the two corpora are described and presented more fully in Allwood (1996). Below, I present some of the main differences between the corpora.