Íslenskur Orðasjóður - Building a Large Icelandic Corpus

We introduce an Icelandic corpus of more than 250 million running words and describe the methodology used to build it. The resource is available for use free of charge. We provide automatically generated monolingual lexicon entries, comprising frequency statistics, samples of usage, co-occurring words and a graphical representation of the word's semantic neighbourhood.