A corpus platform of Indonesian academic language

Abstract The Indonesian language is the language of education and the language to unify the 701 ethnic languages in Indonesia. To document this language and to determine how it is used in academic texts, a corpus database needs to be collected together with a corpus platform to explore the corpus. This paper presents the features and usage of the first and freely available corpus platform of Indonesian Academic language. The corpus was compiled from over five million word tokens comprising articles from nationally accredited journals and theses from reputable universities. The main features of the software are context, collocate, and frequency. The corpus platform will be an essential resource for linguists, lexicographers, and teachers.