Statistical aspects of reliability in language testing