Large-scale Discriminative n-gram Language Models for Statistical Machine Translation