Sources of variation in news vocabulary: a comparative analysis