Exploiting probabilistic topic models to improve text categorization under class imbalance

In text categorization, it is quite often that the numbers of documents in different categories are different, i.e., the class distribution is imbalanced. We propose a unique approach to improve te...