Using the revised EM algorithm to remove noisy data for improving the one-against-the-rest method in binary text classification

Automatic text classification is the task of assigning predefined categories to free-text documents, thereby reducing the manual labor required by traditional classification met...