A divisive information theoretic feature clustering algorithm for text classification

High dimensionality of text can be a deterrent in applying complex learners such as Support Vector Machines to the task of text classification. Feature clustering is a powerful alternative to featu...