TEXT CATEGORIZATION USING LEARNED DOCUMENT FEATURES