A prototype multilingual document browser for Ancient Greek texts : Digital Libraries

This paper describes a prototype multilingual keyword extraction and information browsing system for texts written in Classical Greek. This system automatically extracts keywords from Greek texts using a tf x idf keyword discovery routine, clusters documents into thematically coherent groups based on these keywords, translates the keywords into English, and presents this information in two different formats so that users with limited knowledge of Ancient Greek can browse the documents and orient themselves to important concepts in the collections of a digital library.