Automatic interpretation of the texts of chemical patent abstracts. 1. Lexical analysis and categorization