Analysis and conversion of documents

This paper deals with the use of grammatical formalisms to recognise the physical and the logical structures of a composite document. We propose a new system based on W-grammar for document analysis and conversions. As an application the system identifies printed "summaries" and converts them into machine readable form such as the hypertext markup language (HTML) which is handled by Navigator software.