Conversion of Microsoft Word and OpenOffice formats into XML-like documents

This technical report discusses the conversion of the most frequently used document formats into a specified XML-like format. The conversion software package that was developed is presented and its use is described. It was created for the purpose of building the Slovak National Corpus of written language. The package can convert all file formats recognized by Microsoft Word and OpenOffice Writer, which are widely used text editors today.