The structured information manager (SIM)

SIM, the Structured Information Manager, is an information retrieval system which is designed to manage multi-gigabyte collections of documents containing text, images, and other forms of data, storing them natively in SGML, XML, MARC, RTF, and ASCII formats. It is a fully-fledged system that provides integrated support for efficient ranked full text, boolean, and structural querying via a robust database server capable of practically managing multi-gigabyte document collections.