Data storage and query processing for structured document databases

In this paper, we discuss how to integrate HTML documents into object databases. We compare two policies for the management of structural information: (1) static extraction and (2) dynamic extraction. We also propose a system design for the effective implementation of full-text searching and structural searching for an HTML structured document database. We conclude that the query processors for full-text searching and structural searching can be combined easily in the dynamic extraction approach.