Accurate histogram-based XML summarization

In this paper, we propose the use of histograms to characterize node set distributions in an XML document, which then can be recursively evaluated for query optimization tasks. We identify and deal with special cases for effectively using histograms to summarize structural aspects of XML documents. To reveal the potential of our approach, we perform comparative experiments on our native XML database management system called XTC.