Effective Clustering Schemes for XML Databases

Although clustering problems are in general NP-hard, much research effort on this problem has been invested in the areas of object-oriented databases (OODB) and relational databases systems (RDBMS). With the increasing popularity of XML, researchers have been focusing on various XML data management including query processing and optimization. However, the clustering issues for XML data storage have been disregarded in their work. This paper provides a preliminary study on data clustering for optimizing XML databases. Different clustering schemes are compared through a set of extensive experiments.