Subtree-based XML Data Integration Using Leaf-clustering Based Approximate XML Join Algorithms