Duplicate detection in XML Web data