Approximate single linkage cluster analysis of large data sets in high-dimensional spaces