Cluster generators for large high-dimensional data sets with large numbers of clusters