Accelerating large scale centroid-based clustering with locality sensitive hashing