A Streaming Algorithm for k-Means with Approximate Coreset

For computing the k-means clustering of the streaming and distributed big sparse data, we present an algorithm to obtain the sparse coreset for the k-means in polynomial time. This algorithm is mai...