Properties of the singular value decomposition for efficient data clustering

We introduce some interesting properties of the singular value decomposition (SVD), and illustrate how they may be used in conjunction with the k-means algorithm for efficiently clustering a set of vectors. Specifically, we use the SVD to preprocess and sort the data vectors, and then use the k-means algorithm on the modified vectors. To illustrate the effectiveness of this approach, we compare it to the k-means algorithm without preprocessing and show that significant gains in clustering speed may be realized.

[1]  Stanley C. Ahalt,et al.  A hybrid DCT-SVD image-coding algorithm , 2002, IEEE Trans. Circuits Syst. Video Technol..

[2]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[3]  SangKeun Lee,et al.  Efficient scene segmentation for content-based indexing in the compressed domain , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[4]  Boon-Lock Yeo,et al.  Rapid scene analysis on compressed video , 1995, IEEE Trans. Circuits Syst. Video Technol..

[5]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[6]  Young-Min Kim,et al.  Fast Scene Change Detection using Direct Feature Extraction from MPEG Compressed Videos , 2000, IEEE Trans. Multim..