Fine-grained Collaborative K-Means Clustering

We present a collaborative clustering algorithm of fine-grained images, where the subtle inter-class differences are represented using collaborative filters. Cluster centers are represented as optimal weighted collaboration of data points, and the optimal weight matrix is analytically obtained. This may be viewed as a generalization of the K-means clustering algorithm, where these weights would be unity. We also introduce a matrix inversion scheme that allows us to scale up collaborative representations by orders of magnitude and this allows us to apply the scheme when the number of data points is large. Collaborative clustering outperforms K-means and a few of its popular variants K-medians, K-modes and K-medoids. It also outperforms DBSCAN and its recent variation DSets-DBSCAN. We have chosen species recognition (bird and butterflies) as a representative fine-grained image categorization problem, though the proposed algorithm is a general approach applicable to other similar tasks. We also introduce a new benchmark fine-grained image dataset, that of Indian endemic butterfly species (Titli.vl), which is available through the corresponding author.

[1]  James Bailey,et al.  Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance , 2010, J. Mach. Learn. Res..

[2]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[3]  Cordelia Schmid,et al.  Semi-Local Affine Parts for Object Recognition , 2004, BMVC.

[4]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[5]  Joachim Denzler,et al.  Fine-grained Recognition Datasets for Biodiversity Analysis , 2015, ArXiv.

[6]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  Umapada Pal,et al.  A Generalised Formulation for Collaborative Representation of Image Patches (GP-CRC) , 2017, BMVC.

[8]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[9]  Hae-Sang Park,et al.  A simple and fast algorithm for K-medoids clustering , 2009, Expert Syst. Appl..

[10]  Dan Frankowski,et al.  Collaborative Filtering Recommender Systems , 2007, The Adaptive Web.

[11]  Xuelong Li,et al.  DSets-DBSCAN: A Parameter-Free Clustering Algorithm , 2016, IEEE Transactions on Image Processing.

[12]  Cordelia Schmid,et al.  A maximum entropy framework for part-based texture and object recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Umapada Pal,et al.  Collaborative representation based fine-grained species recognition , 2016, 2016 International Conference on Image and Vision Computing New Zealand (IVCNZ).

[15]  Yuning Chai,et al.  Advances in fine-grained visual categorization , 2015 .

[16]  Paul E. Green,et al.  K-modes Clustering , 2001, J. Classif..

[17]  Lei Zhang,et al.  Sparse representation or collaborative representation: Which helps face recognition? , 2011, 2011 International Conference on Computer Vision.

[18]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[19]  Pierre Gançarski,et al.  Collaborative clustering: Why, when, what and how , 2018, Inf. Fusion.

[20]  Umapada Pal,et al.  LOOP Descriptor: Local Optimal-Oriented Pattern , 2017, IEEE Signal Processing Letters.