GuCNet: A Guided Clustering-based Network for Improved Classification

We deal with the problem of semantic classification of challenging and highly-cluttered dataset. We present a novel, and yet a very simple classification technique by leveraging the ease of classifiability of any existing well separable dataset for guidance. Since the guide dataset which may or may not have any semantic relationship with the experimental dataset, forms well separable clusters in the feature set, the proposed network tries to embed class-wise features of the challenging dataset to those distinct clusters of the guide set, making them more separable. Depending on the availability, we propose two types of guide sets: one using texture (image) guides and another using prototype vectors representing cluster centers. Experimental results obtained on the challenging benchmark RSSCN, LSUN, and TU-Berlin datasets establish the efficacy of the proposed method as we outperform the existing state-of-the-art techniques by a considerable margin.

[1]  Cordelia Schmid,et al.  Label-Embedding for Image Classification , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Muhammad Tariq Mahmood,et al.  Modeling global geometric spatial information for rotation invariant classification of satellite images , 2019, PloS one.

[3]  Avik Bhattacharya,et al.  Siamese graph convolutional network for content based remote sensing image retrieval , 2019, Comput. Vis. Image Underst..

[4]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Xiangtao Zheng,et al.  A Deep Scene Representation for Aerial Scene Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Limin Wang,et al.  Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs , 2016, IEEE Transactions on Image Processing.

[8]  Alex Krizhevsky,et al.  Learning Multiple Layers of Features from Tiny Images , 2009 .

[9]  Marc Alexa,et al.  How do humans sketch objects? , 2012, ACM Trans. Graph..

[10]  Edoardo Pasolli,et al.  Active-Metric Learning for Classification of Remotely Sensed Hyperspectral Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Sebastian Tschiatschek,et al.  Maximum Margin Bayesian Network Classifiers , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Rehan Ashraf,et al.  A Novel Discriminating and Relative Global Spatial Image Representation with Applications in CBIR , 2018, Applied Sciences.

[13]  Tong Zhang,et al.  Deep Learning Based Feature Selection for Remote Sensing Scene Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[14]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[15]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[16]  Ameya Prabhu,et al.  Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory , 2018, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[17]  Yinda Zhang,et al.  LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop , 2015, ArXiv.

[18]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[19]  Zeynep Akata,et al.  Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-Based Image Retrieval , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Wai-tian Tan,et al.  Learning Geographically Distributed Data for Multiple Tasks Using Generative Adversarial Networks , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[21]  Simon Haykin,et al.  GradientBased Learning Applied to Document Recognition , 2001 .

[22]  Xiaochun Cao,et al.  Learning Structural Representations via Dynamic Object Landmarks Discovery for Sketch Recognition and Retrieval , 2019, IEEE Transactions on Image Processing.

[23]  Subhasis Chaudhuri,et al.  Structure Aligning Discriminative Latent Embedding for Zero-Shot Learning , 2018, BMVC.