论文信息 - Content Based Image Retrieval system using Wavelet Transformation and multiple input multiple task Deep Autoencoder

Content Based Image Retrieval system using Wavelet Transformation and multiple input multiple task Deep Autoencoder

In this paper, we propose an algorithm for a Content Based Image Retrieval (CBIR) system based on Wavelet Transformation and Deep Autoencoder (DAE). For the proposed algorithm, the image is first processed by wavelet transform and decomposed into wavelet coefficients. The wavelet coefficients then become the input for a multiple input multiple task deep autoencoder (MIMT-DAE). In our design, only the approximation coefficients (CA) and diagonal detail coefficients (CD) are used. The result of retrieval performance is tested on the MNIST handwriting data base. The testing results show that the combination of wavelet transformation and MIMT-DAE increases the performance of image retrieval for shape and texture compared to a traditional single input single task deep autoencoder with far fewer training parameters required.

Brian Nutter | Xiangyuan Zhao

[1] Cheng-Yuan Liou,et al. Modeling word perception using the Elman network , 2008, Neurocomputing.

[2] Edward K. Wong,et al. Deepshape: Deep learned shape descriptor for 3D shape matching and retrieval , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[4] Geoffrey E. Hinton,et al. Using very deep autoencoders for content-based image retrieval , 2011, ESANN.

[5] Kobus Barnard,et al. Evaluating image retrieval , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[6] Du-Sik Park,et al. Rotating your face using multi-task deep neural network , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Yann LeCun,et al. The mnist database of handwritten digits , 2005 .

[8] Yoshua. Bengio,et al. Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..

[9] Antoni B. Chan,et al. Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[10] Geoffrey Zweig,et al. An introduction to computational networks and the computational network toolkit (invited talk) , 2014, INTERSPEECH.