论文信息 - Spatial Chirp-Z Transformer Networks

Spatial Chirp-Z Transformer Networks

Convolutional Neural Networks are often used for computer vision solutions, because of their inherent modeling of the translation invariance in images. In this paper, we propose a new module to model rotation and scaling invariances in images. To do this, we rely on the chirp-Z transform to perform the desired translation, rotation and scaling in the frequency domain. This approach has the benefit that it scales well and that it is differentiable because of the computationally cheap sincinterpolation.

Sander Dieleman | Jonas Degrave | Joni Dambre | Francis Wyffels

[1] C. Burrus,et al. DFT/FFT and Convolution Algorithms: Theory and Implementation , 1991 .

[2] Yoshua Bengio,et al. Convolutional networks for images, speech, and time series , 1998 .

[3] Ole Winther,et al. Recurrent Spatial Transformer Networks , 2015, ArXiv.

[4] Patrice Y. Simard,et al. Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[5] E. V. Vlasov,et al. Efficient implementation of the image rotation method using chirp Z-transform , 2014, Pattern Recognition and Image Analysis.

[6] L. Rabiner,et al. The chirp z-transform algorithm and its application , 1969 .

[7] R W Cox,et al. Rotation of NMR images using the 2D chirp‐z transform , 1999, Magnetic resonance in medicine.

[8] Alan R. Jones,et al. Fast Fourier Transform , 1970, SIGP.

[9] Colin Raffel,et al. Lasagne: First release. , 2015 .

[10] Andrew Zisserman,et al. Spatial Transformer Networks , 2015, NIPS.

[11] Razvan Pascanu,et al. Theano: new features and speed improvements , 2012, ArXiv.