On the Relationship between Self-Attention and Convolutional Layers