Multi-scale attention encoder for street-to-aerial image geo-localization