Joint Representation Learning and Keypoint Detection for Cross-View Geo-Localization