Global Context and Geometric Priors for Effective Non-Local Self-Attention