Capturing Structural Locality in Non-parametric Language Models