Visual Grounding of Spatial Relationships for Failure Detection