Finding Structural Knowledge in Multimodal-BERT