Instance Segmentation with Cross-Modal Consistency