Less than Few: Self-Shot Video Instance Segmentation