RefineVIS: Video Instance Segmentation with Temporal Attention Refinement