Temporal consistent portrait video segmentation