A Unified Sequence Interface for Vision Tasks