MIL-VT: Multiple Instance Learning Enhanced Vision Transformer for Fundus Image Classification