SVIT: Scaling up Visual Instruction Tuning