An Image Classifier Can Suffice For Video Understanding